Intel® Extension for PyTorch\* extends PyTorch\* with up-to-date features and optimizations for an extra performance boost on Intel hardware. Optimizations take advantage of Intel® Advanced Vector Extensions 512 (Intel® AVX-512) Vector Neural Network Instructions (VNNI) and Intel® Advanced Matrix Extensions (Intel® AMX) on Intel CPUs, as well as Intel X<sup>e</sup> Matrix Extensions (XMX) AI engines on Intel discrete GPUs. Moreover, Intel® Extension for PyTorch\* provides easy GPU acceleration for Intel discrete GPUs through the PyTorch\* `xpu` device.
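As a minimal sketch of what this looks like in practice (assuming the extension and torchvision are installed; the model here is only a placeholder), enabling the optimizations takes an extra import and one `ipex.optimize` call, with the `xpu` device used on discrete GPUs:

```python
import torch
import torchvision.models as models
import intel_extension_for_pytorch as ipex

model = models.resnet50(weights=None).eval()
data = torch.rand(1, 3, 224, 224)

# Apply the extension's CPU optimizations (operator fusion, memory layout, etc.)
model = ipex.optimize(model, dtype=torch.bfloat16)

# On an Intel discrete GPU, move model and data to the xpu device instead:
# model = model.to("xpu")
# data = data.to("xpu")

with torch.no_grad(), torch.cpu.amp.autocast(enabled=True):
    output = model(data)
```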
docs/tutorials/getting_started.md (+2 -2)

@@ -1,6 +1,6 @@
# Quick Start
-The following instructions assume you have installed the Intel® Extension for PyTorch\*. For installation instructions, refer to [Installation](../../../index.html#installation?platform=cpu&version=v2.7.0%2Bcpu).
+The following instructions assume you have installed the Intel® Extension for PyTorch\*. For installation instructions, refer to [Installation](../../../index.html#installation?platform=cpu&version=v2.8.0%2Bcpu).
To start using the Intel® Extension for PyTorch\* in your code, you need to make the following changes:
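In most eager-mode scripts, the changes referred to above amount to a couple of lines. A sketch, assuming a CPU installation of the extension and an already-constructed `model` (and, for training, `optimizer`):

```python
import torch
import intel_extension_for_pytorch as ipex

# Inference: optimize the model for the target dtype.
model.eval()
model = ipex.optimize(model, dtype=torch.bfloat16)

# Training: pass the optimizer as well so both are optimized together.
# model, optimizer = ipex.optimize(model, optimizer=optimizer, dtype=torch.bfloat16)
```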
@@ -157,4 +157,4 @@ with torch.inference_mode(), torch.cpu.amp.autocast(enabled=amp_enabled):
print(gen_text, total_new_tokens, flush=True)
```
-More LLM examples, including usage of low precision data types, are available in the [LLM Examples](https://github.com/intel/intel-extension-for-pytorch/tree/main/examples/cpu/llm) section.
+More LLM examples, including usage of low precision data types, are available in the [LLM Examples](https://github.com/intel/intel-extension-for-pytorch/tree/release/2.8/examples/cpu/llm) section.
-Select your preferences and follow the installation instructions provided on the [Installation page](../../../index.html#installation?platform=cpu&version=v2.7.0%2Bcpu).
+Select your preferences and follow the installation instructions provided on the [Installation page](../../../index.html#installation?platform=cpu&version=v2.8.0%2Bcpu).
After successful installation, refer to the [Quick Start](getting_started.md) and [Examples](examples.md) sections to start using the extension in your code.
-**NOTE:** For detailed instructions on installing and setting up the environment for Large Language Models (LLM), as well as example scripts, refer to the [LLM best practices](https://github.com/intel/intel-extension-for-pytorch/tree/main/examples/cpu/llm).
+**NOTE:** For detailed instructions on installing and setting up the environment for Large Language Models (LLM), as well as example scripts, refer to the [LLM best practices](https://github.com/intel/intel-extension-for-pytorch/tree/release/2.8/examples/cpu/llm).
docs/tutorials/llm.rst (+1 -1)
@@ -30,7 +30,7 @@ Verified for distributed inference mode via DeepSpeed
*Note*: The above verified models (including other models in the same model family, like "codellama/CodeLlama-7b-hf" from the LLAMA family) are well supported with all optimizations, such as indirect access KV cache, fused ROPE, and customized linear kernels. Work is in progress to better support the models in the tables with various data types. In addition, more models will be optimized in the future.
-Please check the `LLM best known practice <https://github.com/intel/intel-extension-for-pytorch/tree/main/examples/cpu/llm>`_ for instructions to install and set up the environment, as well as example scripts.
+Please check the `LLM best known practice <https://github.com/intel/intel-extension-for-pytorch/tree/release/2.8/examples/cpu/llm>`_ for instructions to install and set up the environment, as well as example scripts.
Module Level Optimization API for customized LLM (Prototype)
docs/tutorials/llm/llm_optimize.md (+2 -4)
@@ -9,12 +9,10 @@ This API currently supports inference workloads of certain models.
API documentation is available at [API Docs page](https://intel.github.io/intel-extension-for-pytorch/cpu/latest/tutorials/api_doc.html#ipex.llm.optimize),
and supported model list can be found at [this page](https://intel.github.io/intel-extension-for-pytorch/cpu/latest/tutorials/llm.html#ipexllm-optimized-model-list-for-inference).
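In its simplest form the API is invoked as below. This is a sketch that assumes a Hugging Face Transformers causal-LM `model` on the supported model list has already been loaded:

```python
import torch
import intel_extension_for_pytorch as ipex

# Apply LLM-specific optimizations (fused kernels, optimized KV cache, etc.)
model = ipex.llm.optimize(model, dtype=torch.bfloat16, inplace=True)
```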
-For LLM fine-tuning, please check the [LLM fine-tuning tutorial](https://github.com/intel/intel-extension-for-pytorch/tree/main/examples/cpu/llm/fine-tuning).
-
## Pseudocode of Common Usage Scenarios
The following sections show pseudocode snippets to invoke Intel® Extension for PyTorch\* APIs to work with LLM models.
-Complete examples can be found at [the Example directory](https://github.com/intel/intel-extension-for-pytorch/tree/main/examples/cpu/llm/inference).
+Complete examples can be found at [the Example directory](https://github.com/intel/intel-extension-for-pytorch/tree/release/2.8/examples/cpu/llm/inference).
### FP32/BF16
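A pseudocode sketch of the FP32/BF16 flow (the model name is illustrative; any model on the supported list applies, and generation arguments are kept minimal):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
import intel_extension_for_pytorch as ipex

model_id = "meta-llama/Llama-2-7b-hf"  # example only
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Apply the LLM optimizations for the chosen dtype.
model = ipex.llm.optimize(model, dtype=torch.bfloat16)

with torch.inference_mode(), torch.cpu.amp.autocast(enabled=True):
    inputs = tokenizer("Hello, my name is", return_tensors="pt")
    output = model.generate(**inputs, max_new_tokens=32)
```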
@@ -59,7 +57,7 @@ model = ipex.llm.optimize(model, quantization_config=qconfig, low_precision_chec
Distributed inference can be performed with `DeepSpeed`. Based on the original Intel® Extension for PyTorch\* scripts, the following code changes are required.
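A pseudocode sketch of the shape these changes take (the `init_inference` arguments shown are illustrative and depend on the installed DeepSpeed version; `model` is assumed to be already loaded as above):

```python
import torch
import deepspeed
import intel_extension_for_pytorch as ipex

# Shard the model across ranks first, then apply the extension's
# LLM optimizations to the sharded module on each rank.
model = deepspeed.init_inference(
    model,
    dtype=torch.bfloat16,
    replace_with_kernel_inject=False,
).module
model = ipex.llm.optimize(model, dtype=torch.bfloat16)
```

The script would then be launched with a multi-process launcher (e.g. `deepspeed` or `mpirun`), one rank per device or socket.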
-Check the [LLM distributed inference examples](https://github.com/intel/intel-extension-for-pytorch/tree/main/examples/cpu/llm/inference/distributed) for complete code.
+Check the [LLM distributed inference examples](https://github.com/intel/intel-extension-for-pytorch/tree/release/2.8/examples/cpu/llm/inference/distributed) for complete code.