
chore(deps): Bump intel-extension-for-pytorch from 2.3.110+xpu to 2.6.0 in /backend/python/rerankers #4849

Conversation

dependabot bot (Contributor) commented on behalf of github on Feb 17, 2025

Bumps intel-extension-for-pytorch from 2.3.110+xpu to 2.6.0.
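For reviewers who want to sanity-check the bump locally, here is a minimal smoke-test sketch, assuming a plain pip install of the pinned version (nothing below is taken from this repository's actual test setup):

```python
# Hypothetical post-upgrade smoke test: confirm the bumped package imports
# cleanly and reports the expected version. torch is imported first, since
# intel-extension-for-pytorch depends on it.
import torch
import intel_extension_for_pytorch as ipex

print("torch:", torch.__version__)
print("ipex:", ipex.__version__)
assert ipex.__version__.startswith("2.6"), "unexpected intel-extension-for-pytorch version"
```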

Release notes

Sourced from intel-extension-for-pytorch's releases.

Intel® Extension for PyTorch* v2.5.10+xpu Release Notes


We are excited to announce the release of Intel® Extension for PyTorch* v2.5.10+xpu. This release supports Intel® GPU platforms (Intel® Data Center GPU Max Series, Intel® Arc™ Graphics family, Intel® Core™ Ultra Processors with Intel® Arc™ Graphics, Intel® Core™ Ultra Series 2 with Intel® Arc™ Graphics, and Intel® Data Center GPU Flex Series) and is based on PyTorch* 2.5.1.

Highlights

  • Intel® oneDNN v3.6 integration

  • Intel® oneAPI Base Toolkit 2025.0.1 compatibility

  • Intel® Arc™ B-series Graphics support on Windows (prototype)

  • Large Language Model (LLM) optimization

    Intel® Extension for PyTorch* enhances KV cache management to cover both the Dynamic Cache and Static Cache methods defined by Hugging Face, which reduces computation time and improves response rates across generative tasks. It also adds new LLM features: speculative decoding, which speeds up inference by drafting likely future tokens while the current token is being generated; sliding window attention, which limits each token's attention span to a fixed-size window and significantly improves processing speed and efficiency on long documents; and multi-round conversation support for natural, multi-turn exchanges. (A usage sketch follows this list.)

    Beyond that, Intel® Extension for PyTorch* optimizes more LLM models for inference and fine-tuning. The full list of optimized models can be found in the LLM Optimizations Overview.

  • Serving framework support

    Typical LLM serving frameworks, including vLLM and TGI, can work with Intel® Extension for PyTorch* on Intel® GPU platforms on Linux (intensively verified on Intel® Data Center GPU Max Series). Support for low precision, such as INT4 weight-only quantization based on the Generalized Post-Training Quantization (GPTQ) algorithm, is enhanced in this release.

  • Beta support of full fine-tuning and LoRA PEFT with mixed precision

    Intel® Extension for PyTorch* enhances this feature for optimizing typical LLM models and makes it reach Beta quality.

  • Kineto Profiler Support

    Intel® Extension for PyTorch* removes its own Kineto profiler integration as redundant, since Kineto profiler support based on PTI for Intel® GPU platforms is available in PyTorch* 2.5. (A profiling sketch follows this list.)

  • Hybrid ATen operator implementation

    Intel® Extension for PyTorch* uses the ATen operators available in Torch XPU Operators wherever possible and overrides only a limited set of operators for better performance and broader data type support.
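To make the LLM optimization item above concrete, here is a minimal sketch of the documented ipex.llm.optimize entry point on an XPU device. The model id, dtype, and prompt are illustrative assumptions, not taken from this PR or release:

```python
# Minimal sketch: applying Intel® Extension for PyTorch* LLM optimizations
# (which include the enhanced KV cache handling described above) to a
# Hugging Face causal LM on an Intel GPU. Model id and dtype are assumptions.
import torch
import intel_extension_for_pytorch as ipex
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Llama-2-7b-hf"  # hypothetical model choice
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float16)
model = model.eval().to("xpu")

# ipex.llm.optimize applies the extension's LLM-specific kernel and memory
# optimizations for inference.
model = ipex.llm.optimize(model, dtype=torch.float16, device="xpu")

inputs = tokenizer("What is sliding window attention?", return_tensors="pt").to("xpu")
with torch.no_grad():
    output = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```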
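Similarly, since the Kineto profiler path now lives in stock PyTorch (per the profiler item above), XPU profiling goes through torch.profiler directly. A minimal sketch, assuming PyTorch* 2.5 with a working XPU device; the sort key is inferred from the CUDA analogue and not verified against this exact release:

```python
# Minimal sketch: profiling an XPU workload with the stock PyTorch profiler,
# which as of PyTorch* 2.5 covers Intel GPUs via the Kineto/PTI path.
import torch
import intel_extension_for_pytorch as ipex  # noqa: F401 (ensures XPU kernels are registered)
from torch.profiler import profile, ProfilerActivity

x = torch.randn(1024, 1024, device="xpu")
w = torch.randn(1024, 1024, device="xpu")

with profile(activities=[ProfilerActivity.CPU, ProfilerActivity.XPU]) as prof:
    y = x @ w
    torch.xpu.synchronize()  # finish device work before reporting

# "self_xpu_time_total" mirrors the CUDA sort key; an assumption, not a
# guarantee for this release.
print(prof.key_averages().table(sort_by="self_xpu_time_total", row_limit=10))
```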

Breaking Changes

  • Block format support: oneDNN block format integration has been removed as of v2.5.10+xpu.

Known Issues

Please refer to Known Issues webpage.

Intel® Extension for PyTorch* v2.5.0+cpu Release Notes

We are excited to announce the release of Intel® Extension for PyTorch* 2.5.0+cpu, which accompanies PyTorch 2.5. This release mainly brings support for Llama 3.2, optimizations for the newly launched Intel® Xeon® 6 P-core platform, GPTQ/AWQ format support, and the latest optimizations to push better performance for LLM models. It also includes a set of bug fixes and small optimizations. We want to sincerely thank our dedicated community for your contributions. As always, we encourage you to try this release and provide feedback to help us improve the product further.

Highlights

  • Llama 3.2 support: Meta has newly released Llama 3.2, which includes small and medium-sized vision LLMs (11B and 90B) and lightweight, text-only models (1B and 3B). Intel® Extension for PyTorch* has supported Llama 3.2 since its launch date via an early release version, and now supports it in this official release.
  • Optimization for Intel® Xeon® 6: Intel® Xeon® 6 delivers new degrees of performance with more cores, a choice of microarchitecture, additional memory bandwidth, and exceptional input/output (I/O) across a range of workloads. Intel® Extension for PyTorch* provides dedicated optimizations for this new processor family, covering features like Multiplexed Rank DIMM (MRDIMM), the SNC=3 scenario, etc.
  • Large Language Model (LLM) optimization:

... (truncated)


Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Bumps [intel-extension-for-pytorch](https://github.com/intel/intel-extension-for-pytorch) from 2.3.110+xpu to 2.6.0.
- [Release notes](https://github.com/intel/intel-extension-for-pytorch/releases)
- [Commits](https://github.com/intel/intel-extension-for-pytorch/commits)

---
updated-dependencies:
- dependency-name: intel-extension-for-pytorch
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
dependabot bot added the dependencies and python (Pull requests that update Python code) labels on Feb 17, 2025
github-actions bot enabled auto-merge (squash) February 17, 2025 18:44

netlify bot commented Feb 17, 2025

Deploy Preview for localai ready!

Name | Link
🔨 Latest commit | 0615891
🔍 Latest deploy log | https://app.netlify.com/sites/localai/deploys/67b38374a8608b0009abb8de
😎 Deploy Preview | https://deploy-preview-4849--localai.netlify.app

To edit notification comments on pull requests, go to your Netlify site configuration.

mudler closed this on Feb 18, 2025
auto-merge was automatically disabled February 18, 2025 08:19

Pull request was closed

dependabot bot (Contributor, Author) commented on behalf of github on Feb 18, 2025

OK, I won't notify you again about this release, but will get in touch when a new version is available. If you'd rather skip all updates until the next major or minor version, let me know by commenting @dependabot ignore this major version or @dependabot ignore this minor version. You can also ignore all major, minor, or patch releases for a dependency by adding an ignore condition with the desired update_types to your config file.

If you change your mind, just re-open this PR and I'll resolve any conflicts on it.
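For reference, the ignore condition mentioned above lives in the repository's Dependabot configuration. A minimal sketch of what such an entry could look like, in standard Dependabot v2 syntax; the schedule and update-types values are illustrative, not this repository's actual configuration:

```yaml
# Hypothetical .github/dependabot.yml fragment adding an ignore condition
# with update-types for this dependency; values shown are examples only.
version: 2
updates:
  - package-ecosystem: "pip"
    directory: "/backend/python/rerankers"
    schedule:
      interval: "weekly"
    ignore:
      - dependency-name: "intel-extension-for-pytorch"
        update-types: ["version-update:semver-major"]
```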

@dependabot dependabot bot deleted the dependabot/pip/backend/python/rerankers/intel-extension-for-pytorch-2.6.0 branch February 18, 2025 08:19