[TRANSFORMATIONS] Slice the correct dimension for sinks in SDPA decomposition (#32488)
Details:
Slice the correct dimension for sinks in SDPA decomposition. Instead of using seq_len (L in spec.), use prev_seq_len (S in spec.).
Tickets:
- CVS-175041
Signed-off-by: Andrii Staikov andrii.staikov@intel.com
Open-source software toolkit for optimizing and deploying deep learning models.
Documentation • Blog • Key Features • Tutorials • Integrations • Benchmarks • Generative AI
Check out the OpenVINO Cheat Sheet and Key Features for a quick reference.
Installation
Get your preferred distribution of OpenVINO or use this command for quick installation:
Check system requirements and supported devices for detailed information.
Tutorials and Examples
OpenVINO Quickstart example will walk you through the basics of deploying your first model.
Learn how to optimize and deploy popular models with the OpenVINO Notebooks📚:
Discover more examples in the OpenVINO Samples (Python & C++) and Notebooks (Python).
Here are easy-to-follow code examples demonstrating how to run PyTorch and TensorFlow model inference using OpenVINO:
PyTorch Model
TensorFlow Model
OpenVINO supports the CPU, GPU, and NPU devices and works with models from PyTorch, TensorFlow, ONNX, TensorFlow Lite, PaddlePaddle, and JAX/Flax frameworks. It includes APIs in C++, Python, C, NodeJS, and offers the GenAI API for optimized model pipelines and performance.
Generative AI with OpenVINO
Get started with the OpenVINO GenAI installation and refer to the detailed guide to explore the capabilities of Generative AI using OpenVINO.
Learn how to run LLMs and GenAI with Samples in the OpenVINO™ GenAI repo. See GenAI in action with Jupyter notebooks: LLM-powered Chatbot and LLM Instruction-following pipeline.
Documentation
User documentation contains detailed information about OpenVINO and guides you from installation through optimizing and deploying models for your AI applications.
Developer documentation focuses on the OpenVINO architecture and describes building and contributing processes.
OpenVINO Ecosystem
OpenVINO Tools
Integrations
Check out the Awesome OpenVINO repository to discover a collection of community-made AI projects based on OpenVINO!
Performance
Explore OpenVINO Performance Benchmarks to discover the optimal hardware configurations and plan your AI deployment based on verified data.
Contribution and Support
Check out Contribution Guidelines for more details. Read the Good First Issues section, if you’re looking for a place to start contributing. We welcome contributions of all kinds!
You can ask questions and get support on:
openvinotag on Stack Overflow*.Resources
Telemetry
OpenVINO™ collects software performance and usage data for the purpose of improving OpenVINO™ tools. This data is collected directly by OpenVINO™ or through the use of Google Analytics 4. You can opt-out at any time by running the command:
More Information is available at OpenVINO™ Telemetry.
License
OpenVINO™ Toolkit is licensed under Apache License Version 2.0. By contributing to the project, you agree to the license and copyright terms therein and release your contribution under these terms.
* Other names and brands may be claimed as the property of others.