Intel® Extension for Transformers
latest▼
Click link above to switch version
Getting Started
Installation
User Guide
Features
Neural Engine
Kernels
Transformers-Accelerated Libraries
Performance
Implementation Details
3D Inference
Binary Injectors
Element-wise Injector
Sparse GEMM VNNI
Sparse GEMM AMX
Sparse GEMM AVX512F
Sparse GEMM with Layer-Normalize
Transposed MatMul
Transposed MHA
Dynamic Quant Matmul
Example
API
OpenSSF Badge
Security Policy
Release
Legal Information
Repo
Intel® Extension for Transformers
User Guide
Kernels
Implementation Details
View page source
Implementation Details
3D Inference
Binary Injectors
Element-wise Injector
Sparse GEMM VNNI
Sparse GEMM AMX
Sparse GEMM AVX512F
Sparse GEMM with Layer-Normalize
Transposed MatMul
Transposed MHA
Dynamic Quant Matmul