Intel® Extension for Transformers
latest▼

Click link above to switch version

  • Getting Started
  • Installation
  • User Guide
    • Features
    • Neural Engine
    • Kernels
      • Transformers-Accelerated Libraries
      • Performance
      • Implementation Details
        • 3D Inference
        • Binary Injectors
        • Element-wise Injector
        • Sparse GEMM VNNI
        • Sparse GEMM AMX
        • Sparse GEMM AVX512F
        • Sparse GEMM with Layer-Normalize
        • Transposed MatMul
        • Transposed MHA
        • Dynamic Quant Matmul
  • Example
  • API
  • OpenSSF Badge
  • Security Policy
  • Release
  • Legal Information
  • Repo
Intel® Extension for Transformers
  • User Guide
  • Kernels
  • Implementation Details
  • View page source

Implementation Details

  • 3D Inference
  • Binary Injectors
  • Element-wise Injector
  • Sparse GEMM VNNI
  • Sparse GEMM AMX
  • Sparse GEMM AVX512F
  • Sparse GEMM with Layer-Normalize
  • Transposed MatMul
  • Transposed MHA
  • Dynamic Quant Matmul
Previous Next

© Copyright 2022, Intel® Extension for Transformers, Intel.

Built with Sphinx using a theme provided by Read the Docs.

Cookies | Privacy