Basic Linear Algebra Subprograms


BLAS (Basic Linear Algebra Subprograms) is a set of low-level routines for performing common linear algebra operations such as:

  • vector addition

  • scalar multiplication

  • dot products

  • linear combinations

  • matrix multiplication
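
To make the operations above concrete, here is a minimal pure-Python sketch of what each one computes. The function names `scal`, `axpy`, `dot` and `gemm` are borrowed from the corresponding BLAS routine families, but the signatures are simplified assumptions for illustration: real BLAS routines work on typed arrays with explicit sizes and strides, `axpy` updates `y` in place, and `gemm` computes the more general `alpha*A*B + beta*C`. This is not an optimized implementation, only a reference for the arithmetic.

```python
def scal(alpha, x):
    """Scalar multiplication: return alpha * x."""
    return [alpha * xi for xi in x]


def axpy(alpha, x, y):
    """Linear combination: return alpha * x + y.

    With alpha = 1 this is plain vector addition.
    """
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    return [alpha * xi + yi for xi, yi in zip(x, y)]


def dot(x, y):
    """Dot product: return the sum of x[i] * y[i]."""
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    return sum(xi * yi for xi, yi in zip(x, y))


def gemm(a, b):
    """Matrix multiplication of two matrices stored as lists of rows."""
    inner = len(b)
    if any(len(row) != inner for row in a):
        raise ValueError("columns of a must match rows of b")
    cols = len(b[0])
    return [
        [sum(a[i][k] * b[k][j] for k in range(inner)) for j in range(cols)]
        for i in range(len(a))
    ]


if __name__ == "__main__":
    print(axpy(1.0, [1.0, 2.0], [3.0, 4.0]))        # vector addition: [4.0, 6.0]
    print(scal(2.0, [1.0, 2.0]))                    # [2.0, 4.0]
    print(dot([1.0, 2.0, 3.0], [4.0, 5.0, 6.0]))    # 32.0
    print(gemm([[1, 2], [3, 4]], [[5, 6], [7, 8]])) # [[19, 22], [43, 50]]
```

An optimized BLAS library produces the same results (up to floating-point rounding) while using the hardware far more efficiently.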


BLAS implementations are often optimized for speed on a particular machine, taking advantage of hardware features such as:

  • vector registers

  • SIMD instructions

Using an optimized implementation can therefore bring substantial performance benefits:

  • Prompt processing shows some performance improvement when the batch size is higher than 32 (the default is 512).
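
One plausible reason batch size matters, offered here as an assumption rather than something the notes state: with a batch of inputs, many separate matrix-vector products can be expressed as a single matrix-matrix product, which is the kind of operation BLAS libraries optimize most heavily. The sketch below only shows that the two formulations give the same results; the helper names `one_at_a_time` and `batched` are hypothetical, and pure Python will not show any speedup.

```python
def gemv(w, x):
    """Matrix-vector product: return W x for W stored as a list of rows."""
    return [sum(wij * xj for wij, xj in zip(row, x)) for row in w]


def gemm(a, b):
    """Matrix-matrix product of two matrices stored as lists of rows."""
    cols = len(b[0])
    return [
        [sum(row[k] * b[k][j] for k in range(len(b))) for j in range(cols)]
        for row in a
    ]


def one_at_a_time(w, inputs):
    """Process each input vector with its own matrix-vector product."""
    return [gemv(w, x) for x in inputs]


def batched(w, inputs):
    """Process all input vectors with a single matrix-matrix product."""
    # Stack the inputs as the columns of X, so X has one column per input.
    x_cols = [list(col) for col in zip(*inputs)]
    y = gemm(w, x_cols)
    # Column j of Y is the result for input j; transpose back to a list of vectors.
    return [list(col) for col in zip(*y)]


if __name__ == "__main__":
    w = [[1, 2], [3, 4]]
    inputs = [[1, 0], [0, 1], [1, 1]]
    print(one_at_a_time(w, inputs))  # [[1, 3], [2, 4], [3, 7]]
    print(batched(w, inputs))        # [[1, 3], [2, 4], [3, 7]]
```

In an optimized library, the batched form lets the weight matrix be loaded once and reused across the whole batch, which is where larger batch sizes can pay off.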

Several BLAS implementations are available:

  • Apple's implementation (part of the Accelerate framework)

  • OpenBLAS

  • Intel's implementation (Intel MKL)

    • It includes CBLAS, a C-style interface to the BLAS routines that can be called using regular C-style calls


