Mipp: MIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX and AVX-512.
Osaca: Open Source Architecture Code Analyzer
Hlslpp: Math library using HLSL syntax with SSE/NEON support
Nsimd: Agenium Scale vectorization library for CPUs and GPUs
Corrfunc: ⚡️⚡️⚡️ Blazing fast correlation functions on the CPU.
Packettracer: SIMD-accelerated ray tracing in C#, powered by the Intel hardware intrinsics of .NET Core.
Despacer: C library to remove white space from strings as fast as possible
Simd: C++ image processing and machine learning library using SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX (Altivec) and VSX (Power7), and NEON for ARM.
Umesimd: UME::SIMD, a library for explicit SIMD vectorization.
Engraver: PoCC Burstcoin Reference Plotter
Simde: Implementations of SIMD instruction sets for systems which don't natively support them.
Vc: SIMD Vector Classes for C++
Kfr: Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Xsimd: C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, NEON, AVX512)
Directxmath: DirectXMath is an all-inline SIMD C++ linear algebra library for use in games and graphics apps
Wheels: Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Cglm: 📽 Highly Optimized Graphics Math (glm) for C
Sha256 Simd: Accelerate SHA256 computations in pure Go using AVX512, SHA Extensions for x86, and ARM64 for ARM. On AVX512 it provides an up to 8x improvement (over 3 GB/s per core). SHA Extensions give a performance boost of close to 4x over native.
Libxsmm: Library for specialized dense and sparse matrix operations, and deep learning primitives.
Sleef: SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Std Simd: std::experimental::simd for GCC [ISO/IEC TS 19570:2018]
Corium: Corium is a modern scripting language which combines simple, safe and efficient programming.
peakperf: Achieve peak performance on x86 CPUs and NVIDIA GPUs
cpuwhat: Nim utilities for advanced CPU operations: CPU identification, ISA extension detection, bindings to assorted intrinsics
oversimple: A library for audio oversampling, which tries to offer a simple API while wrapping HIIR, by Laurent De Soras, for minimum-phase antialiasing, and r8brain-free-src, by Aleksey Vaneev, for linear-phase antialiasing.
penguinV: Simple and fast C++ image processing library with focus on heterogeneous systems
hpc: Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc.)
ternary-logic: Support for ternary logic in SSE, XOP, AVX2 and x86 programs