Top 36 avx512 open source projects

Sse Popcount
SIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html
Libpopcnt
🚀 Fast C/C++ bit population count library
Toys
Storage for my snippets, toy programs, etc.
Osaca
Open Source Architecture Code Analyzer
Base64 Avx512
Code for paper "Base64 encoding and decoding at almost the speed of a memory copy"
Nsimd
Agenium Scale vectorization library for CPUs and GPUs
Sse4 Strstr
SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) of Karp-Rabin algorithm's modification
Corrfunc
⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Base64simd
Base64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Simd
C++ image processing and machine learning library with using of SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX(Altivec) and VSX(Power7), NEON for ARM.
Univdisasm
x86 Disassembler and Analyzer
Md5 Simd
Accelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Unisimd Assembler
SIMD macro assembler unified for ARM, MIPS, PPC and x86
Simde
Implementations of SIMD instruction sets for systems which don't natively support them.
Kfr
Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Xsimd
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, NEON, AVX512)
Libsimdpp
Portable header-only C++ low level SIMD library
Quadray Engine
Realtime raytracer using SIMD on ARM, MIPS, PPC and x86
Sha256 Simd
Accelerate SHA256 computations in pure Go using Accelerate SHA256 computations in pure Go using AVX512, SHA Extensions for x86 and ARM64 for ARM. On AVX512 it provides an up to 8x improvement (over 3 GB/s per core). SHA Extensions give a performance boost of close to 4x over native.
Libxsmm
Library for specialized dense and sparse matrix operations, and deep learning primitives.
Asm Dude
Visual Studio extension for assembly syntax highlighting and code completion in assembly files and the disassembly window
Sleef
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Highway
Performance-portable, length-agnostic SIMD with runtime dispatch
Std Simd
std::experimental::simd for GCC [ISO/IEC TS 19570:2018]
positional-popcount
Fast C functions for the computing the positional popcount (pospopcnt).
yask
YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-difference methods and similar applications.
md5-optimisation
The fastest MD5 implementation using x86 assembly
simd-byte-lookup
SIMDized check which bytes are in a set
ternary-logic
Support for ternary logic in SSE, XOP, AVX2 and x86 programs
1-36 of 36 avx512 projects