Top 56 avx2 open source projects

Sse Popcount
SIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html
Libpopcnt
🚀 Fast C/C++ bit population count library
Turbo Run Length Encoding
TurboRLE-Fastest Run Length Encoding
Toys
Storage for my snippets, toy programs, etc.
Highwayhash
Node.js implementation of HighwayHash, Google's fast and strong hash function
Osaca
Open Source Architecture Code Analyzer
Nsimd
Agenium Scale vectorization library for CPUs and GPUs
Tensorflow Optimized Wheels
TensorFlow wheels built for latest CUDA/CuDNN and enabled performance flags: SSE, AVX, FMA; XLA
Sse4 Strstr
SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) of Karp-Rabin algorithm's modification
Corrfunc
⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Base64simd
Base64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Simd
C++ image processing and machine learning library with using of SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX(Altivec) and VSX(Power7), NEON for ARM.
Md5 Simd
Accelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Unisimd Assembler
SIMD macro assembler unified for ARM, MIPS, PPC and x86
Op rbf
Optimized Recursive Bilateral Filter
Simde
Implementations of SIMD instruction sets for systems which don't natively support them.
Sixtyfour
How fast can we brute force a 64-bit comparison?
Libsimdpp
Portable header-only C++ low level SIMD library
Quadray Engine
Realtime raytracer using SIMD on ARM, MIPS, PPC and x86
Ksim
The little simulator that could.
Directxmath
DirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps
Wheels
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Highwayhash
Native Go version of HighwayHash with optimized assembly implementations on Intel and ARM. Able to process over 10 GB/sec on a single core on Intel CPUs - https://en.wikipedia.org/wiki/HighwayHash
Fastnoisesimd
C++ SIMD Noise Library
Libxsmm
Library for specialized dense and sparse matrix operations, and deep learning primitives.
Simdjsonsharp
C# bindings for lemire/simdjson (and full C# port)
✭ 506
jsonsimdavx2
Asm Dude
Visual Studio extension for assembly syntax highlighting and code completion in assembly files and the disassembly window
Fastbase64
SIMD-accelerated base64 codecs
✭ 309
csimdavx2
Highway
Performance-portable, length-agnostic SIMD with runtime dispatch
simdutf
Unicode routines (UTF8, UTF16): billions of characters per second.
awesome-simd
A curated list of awesome SIMD frameworks, libraries and software
block-aligner
SIMD-accelerated library for computing global and X-drop affine gap penalty sequence-to-sequence or sequence-to-profile alignments using an adaptive block-based algorithm.
positional-popcount
Fast C functions for the computing the positional popcount (pospopcnt).
utf8
Fast UTF-8 validation with range algorithm (NEON+SSE4+AVX2)
simdjson-rs
Rust version of lemire's SimdJson
cpuwhat
Nim utilities for advanced CPU operations: CPU identification, ISA extension detection, bindings to assorted intrinsics
sliceslice-rs
A fast implementation of single-pattern substring search using SIMD acceleration.
argon2
Implementation of argon2 (i, d, id) algorithms with CPU dispatching
simd-byte-lookup
SIMDized check which bytes are in a set
ternary-logic
Support for ternary logic in SSE, XOP, AVX2 and x86 programs
1-56 of 56 avx2 projects