Simdjson: Parsing gigabytes of JSON per second
Stars: ✭ 15,115 (+25091.67%)
Simd: C++ image processing and machine learning library using SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX (AltiVec) and VSX (Power7), NEON for ARM.
Stars: ✭ 1,263 (+2005%)
Simde: Implementations of SIMD instruction sets for systems which don't natively support them.
Stars: ✭ 1,012 (+1586.67%)
simdutf8: SIMD-accelerated UTF-8 validation for Rust.
Stars: ✭ 426 (+610%)
Directxmath: DirectXMath is an all-inline SIMD C++ linear algebra library for use in games and graphics apps
Stars: ✭ 859 (+1331.67%)
Quadray Engine: Realtime raytracer using SIMD on ARM, MIPS, PPC and x86
Stars: ✭ 13 (-78.33%)
Highway: Performance-portable, length-agnostic SIMD with runtime dispatch
Stars: ✭ 301 (+401.67%)
simdutf: Unicode routines (UTF8, UTF16): billions of characters per second.
Stars: ✭ 108 (+80%)
Umesimd: UME::SIMD, a library for explicit SIMD vectorization.
Stars: ✭ 66 (+10%)
cpuwhat: Nim utilities for advanced CPU operations: CPU identification, ISA extension detection, bindings to assorted intrinsics
Stars: ✭ 25 (-58.33%)
Libsimdpp: Portable header-only C++ low-level SIMD library
Stars: ✭ 914 (+1423.33%)
Vc: SIMD Vector Classes for C++
Stars: ✭ 985 (+1541.67%)
Sleef: SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Stars: ✭ 353 (+488.33%)
positional-popcount: Fast C functions for computing the positional popcount (pospopcnt).
Stars: ✭ 47 (-21.67%)
Base64simd: Base64 encoding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Stars: ✭ 115 (+91.67%)
Nsimd: Agenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (+130%)
Unisimd Assembler: SIMD macro assembler unified for ARM, MIPS, PPC and x86
Stars: ✭ 63 (+5%)
Sse2neon: A translator from Intel SSE intrinsics to Arm/AArch64 NEON implementations
Stars: ✭ 316 (+426.67%)
Computelibrary: The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
Stars: ✭ 2,123 (+3438.33%)
awesome-simd: A curated list of awesome SIMD frameworks, libraries and software
Stars: ✭ 39 (-35%)
Fastbase64: SIMD-accelerated base64 codecs
Stars: ✭ 309 (+415%)
Libxsmm: Library for specialized dense and sparse matrix operations, and deep learning primitives.
Stars: ✭ 518 (+763.33%)
Std Simd: std::experimental::simd for GCC [ISO/IEC TS 19570:2018]
Stars: ✭ 275 (+358.33%)
block-aligner: SIMD-accelerated library for computing global and X-drop affine gap penalty sequence-to-sequence or sequence-to-profile alignments using an adaptive block-based algorithm.
Stars: ✭ 58 (-3.33%)
Simdjsonsharp: C# bindings for lemire/simdjson (and full C# port)
Stars: ✭ 506 (+743.33%)
Corrfunc: ⚡️⚡️⚡️ Blazing fast correlation functions on the CPU.
Stars: ✭ 114 (+90%)
Md5 Simd: Accelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Stars: ✭ 71 (+18.33%)
Cglm: 📽 Highly Optimized Graphics Math (glm) for C
Stars: ✭ 887 (+1378.33%)
Highwayhash: Native Go version of HighwayHash with optimized assembly implementations on Intel and ARM. Able to process over 10 GB/sec on a single core on Intel CPUs - https://en.wikipedia.org/wiki/HighwayHash
Stars: ✭ 670 (+1016.67%)
Xsimd: C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, NEON, AVX512)
Stars: ✭ 964 (+1506.67%)
Turbo-Transpose: Transpose: SIMD Integer+Floating Point Compression Filter
Stars: ✭ 50 (-16.67%)
pedalevite: Pédale Vite, a DIY multi-FX pedalboard for guitar/bass/etc.
Stars: ✭ 68 (+13.33%)
Sse4 Strstr: SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) implementations of a modified Karp-Rabin algorithm
Stars: ✭ 115 (+91.67%)
Faster: SIMD for humans
Stars: ✭ 1,304 (+2073.33%)
Jevois: JeVois smart machine vision framework
Stars: ✭ 128 (+113.33%)
Mipp: MIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX and AVX-512.
Stars: ✭ 253 (+321.67%)
ternary-logic: Support for ternary logic in SSE, XOP, AVX2 and x86 programs
Stars: ✭ 21 (-65%)
Qreverse: A small study in hardware-accelerated AoS reversal
Stars: ✭ 97 (+61.67%)
Wheels: Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+1385%)
Arm Vo: Efficient monocular visual odometry for ground vehicles on ARM processors
Stars: ✭ 115 (+91.67%)
ultra-sort: DSL for SIMD Sorting on AVX2 & AVX512
Stars: ✭ 29 (-51.67%)
jevoisbase: JeVois base collection of algorithms and modules
Stars: ✭ 41 (-31.67%)
oversimple: A library for audio oversampling that aims to offer a simple API while wrapping HIIR, by Laurent De Soras, for minimum phase antialiasing, and r8brain-free-src, by Aleksey Vaneev, for linear phase antialiasing.
Stars: ✭ 25 (-58.33%)
sliceslice-rs: A fast implementation of single-pattern substring search using SIMD acceleration.
Stars: ✭ 66 (+10%)
Libpopcnt: 🚀 Fast C/C++ bit population count library
Stars: ✭ 219 (+265%)
SoftLight: A shader-based Software Renderer Using The LightSky Framework.
Stars: ✭ 2 (-96.67%)
simdjson-rs: Rust version of lemire's SimdJson
Stars: ✭ 18 (-70%)
modest-py: FMI-compliant Model Estimation in Python
Stars: ✭ 40 (-33.33%)
Hades: 🔥 A Nintendo Game Boy Advance emulator
Stars: ✭ 44 (-26.67%)
android-openssl: OpenSSL build for Android (arm, armv7, x86)
Stars: ✭ 69 (+15%)