ultra-sort - DSL for SIMD Sorting on AVX2 & AVX512
Stars: ✭ 29 (-38.3%)
Vc - SIMD Vector Classes for C++
Stars: ✭ 985 (+1995.74%)
Quadray Engine - Realtime raytracer using SIMD on ARM, MIPS, PPC and x86
Stars: ✭ 13 (-72.34%)
utf8 - Fast UTF-8 validation with range algorithm (NEON+SSE4+AVX2)
Stars: ✭ 60 (+27.66%)
Md5 Simd - Accelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Stars: ✭ 71 (+51.06%)
ternary-logic - Support for ternary logic in SSE, XOP, AVX2 and x86 programs
Stars: ✭ 21 (-55.32%)
Nsimd - Agenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (+193.62%)
Simde - Implementations of SIMD instruction sets for systems which don't natively support them.
Stars: ✭ 1,012 (+2053.19%)
Base64simd - Base64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Stars: ✭ 115 (+144.68%)
Corrfunc - ⚡️⚡️⚡️ Blazing fast correlation functions on the CPU.
Stars: ✭ 114 (+142.55%)
Highway - Performance-portable, length-agnostic SIMD with runtime dispatch
Stars: ✭ 301 (+540.43%)
Libxsmm - Library for specialized dense and sparse matrix operations, and deep learning primitives.
Stars: ✭ 518 (+1002.13%)
sliceslice-rs - A fast implementation of single-pattern substring search using SIMD acceleration.
Stars: ✭ 66 (+40.43%)
Simd - C++ image processing and machine learning library using SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX (AltiVec) and VSX (Power7), NEON for ARM.
Stars: ✭ 1,263 (+2587.23%)
Unisimd Assembler - SIMD macro assembler unified for ARM, MIPS, PPC and x86
Stars: ✭ 63 (+34.04%)
Umesimd - UME::SIMD, a library for explicit SIMD vectorization.
Stars: ✭ 66 (+40.43%)
Libsimdpp - Portable header-only C++ low level SIMD library
Stars: ✭ 914 (+1844.68%)
Simdjson - Parsing gigabytes of JSON per second
Stars: ✭ 15,115 (+32059.57%)
simdjson-rs - Rust version of lemire's SimdJson
Stars: ✭ 18 (-61.7%)
Sse4 Strstr - SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) implementations of a modified Karp-Rabin algorithm
Stars: ✭ 115 (+144.68%)
Base64 Avx512 - Code for paper "Base64 encoding and decoding at almost the speed of a memory copy"
Stars: ✭ 158 (+236.17%)
Osaca - Open Source Architecture Code Analyzer
Stars: ✭ 162 (+244.68%)
Toys - Storage for my snippets, toy programs, etc.
Stars: ✭ 187 (+297.87%)
awesome-simd - A curated list of awesome SIMD frameworks, libraries and software
Stars: ✭ 39 (-17.02%)
simdutf - Unicode routines (UTF8, UTF16): billions of characters per second.
Stars: ✭ 108 (+129.79%)
Std Simd - std::experimental::simd for GCC [ISO/IEC TS 19570:2018]
Stars: ✭ 275 (+485.11%)
Simdjsonsharp - C# bindings for lemire/simdjson (and full C# port)
Stars: ✭ 506 (+976.6%)
Asm Dude - Visual Studio extension for assembly syntax highlighting and code completion in assembly files and the disassembly window
Stars: ✭ 3,898 (+8193.62%)
Onednn - oneAPI Deep Neural Network Library (oneDNN)
Stars: ✭ 2,600 (+5431.91%)
Fastbase64 - SIMD-accelerated base64 codecs
Stars: ✭ 309 (+557.45%)
SIMDxorshift - Fast random number generators: Vectorized (SIMD) version of xorshift128+
Stars: ✭ 84 (+78.72%)
block-aligner - SIMD-accelerated library for computing global and X-drop affine gap penalty sequence-to-sequence or sequence-to-profile alignments using an adaptive block-based algorithm.
Stars: ✭ 58 (+23.4%)
Sse Popcount - SIMD (SSE) population count (http://0x80.pl/articles/sse-popcount.html)
Stars: ✭ 226 (+380.85%)
Libpopcnt - 🚀 Fast C/C++ bit population count library
Stars: ✭ 219 (+365.96%)
Sleef - SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Stars: ✭ 353 (+651.06%)
Xsimd - C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, NEON, AVX512)
Stars: ✭ 964 (+1951.06%)
simdutf8 - SIMD-accelerated UTF-8 validation for Rust.
Stars: ✭ 426 (+806.38%)
cpuwhat - Nim utilities for advanced CPU operations: CPU identification, ISA extension detection, bindings to assorted intrinsics
Stars: ✭ 25 (-46.81%)
lsp-dsp-lib - DSP library for signal processing
Stars: ✭ 37 (-21.28%)
Turbo-Transpose - Transpose: SIMD Integer+Floating Point Compression Filter
Stars: ✭ 50 (+6.38%)
Directxmath - DirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps
Stars: ✭ 859 (+1727.66%)
Kfr - Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Stars: ✭ 985 (+1995.74%)
Guided Missile Simulation - Guided Missile, Radar and Infrared EOS Simulation Framework written in Fortran.
Stars: ✭ 33 (-29.79%)
frp - FRP: Fast Random Projections
Stars: ✭ 40 (-14.89%)
pyrodigal - Cython bindings and Python interface to Prodigal, an ORF finder for genomes and metagenomes. Now with SIMD!
Stars: ✭ 38 (-19.15%)
SNPRelate - R package: parallel computing toolset for relatedness and principal component analysis of SNP data (Development Version)
Stars: ✭ 74 (+57.45%)
heyoka - C++ library for ODE integration via Taylor's method and LLVM
Stars: ✭ 151 (+221.28%)
Chromium Clang - Chromium browser compiled with the Clang/LLVM compiler.
Stars: ✭ 77 (+63.83%)
mir-glas - [Experimental] LLVM-accelerated Generic Linear Algebra Subprograms
Stars: ✭ 99 (+110.64%)
T13x - An extended version of the T0x multithreaded cores, with a custom general-purpose parametrized SIMD/MIMD vector coprocessor and support for 3-5 way superscalar execution. The core is pin-to-pin compatible with the RISCY cores from PULP.
Stars: ✭ 28 (-40.43%)
generic-simd - Generic SIMD abstractions for Rust.
Stars: ✭ 45 (-4.26%)