ultra-sortDSL for SIMD Sorting on AVX2 & AVX512
Stars: ✭ 29 (-56.06%)
positional-popcountFast C functions for the computing the positional popcount (pospopcnt).
Stars: ✭ 47 (-28.79%)
ternary-logicSupport for ternary logic in SSE, XOP, AVX2 and x86 programs
Stars: ✭ 21 (-68.18%)
block-alignerSIMD-accelerated library for computing global and X-drop affine gap penalty sequence-to-sequence or sequence-to-profile alignments using an adaptive block-based algorithm.
Stars: ✭ 58 (-12.12%)
psimdPortable 128-bit SIMD intrinsics
Stars: ✭ 48 (-27.27%)
LibsimdppPortable header-only C++ low level SIMD library
Stars: ✭ 914 (+1284.85%)
simdutf8SIMD-accelerated UTF-8 validation for Rust.
Stars: ✭ 426 (+545.45%)
simdjson-rsRust version of lemire's SimdJson
Stars: ✭ 18 (-72.73%)
lsp-dsp-libDSP library for signal processing
Stars: ✭ 37 (-43.94%)
NsimdAgenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (+109.09%)
VcSIMD Vector Classes for C++
Stars: ✭ 985 (+1392.42%)
Corrfunc⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Stars: ✭ 114 (+72.73%)
cpuwhatNim utilities for advanced CPU operations: CPU identification, ISA extension detection, bindings to assorted intrinsics
Stars: ✭ 25 (-62.12%)
SIMDxorshiftFast random number generators: Vectorized (SIMD) version of xorshift128+
Stars: ✭ 84 (+27.27%)
awesome-simdA curated list of awesome SIMD frameworks, libraries and software
Stars: ✭ 39 (-40.91%)
simdutfUnicode routines (UTF8, UTF16): billions of characters per second.
Stars: ✭ 108 (+63.64%)
Fastbase64SIMD-accelerated base64 codecs
Stars: ✭ 309 (+368.18%)
LibxsmmLibrary for specialized dense and sparse matrix operations, and deep learning primitives.
Stars: ✭ 518 (+684.85%)
DirectxmathDirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps
Stars: ✭ 859 (+1201.52%)
Unisimd AssemblerSIMD macro assembler unified for ARM, MIPS, PPC and x86
Stars: ✭ 63 (-4.55%)
UmesimdUME::SIMD A library for explicit simd vectorization.
Stars: ✭ 66 (+0%)
Base64simdBase64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Stars: ✭ 115 (+74.24%)
SimdjsonsharpC# bindings for lemire/simdjson (and full C# port)
Stars: ✭ 506 (+666.67%)
HighwayPerformance-portable, length-agnostic SIMD with runtime dispatch
Stars: ✭ 301 (+356.06%)
SimdjsonParsing gigabytes of JSON per second
Stars: ✭ 15,115 (+22801.52%)
SimdC++ image processing and machine learning library with using of SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX(Altivec) and VSX(Power7), NEON for ARM.
Stars: ✭ 1,263 (+1813.64%)
SimdeImplementations of SIMD instruction sets for systems which don't natively support them.
Stars: ✭ 1,012 (+1433.33%)
utf8Fast UTF-8 validation with range algorithm (NEON+SSE4+AVX2)
Stars: ✭ 60 (-9.09%)
Quadray EngineRealtime raytracer using SIMD on ARM, MIPS, PPC and x86
Stars: ✭ 13 (-80.3%)
Md5 SimdAccelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Stars: ✭ 71 (+7.58%)
Turbo-TransposeTranspose: SIMD Integer+Floating Point Compression Filter
Stars: ✭ 50 (-24.24%)
MippMIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX and AVX-512.
Stars: ✭ 253 (+283.33%)
Jsturbo.js - perform massive parallel computations in your browser with GPGPU.
Stars: ✭ 2,591 (+3825.76%)
multiversionEasy function multiversioning for Rust
Stars: ✭ 152 (+130.3%)
hermesA Haskell library for fast, memory-efficient decoding of JSON documents using the simdjson C++ library
Stars: ✭ 37 (-43.94%)
Hh SuiteRemote protein homology detection suite.
Stars: ✭ 230 (+248.48%)
42 cheatsheetAlso referred to as "The C Man"
Stars: ✭ 204 (+209.09%)
r4stringsHandling Strings in R
Stars: ✭ 39 (-40.91%)
WeTextProcessingText Normalization & Inverse Text Normalization
Stars: ✭ 213 (+222.73%)
ReedsolomonReed-Solomon Erasure Code engine in Go, could more than 15GB/s per core
Stars: ✭ 203 (+207.58%)
Fastnoise2Modular node based noise generation library using SIMD, C++17 and templates
Stars: ✭ 196 (+196.97%)
StreamvbyteFast integer compression in C using the StreamVByte codec
Stars: ✭ 195 (+195.45%)
Ambar🔍 Ambar: Document Search Engine
Stars: ✭ 1,829 (+2671.21%)
LaserThe HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers
Stars: ✭ 191 (+189.39%)
aes-gcm-siv.NET Core 3.0 implementation of AES-GCM-SIV nonce misuse-resistant authenticated encryption
Stars: ✭ 22 (-66.67%)
FlexsearchNext-Generation full text search library for Browser and Node.js
Stars: ✭ 8,108 (+12184.85%)
DecomposedCATransform3D manipulation made easy.
Stars: ✭ 184 (+178.79%)
UgmUbpa Graphics Mathematics
Stars: ✭ 178 (+169.7%)
The silver searcherA code-searching tool similar to ack, but faster.
Stars: ✭ 23,030 (+34793.94%)
ComputelibraryThe Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
Stars: ✭ 2,123 (+3116.67%)
bulksearchLightweight and read-write optimized full text search library.
Stars: ✭ 108 (+63.64%)