hatrackFast, multi-reader, multi-writer, lockless data structures for parallel programming
Stars: ✭ 55 (+12.24%)
ADbHashReally fast C++ hash table
Stars: ✭ 12 (-75.51%)
Compute EngineHighly optimized inference engine for Binarized Neural Networks
Stars: ✭ 138 (+181.63%)
Hh SuiteRemote protein homology detection suite.
Stars: ✭ 230 (+369.39%)
Base64simdBase64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Stars: ✭ 115 (+134.69%)
CompactcnncascadeA binary library for very fast face detection using compact CNNs.
Stars: ✭ 152 (+210.2%)
ternary-logicSupport for ternary logic in SSE, XOP, AVX2 and x86 programs
Stars: ✭ 21 (-57.14%)
ImpalaAn imperative and functional programming language
Stars: ✭ 118 (+140.82%)
multiversionEasy function multiversioning for Rust
Stars: ✭ 152 (+210.2%)
Amplifier.netAmplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPU/GPU, NVIDIA, AMD without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.
Stars: ✭ 92 (+87.76%)
Fastnoise2Modular node based noise generation library using SIMD, C++17 and templates
Stars: ✭ 196 (+300%)
SimdC++ image processing and machine learning library with using of SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX(Altivec) and VSX(Power7), NEON for ARM.
Stars: ✭ 1,263 (+2477.55%)
ComputelibraryThe Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
Stars: ✭ 2,123 (+4232.65%)
hermesA Haskell library for fast, memory-efficient decoding of JSON documents using the simdjson C++ library
Stars: ✭ 37 (-24.49%)
IspcIntel SPMD Program Compiler
Stars: ✭ 1,924 (+3826.53%)
uwufastest text uwuifier in the west
Stars: ✭ 1,193 (+2334.69%)
Jpeg QuantsmoothJPEG artifacts removal based on quantization coefficients.
Stars: ✭ 134 (+173.47%)
ThorinThe Higher-Order Intermediate Representation
Stars: ✭ 116 (+136.73%)
sliceslice-rsA fast implementation of single-pattern substring search using SIMD acceleration.
Stars: ✭ 66 (+34.69%)
QreverseA small study in hardware accelerated AoS reversal
Stars: ✭ 97 (+97.96%)
42 cheatsheetAlso referred to as "The C Man"
Stars: ✭ 204 (+316.33%)
FasterSIMD for humans
Stars: ✭ 1,304 (+2561.22%)
aes-gcm-siv.NET Core 3.0 implementation of AES-GCM-SIV nonce misuse-resistant authenticated encryption
Stars: ✭ 22 (-55.1%)
DictionaryHigh-performance dictionary coding
Stars: ✭ 77 (+57.14%)
LaserThe HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers
Stars: ✭ 191 (+289.8%)
Md5 SimdAccelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Stars: ✭ 71 (+44.9%)
UgmUbpa Graphics Mathematics
Stars: ✭ 178 (+263.27%)
Base64 Avx512Code for paper "Base64 encoding and decoding at almost the speed of a memory copy"
Stars: ✭ 158 (+222.45%)
memchrOptimized string search routines for Rust.
Stars: ✭ 474 (+867.35%)
Ncnnncnn is a high-performance neural network inference framework optimized for the mobile platform
Stars: ✭ 13,376 (+27197.96%)
lsp-dsp-libDSP library for signal processing
Stars: ✭ 37 (-24.49%)
ThermiteThermite SIMD: Melt your CPU
Stars: ✭ 141 (+187.76%)
processorA compiler, assembler, and processor.
Stars: ✭ 24 (-51.02%)
NsimdAgenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (+181.63%)
MippMIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX and AVX-512.
Stars: ✭ 253 (+416.33%)
FastapproxApproximate and vectorized versions of common mathematical functions
Stars: ✭ 128 (+161.22%)
Turbo-TransposeTranspose: SIMD Integer+Floating Point Compression Filter
Stars: ✭ 50 (+2.04%)
NnpackAcceleration package for neural networks on multi-core CPUs
Stars: ✭ 1,538 (+3038.78%)
Jsturbo.js - perform massive parallel computations in your browser with GPGPU.
Stars: ✭ 2,591 (+5187.76%)
Corrfunc⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Stars: ✭ 114 (+132.65%)
ctlMy variant of the C Template Library
Stars: ✭ 105 (+114.29%)
PackettracerThe SIMD-accelereted ray tracing in C# powered by Intel hardware intrinsic of .NET Core.
Stars: ✭ 109 (+122.45%)
SketchC++ Implementations of sketch data structures with SIMD Parallelism, including Python bindings
Stars: ✭ 96 (+95.92%)
patchmapA fast and memory efficient hashmap using sorting to resolve collisions
Stars: ✭ 41 (-16.33%)
MaskedvbyteFast decoder for VByte-compressed integers
Stars: ✭ 91 (+85.71%)
ReedsolomonReed-Solomon Erasure Code engine in Go, could more than 15GB/s per core
Stars: ✭ 203 (+314.29%)
DespacerC library to remove white space from strings as fast as possible
Stars: ✭ 90 (+83.67%)
go-left-rightA faster RWLock primitive in Go, 2-3 times faster than RWMutex. A Go implementation of concurrency control algorithm in paper <Left-Right - A Concurrency Control Technique with Wait-Free Population Oblivious Reads>
Stars: ✭ 42 (-14.29%)
Ozz AnimationOpen source c++ skeletal animation library and toolset
Stars: ✭ 1,250 (+2451.02%)
StreamvbyteFast integer compression in C using the StreamVByte codec
Stars: ✭ 195 (+297.96%)
WideA crate to help you go wide. By which I mean use SIMD stuff.
Stars: ✭ 72 (+46.94%)
UmesimdUME::SIMD A library for explicit simd vectorization.
Stars: ✭ 66 (+34.69%)
SimdjsonParsing gigabytes of JSON per second
Stars: ✭ 15,115 (+30746.94%)
SCNMathExtensionsMath extensions for SCNVector3, SCNQuaternion, SCNMatrix4
Stars: ✭ 32 (-34.69%)
Base64Base64 encoding / decoding with SIMD-support, also base64Url
Stars: ✭ 44 (-10.2%)
HLMLAuto-generated maths library for C and C++ based on HLSL/Cg
Stars: ✭ 23 (-53.06%)