MongooseMinimalistic Vulkan engine for fast propotyping.
Stars: ✭ 41 (-16.33%)
KfrFast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Stars: ✭ 985 (+1910.2%)
Base64 Avx512Code for paper "Base64 encoding and decoding at almost the speed of a memory copy"
Stars: ✭ 158 (+222.45%)
XsimdC++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, NEON, AVX512)
Stars: ✭ 964 (+1867.35%)
memchrOptimized string search routines for Rust.
Stars: ✭ 474 (+867.35%)
Quadray EngineRealtime raytracer using SIMD on ARM, MIPS, PPC and x86
Stars: ✭ 13 (-73.47%)
Ncnnncnn is a high-performance neural network inference framework optimized for the mobile platform
Stars: ✭ 13,376 (+27197.96%)
Compressed VecSIMD Floating point and integer compressed vector library
Stars: ✭ 25 (-48.98%)
lsp-dsp-libDSP library for signal processing
Stars: ✭ 37 (-24.49%)
EdgeExtreme-scale Discontinuous Galerkin Environment (EDGE)
Stars: ✭ 18 (-63.27%)
ThermiteThermite SIMD: Melt your CPU
Stars: ✭ 141 (+187.76%)
XnnpackHigh-efficiency floating-point neural network inference operators for mobile, server, and Web
Stars: ✭ 808 (+1548.98%)
processorA compiler, assembler, and processor.
Stars: ✭ 24 (-51.02%)
LinqfasterLinq-like extension functions for Arrays, Span<T>, and List<T> that are faster and allocate less.
Stars: ✭ 615 (+1155.1%)
NsimdAgenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (+181.63%)
MippMIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX and AVX-512.
Stars: ✭ 253 (+416.33%)
LibxsmmLibrary for specialized dense and sparse matrix operations, and deep learning primitives.
Stars: ✭ 518 (+957.14%)
FastapproxApproximate and vectorized versions of common mathematical functions
Stars: ✭ 128 (+161.22%)
JohnJohn the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs
Stars: ✭ 5,656 (+11442.86%)
Turbo-TransposeTranspose: SIMD Integer+Floating Point Compression Filter
Stars: ✭ 50 (+2.04%)
KleinP(R*_{3, 0, 1}) specialized SIMD Geometric Algebra Library
Stars: ✭ 463 (+844.9%)
NnpackAcceleration package for neural networks on multi-core CPUs
Stars: ✭ 1,538 (+3038.78%)
PysimdjsonPython bindings for the simdjson project.
Stars: ✭ 432 (+781.63%)
Jsturbo.js - perform massive parallel computations in your browser with GPGPU.
Stars: ✭ 2,591 (+5187.76%)
Glam RsA simple and fast linear algebra library for games and graphics
Stars: ✭ 406 (+728.57%)
Corrfunc⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Stars: ✭ 114 (+132.65%)
SeqanSeqAn's official repository.
Stars: ✭ 386 (+687.76%)
ctlMy variant of the C Template Library
Stars: ✭ 105 (+114.29%)
SleefSIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Stars: ✭ 353 (+620.41%)
PackettracerThe SIMD-accelereted ray tracing in C# powered by Intel hardware intrinsic of .NET Core.
Stars: ✭ 109 (+122.45%)
Mangomango fun framework
Stars: ✭ 343 (+600%)
SketchC++ Implementations of sketch data structures with SIMD Parallelism, including Python bindings
Stars: ✭ 96 (+95.92%)
Sse2neonA translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation
Stars: ✭ 316 (+544.9%)
patchmapA fast and memory efficient hashmap using sorting to resolve collisions
Stars: ✭ 41 (-16.33%)
HighwayPerformance-portable, length-agnostic SIMD with runtime dispatch
Stars: ✭ 301 (+514.29%)
MaskedvbyteFast decoder for VByte-compressed integers
Stars: ✭ 91 (+85.71%)
TsimdFundamental C++ SIMD types for Intel CPUs (sse, avx, avx2, avx512)
Stars: ✭ 290 (+491.84%)
ReedsolomonReed-Solomon Erasure Code engine in Go, could more than 15GB/s per core
Stars: ✭ 203 (+314.29%)
Std Simdstd::experimental::simd for GCC [ISO/IEC TS 19570:2018]
Stars: ✭ 275 (+461.22%)
DespacerC library to remove white space from strings as fast as possible
Stars: ✭ 90 (+83.67%)
BitmagicBitMagic Library
Stars: ✭ 263 (+436.73%)
go-left-rightA faster RWLock primitive in Go, 2-3 times faster than RWMutex. A Go implementation of concurrency control algorithm in paper <Left-Right - A Concurrency Control Technique with Wait-Free Population Oblivious Reads>
Stars: ✭ 42 (-14.29%)
simdutfUnicode routines (UTF8, UTF16): billions of characters per second.
Stars: ✭ 108 (+120.41%)
Ozz AnimationOpen source c++ skeletal animation library and toolset
Stars: ✭ 1,250 (+2451.02%)
awesome-simdA curated list of awesome SIMD frameworks, libraries and software
Stars: ✭ 39 (-20.41%)
StreamvbyteFast integer compression in C using the StreamVByte codec
Stars: ✭ 195 (+297.96%)
WideA crate to help you go wide. By which I mean use SIMD stuff.
Stars: ✭ 72 (+46.94%)
block-alignerSIMD-accelerated library for computing global and X-drop affine gap penalty sequence-to-sequence or sequence-to-profile alignments using an adaptive block-based algorithm.
Stars: ✭ 58 (+18.37%)
Pocket TensorRun Keras models from a C++ application on embedded devices
Stars: ✭ 65 (+32.65%)
UmesimdUME::SIMD A library for explicit simd vectorization.
Stars: ✭ 66 (+34.69%)
SCNMathExtensionsMath extensions for SCNVector3, SCNQuaternion, SCNMatrix4
Stars: ✭ 32 (-34.69%)
Base64Base64 encoding / decoding with SIMD-support, also base64Url
Stars: ✭ 44 (-10.2%)
HLMLAuto-generated maths library for C and C++ based on HLSL/Cg
Stars: ✭ 23 (-53.06%)
Guided Missile SimulationGuided Missile, Radar and Infrared EOS Simulation Framework written in Fortran.
Stars: ✭ 33 (-32.65%)