Ozz AnimationOpen source c++ skeletal animation library and toolset
Stars: ✭ 1,250 (+3278.38%)
StrA SIMD optimized fixed-length string class along with an adaptive hash table for fast searching
Stars: ✭ 60 (+62.16%)
NsimdAgenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (+272.97%)
MaskedvbyteFast decoder for VByte-compressed integers
Stars: ✭ 91 (+145.95%)
Quadray EngineRealtime raytracer using SIMD on ARM, MIPS, PPC and x86
Stars: ✭ 13 (-64.86%)
Ncnnncnn is a high-performance neural network inference framework optimized for the mobile platform
Stars: ✭ 13,376 (+36051.35%)
UmesimdUME::SIMD A library for explicit simd vectorization.
Stars: ✭ 66 (+78.38%)
StreamvbyteFast integer compression in C using the StreamVByte codec
Stars: ✭ 195 (+427.03%)
KfrFast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Stars: ✭ 985 (+2562.16%)
NnpackAcceleration package for neural networks on multi-core CPUs
Stars: ✭ 1,538 (+4056.76%)
SketchC++ Implementations of sketch data structures with SIMD Parallelism, including Python bindings
Stars: ✭ 96 (+159.46%)
EdgeExtreme-scale Discontinuous Galerkin Environment (EDGE)
Stars: ✭ 18 (-51.35%)
Base64 Avx512Code for paper "Base64 encoding and decoding at almost the speed of a memory copy"
Stars: ✭ 158 (+327.03%)
DespacerC library to remove white space from strings as fast as possible
Stars: ✭ 90 (+143.24%)
ReedsolomonReed-Solomon Erasure Code engine in Go, could more than 15GB/s per core
Stars: ✭ 203 (+448.65%)
WideA crate to help you go wide. By which I mean use SIMD stuff.
Stars: ✭ 72 (+94.59%)
ThermiteThermite SIMD: Melt your CPU
Stars: ✭ 141 (+281.08%)
Pocket TensorRun Keras models from a C++ application on embedded devices
Stars: ✭ 65 (+75.68%)
Jsturbo.js - perform massive parallel computations in your browser with GPGPU.
Stars: ✭ 2,591 (+6902.7%)
MongooseMinimalistic Vulkan engine for fast propotyping.
Stars: ✭ 41 (+10.81%)
FastapproxApproximate and vectorized versions of common mathematical functions
Stars: ✭ 128 (+245.95%)
XsimdC++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, NEON, AVX512)
Stars: ✭ 964 (+2505.41%)
SimdjsonParsing gigabytes of JSON per second
Stars: ✭ 15,115 (+40751.35%)
Compressed VecSIMD Floating point and integer compressed vector library
Stars: ✭ 25 (-32.43%)
Corrfunc⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Stars: ✭ 114 (+208.11%)
QreverseA small study in hardware accelerated AoS reversal
Stars: ✭ 97 (+162.16%)
Cglm📽 Highly Optimized Graphics Math (glm) for C
Stars: ✭ 887 (+2297.3%)
ComputelibraryThe Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
Stars: ✭ 2,123 (+5637.84%)
Amplifier.netAmplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPU/GPU, NVIDIA, AMD without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.
Stars: ✭ 92 (+148.65%)
42 cheatsheetAlso referred to as "The C Man"
Stars: ✭ 204 (+451.35%)
FasterSIMD for humans
Stars: ✭ 1,304 (+3424.32%)
CompactcnncascadeA binary library for very fast face detection using compact CNNs.
Stars: ✭ 152 (+310.81%)
SimdC++ image processing and machine learning library with using of SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX(Altivec) and VSX(Power7), NEON for ARM.
Stars: ✭ 1,263 (+3313.51%)
DictionaryHigh-performance dictionary coding
Stars: ✭ 77 (+108.11%)
IspcIntel SPMD Program Compiler
Stars: ✭ 1,924 (+5100%)
Md5 SimdAccelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Stars: ✭ 71 (+91.89%)
Fastnoise2Modular node based noise generation library using SIMD, C++17 and templates
Stars: ✭ 196 (+429.73%)
Go CvComputer Vision package in pure Go taking advantage of SIMD acceleration
Stars: ✭ 66 (+78.38%)
Compute EngineHighly optimized inference engine for Binarized Neural Networks
Stars: ✭ 138 (+272.97%)
Unisimd AssemblerSIMD macro assembler unified for ARM, MIPS, PPC and x86
Stars: ✭ 63 (+70.27%)
ternary-logicSupport for ternary logic in SSE, XOP, AVX2 and x86 programs
Stars: ✭ 21 (-43.24%)
SimdeImplementations of SIMD instruction sets for systems which don't natively support them.
Stars: ✭ 1,012 (+2635.14%)
Jpeg QuantsmoothJPEG artifacts removal based on quantization coefficients.
Stars: ✭ 134 (+262.16%)
VcSIMD Vector Classes for C++
Stars: ✭ 985 (+2562.16%)
LaserThe HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers
Stars: ✭ 191 (+416.22%)
Parallel XxhashCompute xxHash hash codes for 8 keys in parallel
Stars: ✭ 36 (-2.7%)
ImpalaAn imperative and functional programming language
Stars: ✭ 118 (+218.92%)
LibsimdppPortable header-only C++ low level SIMD library
Stars: ✭ 914 (+2370.27%)
Hh SuiteRemote protein homology detection suite.
Stars: ✭ 230 (+521.62%)
DirectxmathDirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps
Stars: ✭ 859 (+2221.62%)
ThorinThe Higher-Order Intermediate Representation
Stars: ✭ 116 (+213.51%)
Simdsetoperationstestbed for different SIMD implementations for set intersection and set union
Stars: ✭ 24 (-35.14%)
DecomposedCATransform3D manipulation made easy.
Stars: ✭ 184 (+397.3%)
Base64simdBase64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Stars: ✭ 115 (+210.81%)
lsp-dsp-libDSP library for signal processing
Stars: ✭ 37 (+0%)
MippMIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX and AVX-512.
Stars: ✭ 253 (+583.78%)
UgmUbpa Graphics Mathematics
Stars: ✭ 178 (+381.08%)
PackettracerThe SIMD-accelereted ray tracing in C# powered by Intel hardware intrinsic of .NET Core.
Stars: ✭ 109 (+194.59%)