articThe AlteRnaTive Impala Compiler
Stars: ✭ 16 (-27.27%)
XsimdC++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, NEON, AVX512)
Stars: ✭ 964 (+4281.82%)
ThorinThe Higher-Order Intermediate Representation
Stars: ✭ 116 (+427.27%)
FastapproxApproximate and vectorized versions of common mathematical functions
Stars: ✭ 128 (+481.82%)
SleefSIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Stars: ✭ 353 (+1504.55%)
UmesimdUME::SIMD A library for explicit simd vectorization.
Stars: ✭ 66 (+200%)
ImpalaAn imperative and functional programming language
Stars: ✭ 118 (+436.36%)
VcSIMD Vector Classes for C++
Stars: ✭ 985 (+4377.27%)
ultra-sortDSL for SIMD Sorting on AVX2 & AVX512
Stars: ✭ 29 (+31.82%)
runtimeAnyDSL Runtime Library
Stars: ✭ 17 (-22.73%)
SimdeImplementations of SIMD instruction sets for systems which don't natively support them.
Stars: ✭ 1,012 (+4500%)
HLMLAuto-generated maths library for C and C++ based on HLSL/Cg
Stars: ✭ 23 (+4.55%)
42 cheatsheetAlso referred to as "The C Man"
Stars: ✭ 204 (+827.27%)
memchrOptimized string search routines for Rust.
Stars: ✭ 474 (+2054.55%)
Fastnoise2Modular node based noise generation library using SIMD, C++17 and templates
Stars: ✭ 196 (+790.91%)
LaserThe HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers
Stars: ✭ 191 (+768.18%)
Turbo-TransposeTranspose: SIMD Integer+Floating Point Compression Filter
Stars: ✭ 50 (+127.27%)
DecomposedCATransform3D manipulation made easy.
Stars: ✭ 184 (+736.36%)
ComputelibraryThe Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
Stars: ✭ 2,123 (+9550%)
aes-gcm-siv.NET Core 3.0 implementation of AES-GCM-SIV nonce misuse-resistant authenticated encryption
Stars: ✭ 22 (+0%)
CompactcnncascadeA binary library for very fast face detection using compact CNNs.
Stars: ✭ 152 (+590.91%)
IspcIntel SPMD Program Compiler
Stars: ✭ 1,924 (+8645.45%)
recursion-and-dynamic-programmingJulia and Python recursion algorithm, fractal geometry and dynamic programming applications including Edit Distance, Knapsack (Multiple Choice), Stock Trading, Pythagorean Tree, Koch Snowflake, Jerusalem Cross, Sierpiński Carpet, Hilbert Curve, Pascal Triangle, Prime Factorization, Palindrome, Egg Drop, Coin Change, Hanoi Tower, Cantor Set, Fibo…
Stars: ✭ 37 (+68.18%)
ReedsolomonReed-Solomon Erasure Code engine in Go, could more than 15GB/s per core
Stars: ✭ 203 (+822.73%)
fasterRasterFaster raster processing using GRASS GIS
Stars: ✭ 18 (-18.18%)
StreamvbyteFast integer compression in C using the StreamVByte codec
Stars: ✭ 195 (+786.36%)
uwufastest text uwuifier in the west
Stars: ✭ 1,193 (+5322.73%)
SimdjsonParsing gigabytes of JSON per second
Stars: ✭ 15,115 (+68604.55%)
Amplifier.NETAmplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPU/GPU, NVIDIA, AMD without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.
Stars: ✭ 142 (+545.45%)
UgmUbpa Graphics Mathematics
Stars: ✭ 178 (+709.09%)
multiversionEasy function multiversioning for Rust
Stars: ✭ 152 (+590.91%)
Base64 Avx512Code for paper "Base64 encoding and decoding at almost the speed of a memory copy"
Stars: ✭ 158 (+618.18%)
SCNMathExtensionsMath extensions for SCNVector3, SCNQuaternion, SCNMatrix4
Stars: ✭ 32 (+45.45%)
Ncnnncnn is a high-performance neural network inference framework optimized for the mobile platform
Stars: ✭ 13,376 (+60700%)
KuiBaDBAnother OLAP database
Stars: ✭ 297 (+1250%)
QuickenshteinMaking the quickest and most memory efficient implementation of Levenshtein Distance with SIMD and Threading support
Stars: ✭ 204 (+827.27%)
Base64Base64 encoding / decoding with SIMD-support, also base64Url
Stars: ✭ 44 (+100%)
ThermiteThermite SIMD: Melt your CPU
Stars: ✭ 141 (+540.91%)
Compute EngineHighly optimized inference engine for Binarized Neural Networks
Stars: ✭ 138 (+527.27%)
NsimdAgenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (+527.27%)
Jpeg QuantsmoothJPEG artifacts removal based on quantization coefficients.
Stars: ✭ 134 (+509.09%)
nQuantCppnQuantCpp includes top 6 color quantization algorithms for visual c++ producing high quality optimized images.
Stars: ✭ 83 (+277.27%)
hermesA Haskell library for fast, memory-efficient decoding of JSON documents using the simdjson C++ library
Stars: ✭ 37 (+68.18%)
lsp-dsp-libDSP library for signal processing
Stars: ✭ 37 (+68.18%)
NnpackAcceleration package for neural networks on multi-core CPUs
Stars: ✭ 1,538 (+6890.91%)
Corrfunc⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Stars: ✭ 114 (+418.18%)
hpcLearning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )
Stars: ✭ 39 (+77.27%)
processorA compiler, assembler, and processor.
Stars: ✭ 24 (+9.09%)
ternary-logicSupport for ternary logic in SSE, XOP, AVX2 and x86 programs
Stars: ✭ 21 (-4.55%)
Base64simdBase64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Stars: ✭ 115 (+422.73%)
PackettracerThe SIMD-accelereted ray tracing in C# powered by Intel hardware intrinsic of .NET Core.
Stars: ✭ 109 (+395.45%)
MippMIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX and AVX-512.
Stars: ✭ 253 (+1050%)
QreverseA small study in hardware accelerated AoS reversal
Stars: ✭ 97 (+340.91%)