Fastnoise2
Modular node based noise generation library using SIMD, C++17 and templates
Stars: ✭ 196 (+317.02%)
Laser
The HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers
Stars: ✭ 191 (+306.38%)
generic-simd
Generic SIMD abstractions for Rust.
Stars: ✭ 45 (-4.26%)
FFmpegPlayer
Simple FFmpeg video player
Stars: ✭ 72 (+53.19%)
xor
Move to: https://github.com/templexxx/xorsimd
Stars: ✭ 27 (-42.55%)
Decomposed
CATransform3D manipulation made easy.
Stars: ✭ 184 (+291.49%)
SNPRelate
R package: parallel computing toolset for relatedness and principal component analysis of SNP data (Development Version)
Stars: ✭ 74 (+57.45%)
Reedsolomon
Reed-Solomon Erasure Code engine in Go, capable of more than 15 GB/s per core
Stars: ✭ 203 (+331.91%)
heyoka
C++ library for ODE integration via Taylor's method and LLVM
Stars: ✭ 151 (+221.28%)
Streamvbyte
Fast integer compression in C using the StreamVByte codec
Stars: ✭ 195 (+314.89%)
mir-glas
[Experimental] LLVM-accelerated Generic Linear Algebra Subprograms
Stars: ✭ 99 (+110.64%)
T13x
An extended version of the T0x multithreaded cores, with a custom general-purpose parametrized SIMD/MIMD vector coprocessor and support for 3-5 way superscalar execution. The core is pin-to-pin compatible with the RISCY cores from PULP.
Stars: ✭ 28 (-40.43%)
runtime
AnyDSL Runtime Library
Stars: ✭ 17 (-63.83%)
Ugm
Ubpa Graphics Mathematics
Stars: ✭ 178 (+278.72%)
Computelibrary
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
Stars: ✭ 2,123 (+4417.02%)
dcurl
Hardware-accelerated Multi-threaded IOTA PoW, drop-in replacement for ccurl
Stars: ✭ 39 (-17.02%)
Compactcnncascade
A binary library for very fast face detection using compact CNNs.
Stars: ✭ 152 (+223.4%)
Ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
Stars: ✭ 13,376 (+28359.57%)
psimd
Portable 128-bit SIMD intrinsics
Stars: ✭ 48 (+2.13%)
Ispc
Intel SPMD Program Compiler
Stars: ✭ 1,924 (+3993.62%)
Thermite
Thermite SIMD: Melt your CPU
Stars: ✭ 141 (+200%)
optimath
A #[no_std] LinAlg library
Stars: ✭ 47 (+0%)
wasm
fast wasm modules
Stars: ✭ 37 (-21.28%)
qHilbert
qHilbert is a vectorized speedup of Hilbert curve generation using SIMD intrinsics
Stars: ✭ 22 (-53.19%)
Compute Engine
Highly optimized inference engine for Binarized Neural Networks
Stars: ✭ 138 (+193.62%)
penguinV
Simple and fast C++ image processing library with focus on heterogeneous systems
Stars: ✭ 110 (+134.04%)
Jpeg Quantsmooth
JPEG artifacts removal based on quantization coefficients.
Stars: ✭ 134 (+185.11%)
Fastapprox
Approximate and vectorized versions of common mathematical functions
Stars: ✭ 128 (+172.34%)
md5-optimisation
The fastest MD5 implementation using x86 assembly
Stars: ✭ 45 (-4.26%)
SIMDArray
SIMD enhanced Array operations
Stars: ✭ 123 (+161.7%)
Impala
An imperative and functional programming language
Stars: ✭ 118 (+151.06%)
Nnpack
Acceleration package for neural networks on multi-core CPUs
Stars: ✭ 1,538 (+3172.34%)
ADbHash
Really fast C++ hash table
Stars: ✭ 12 (-74.47%)
Amplifier.NET
Amplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPU/GPU, NVIDIA, AMD without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.
Stars: ✭ 142 (+202.13%)
Thorin
The Higher-Order Intermediate Representation
Stars: ✭ 116 (+146.81%)
yask
YASK (Yet Another Stencil Kit): a domain-specific language and framework to create high-performance stencil code for implementing finite-difference methods and similar applications.
Stars: ✭ 81 (+72.34%)
zig-gamedev
Building a game development ecosystem for @ziglang!
Stars: ✭ 1,059 (+2153.19%)
Quickenshtein
Making the quickest and most memory efficient implementation of Levenshtein Distance with SIMD and Threading support
Stars: ✭ 204 (+334.04%)
Packettracer
SIMD-accelerated ray tracing in C#, powered by the Intel hardware intrinsics of .NET Core.
Stars: ✭ 109 (+131.91%)
hpc
Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc.)
Stars: ✭ 39 (-17.02%)
Qreverse
A small study in hardware accelerated AoS reversal
Stars: ✭ 97 (+106.38%)
Sketch
C++ Implementations of sketch data structures with SIMD Parallelism, including Python bindings
Stars: ✭ 96 (+104.26%)
glm
OpenGL Mathematics (GLM)
Stars: ✭ 6,667 (+14085.11%)
heyoka.py
Python library for ODE integration via Taylor's method and LLVM
Stars: ✭ 45 (-4.26%)
Amplifier.net
Amplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPU/GPU, NVIDIA, AMD without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.
Stars: ✭ 92 (+95.74%)
Maskedvbyte
Fast decoder for VByte-compressed integers
Stars: ✭ 91 (+93.62%)
Faster
SIMD for humans
Stars: ✭ 1,304 (+2674.47%)
ndzip
A High-Throughput Parallel Lossless Compressor for Scientific Data
Stars: ✭ 19 (-59.57%)
SCNMathExtensions
Math extensions for SCNVector3, SCNQuaternion, SCNMatrix4
Stars: ✭ 32 (-31.91%)
Despacer
C library to remove white space from strings as fast as possible
Stars: ✭ 90 (+91.49%)
Ozz Animation
Open source C++ skeletal animation library and toolset
Stars: ✭ 1,250 (+2559.57%)
Base64
Base64 encoding / decoding with SIMD-support, also base64Url
Stars: ✭ 44 (-6.38%)
Dictionary
High-performance dictionary coding
Stars: ✭ 77 (+63.83%)
processor
A compiler, assembler, and processor.
Stars: ✭ 24 (-48.94%)
Wide
A crate to help you go wide. By which I mean use SIMD stuff.
Stars: ✭ 72 (+53.19%)
Go Cv
Computer Vision package in pure Go taking advantage of SIMD acceleration
Stars: ✭ 66 (+40.43%)
tbslas
A parallel, fast solver for the scalar advection-diffusion and the incompressible Navier-Stokes equations based on a semi-Lagrangian/volume-integral method.
Stars: ✭ 21 (-55.32%)