SimdjsonParsing gigabytes of JSON per second
Stars: ✭ 15,115 (+55881.48%)
ThermiteThermite SIMD: Melt your CPU
Stars: ✭ 141 (+422.22%)
ReedsolomonReed-Solomon Erasure Code engine in Go, could more than 15GB/s per core
Stars: ✭ 203 (+651.85%)
Corrfunc⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Stars: ✭ 114 (+322.22%)
uwufastest text uwuifier in the west
Stars: ✭ 1,193 (+4318.52%)
Base64 Avx512Code for paper "Base64 encoding and decoding at almost the speed of a memory copy"
Stars: ✭ 158 (+485.19%)
oxoranyobfuscated any constant encryption in compile time on any platform
Stars: ✭ 155 (+474.07%)
FastapproxApproximate and vectorized versions of common mathematical functions
Stars: ✭ 128 (+374.07%)
lsp-dsp-libDSP library for signal processing
Stars: ✭ 37 (+37.04%)
SketchC++ Implementations of sketch data structures with SIMD Parallelism, including Python bindings
Stars: ✭ 96 (+255.56%)
HLMLAuto-generated maths library for C and C++ based on HLSL/Cg
Stars: ✭ 23 (-14.81%)
StreamvbyteFast integer compression in C using the StreamVByte codec
Stars: ✭ 195 (+622.22%)
hpcLearning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )
Stars: ✭ 39 (+44.44%)
UgmUbpa Graphics Mathematics
Stars: ✭ 178 (+559.26%)
multiversionEasy function multiversioning for Rust
Stars: ✭ 152 (+462.96%)
Ncnnncnn is a high-performance neural network inference framework optimized for the mobile platform
Stars: ✭ 13,376 (+49440.74%)
penguinVSimple and fast C++ image processing library with focus on heterogeneous systems
Stars: ✭ 110 (+307.41%)
NsimdAgenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (+411.11%)
NnpackAcceleration package for neural networks on multi-core CPUs
Stars: ✭ 1,538 (+5596.3%)
SCNMathExtensionsMath extensions for SCNVector3, SCNQuaternion, SCNMatrix4
Stars: ✭ 32 (+18.52%)
PackettracerThe SIMD-accelereted ray tracing in C# powered by Intel hardware intrinsic of .NET Core.
Stars: ✭ 109 (+303.7%)
MippMIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX and AVX-512.
Stars: ✭ 253 (+837.04%)
Hh SuiteRemote protein homology detection suite.
Stars: ✭ 230 (+751.85%)
Amplifier.netAmplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPU/GPU, NVIDIA, AMD without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.
Stars: ✭ 92 (+240.74%)
sliceslice-rsA fast implementation of single-pattern substring search using SIMD acceleration.
Stars: ✭ 66 (+144.44%)
42 cheatsheetAlso referred to as "The C Man"
Stars: ✭ 204 (+655.56%)
QuickenshteinMaking the quickest and most memory efficient implementation of Levenshtein Distance with SIMD and Threading support
Stars: ✭ 204 (+655.56%)
Fastnoise2Modular node based noise generation library using SIMD, C++17 and templates
Stars: ✭ 196 (+625.93%)
memchrOptimized string search routines for Rust.
Stars: ✭ 474 (+1655.56%)
LaserThe HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers
Stars: ✭ 191 (+607.41%)
qHilbertqHilbert is a vectorized speedup of Hilbert curve generation using SIMD intrinsics
Stars: ✭ 22 (-18.52%)
DecomposedCATransform3D manipulation made easy.
Stars: ✭ 184 (+581.48%)
Turbo-TransposeTranspose: SIMD Integer+Floating Point Compression Filter
Stars: ✭ 50 (+85.19%)
ComputelibraryThe Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
Stars: ✭ 2,123 (+7762.96%)
heyoka.pyPython library for ODE integration via Taylor's method and LLVM
Stars: ✭ 45 (+66.67%)
CompactcnncascadeA binary library for very fast face detection using compact CNNs.
Stars: ✭ 152 (+462.96%)
aes-gcm-siv.NET Core 3.0 implementation of AES-GCM-SIV nonce misuse-resistant authenticated encryption
Stars: ✭ 22 (-18.52%)
IspcIntel SPMD Program Compiler
Stars: ✭ 1,924 (+7025.93%)
Compute EngineHighly optimized inference engine for Binarized Neural Networks
Stars: ✭ 138 (+411.11%)
Guided Missile SimulationGuided Missile, Radar and Infrared EOS Simulation Framework written in Fortran.
Stars: ✭ 33 (+22.22%)
Jpeg QuantsmoothJPEG artifacts removal based on quantization coefficients.
Stars: ✭ 134 (+396.3%)
ImpalaAn imperative and functional programming language
Stars: ✭ 118 (+337.04%)
hermesA Haskell library for fast, memory-efficient decoding of JSON documents using the simdjson C++ library
Stars: ✭ 37 (+37.04%)
ThorinThe Higher-Order Intermediate Representation
Stars: ✭ 116 (+329.63%)
SIMDArraySIMD enhanced Array operations
Stars: ✭ 123 (+355.56%)
Base64simdBase64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Stars: ✭ 115 (+325.93%)
ternary-logicSupport for ternary logic in SSE, XOP, AVX2 and x86 programs
Stars: ✭ 21 (-22.22%)
QreverseA small study in hardware accelerated AoS reversal
Stars: ✭ 97 (+259.26%)
Base64Base64 encoding / decoding with SIMD-support, also base64Url
Stars: ✭ 44 (+62.96%)
runtimeAnyDSL Runtime Library
Stars: ✭ 17 (-37.04%)
psimdPortable 128-bit SIMD intrinsics
Stars: ✭ 48 (+77.78%)
Amplifier.NETAmplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPU/GPU, NVIDIA, AMD without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.
Stars: ✭ 142 (+425.93%)
processorA compiler, assembler, and processor.
Stars: ✭ 24 (-11.11%)
Jsturbo.js - perform massive parallel computations in your browser with GPGPU.
Stars: ✭ 2,591 (+9496.3%)