sliceslice-rsA fast implementation of single-pattern substring search using SIMD acceleration.
Stars: ✭ 66 (+144.44%)
Md5 SimdAccelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Stars: ✭ 71 (+162.96%)
42 cheatsheetAlso referred to as "The C Man"
Stars: ✭ 204 (+655.56%)
Go CvComputer Vision package in pure Go taking advantage of SIMD acceleration
Stars: ✭ 66 (+144.44%)
QuickenshteinMaking the quickest and most memory efficient implementation of Levenshtein Distance with SIMD and Threading support
Stars: ✭ 204 (+655.56%)
Unisimd AssemblerSIMD macro assembler unified for ARM, MIPS, PPC and x86
Stars: ✭ 63 (+133.33%)
Fastnoise2Modular node based noise generation library using SIMD, C++17 and templates
Stars: ✭ 196 (+625.93%)
SimdeImplementations of SIMD instruction sets for systems which don't natively support them.
Stars: ✭ 1,012 (+3648.15%)
memchrOptimized string search routines for Rust.
Stars: ✭ 474 (+1655.56%)
VcSIMD Vector Classes for C++
Stars: ✭ 985 (+3548.15%)
LaserThe HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers
Stars: ✭ 191 (+607.41%)
Parallel XxhashCompute xxHash hash codes for 8 keys in parallel
Stars: ✭ 36 (+33.33%)
qHilbertqHilbert is a vectorized speedup of Hilbert curve generation using SIMD intrinsics
Stars: ✭ 22 (-18.52%)
LibsimdppPortable header-only C++ low level SIMD library
Stars: ✭ 914 (+3285.19%)
DecomposedCATransform3D manipulation made easy.
Stars: ✭ 184 (+581.48%)
DirectxmathDirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps
Stars: ✭ 859 (+3081.48%)
Turbo-TransposeTranspose: SIMD Integer+Floating Point Compression Filter
Stars: ✭ 50 (+85.19%)
Simdsetoperationstestbed for different SIMD implementations for set intersection and set union
Stars: ✭ 24 (-11.11%)
ComputelibraryThe Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
Stars: ✭ 2,123 (+7762.96%)
Cglm📽 Highly Optimized Graphics Math (glm) for C
Stars: ✭ 887 (+3185.19%)
heyoka.pyPython library for ODE integration via Taylor's method and LLVM
Stars: ✭ 45 (+66.67%)
CgmathA linear algebra and mathematics library for computer graphics.
Stars: ✭ 773 (+2762.96%)
CompactcnncascadeA binary library for very fast face detection using compact CNNs.
Stars: ✭ 152 (+462.96%)
PikkrJSON parser which picks up values directly without performing tokenization in Rust
Stars: ✭ 580 (+2048.15%)
aes-gcm-siv.NET Core 3.0 implementation of AES-GCM-SIV nonce misuse-resistant authenticated encryption
Stars: ✭ 22 (-18.52%)
IspcIntel SPMD Program Compiler
Stars: ✭ 1,924 (+7025.93%)
SimdjsonsharpC# bindings for lemire/simdjson (and full C# port)
Stars: ✭ 506 (+1774.07%)
Simd JsonRust port of simdjson
Stars: ✭ 499 (+1748.15%)
Compute EngineHighly optimized inference engine for Binarized Neural Networks
Stars: ✭ 138 (+411.11%)
Simd VisualiserA tool to graphically visualize SIMD code
Stars: ✭ 459 (+1600%)
Guided Missile SimulationGuided Missile, Radar and Infrared EOS Simulation Framework written in Fortran.
Stars: ✭ 33 (+22.22%)
Jpeg QuantsmoothJPEG artifacts removal based on quantization coefficients.
Stars: ✭ 134 (+396.3%)
StdarchRust's standard library vendor-specific APIs and run-time feature detection
Stars: ✭ 399 (+1377.78%)
RtmRealtime Math
Stars: ✭ 373 (+1281.48%)
ImpalaAn imperative and functional programming language
Stars: ✭ 118 (+337.04%)
VisionarayA C++-based, cross platform ray tracing library
Stars: ✭ 342 (+1166.67%)
hermesA Haskell library for fast, memory-efficient decoding of JSON documents using the simdjson C++ library
Stars: ✭ 37 (+37.04%)
SimdcompA simple C library for compressing lists of integers using binary packing
Stars: ✭ 331 (+1125.93%)
ThorinThe Higher-Order Intermediate Representation
Stars: ✭ 116 (+329.63%)
DatafuseDatafuse is a free Cloud-Native Analytics DBMS(Inspired by ClickHouse) implemented in Rust
Stars: ✭ 327 (+1111.11%)
SIMDArraySIMD enhanced Array operations
Stars: ✭ 123 (+355.56%)
Fastbase64SIMD-accelerated base64 codecs
Stars: ✭ 309 (+1044.44%)
Base64simdBase64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Stars: ✭ 115 (+325.93%)
ternary-logicSupport for ternary logic in SSE, XOP, AVX2 and x86 programs
Stars: ✭ 21 (-22.22%)
FastorA lightweight high performance tensor algebra framework for modern C++
Stars: ✭ 280 (+937.04%)
QreverseA small study in hardware accelerated AoS reversal
Stars: ✭ 97 (+259.26%)
GrapheneA thin layer of graphic data types
Stars: ✭ 268 (+892.59%)
Base64Base64 encoding / decoding with SIMD-support, also base64Url
Stars: ✭ 44 (+62.96%)
Amplifier.netAmplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPU/GPU, NVIDIA, AMD without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.
Stars: ✭ 92 (+240.74%)
runtimeAnyDSL Runtime Library
Stars: ✭ 17 (-37.04%)
psimdPortable 128-bit SIMD intrinsics
Stars: ✭ 48 (+77.78%)
Amplifier.NETAmplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPU/GPU, NVIDIA, AMD without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.
Stars: ✭ 142 (+425.93%)
processorA compiler, assembler, and processor.
Stars: ✭ 24 (-11.11%)
Jsturbo.js - perform massive parallel computations in your browser with GPGPU.
Stars: ✭ 2,591 (+9496.3%)
DespacerC library to remove white space from strings as fast as possible
Stars: ✭ 90 (+233.33%)