heyoka
C++ library for ODE integration via Taylor's method and LLVM
Stars: ✭ 151 (+1158.33%)
Mipp
MIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX and AVX-512.
Stars: ✭ 253 (+2008.33%)
Js
turbo.js - perform massively parallel computations in your browser with GPGPU.
Stars: ✭ 2,591 (+21491.67%)
AlgoDaily
just for fun
Stars: ✭ 118 (+883.33%)
mir-glas
[Experimental] LLVM-accelerated Generic Linear Algebra Subprograms
Stars: ✭ 99 (+725%)
Reedsolomon
Reed-Solomon Erasure Code engine in Go, could reach more than 15GB/s per core
Stars: ✭ 203 (+1591.67%)
Data-Structures
Algorithmic Problems Solutions -- hash table code featured in geeksforgeeks
Stars: ✭ 44 (+266.67%)
Streamvbyte
Fast integer compression in C using the StreamVByte codec
Stars: ✭ 195 (+1525%)
runtime
AnyDSL Runtime Library
Stars: ✭ 17 (+41.67%)
Simdjson
Parsing gigabytes of JSON per second
Stars: ✭ 15,115 (+125858.33%)
tbslas
A parallel, fast solver for the scalar advection-diffusion and the incompressible Navier-Stokes equations, based on a semi-Lagrangian/volume-integral method.
Stars: ✭ 21 (+75%)
Ugm
Ubpa Graphics Mathematics
Stars: ✭ 178 (+1383.33%)
psimd
Portable 128-bit SIMD intrinsics
Stars: ✭ 48 (+300%)
Base64 Avx512
Code for the paper "Base64 encoding and decoding at almost the speed of a memory copy"
Stars: ✭ 158 (+1216.67%)
wasm
fast wasm modules
Stars: ✭ 37 (+208.33%)
Ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
Stars: ✭ 13,376 (+111366.67%)
penguinV
Simple and fast C++ image processing library with a focus on heterogeneous systems
Stars: ✭ 110 (+816.67%)
Thermite
Thermite SIMD: Melt your CPU
Stars: ✭ 141 (+1075%)
ndzip
A High-Throughput Parallel Lossless Compressor for Scientific Data
Stars: ✭ 19 (+58.33%)
Nsimd
Agenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (+1050%)
Amplifier.NET
Amplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPU/GPU, NVIDIA, AMD without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.
Stars: ✭ 142 (+1083.33%)
Fastapprox
Approximate and vectorized versions of common mathematical functions
Stars: ✭ 128 (+966.67%)
solidity-utils
solidity utils to make your life easier
Stars: ✭ 16 (+33.33%)
Nnpack
Acceleration package for neural networks on multi-core CPUs
Stars: ✭ 1,538 (+12716.67%)
Harbol
Harbol is a collection of data structure and miscellaneous libraries, similar in nature to C++'s Boost, STL, and GNOME's GLib
Stars: ✭ 18 (+50%)
Corrfunc
⚡️⚡️⚡️ Blazing fast correlation functions on the CPU.
Stars: ✭ 114 (+850%)
T13x
An extended version of the T0x multithreaded cores, with a custom general-purpose parametrized SIMD/MIMD vector coprocessor and support for 3-5 way superscalar execution. The core is pin-to-pin compatible with the RISCY cores from PULP.
Stars: ✭ 28 (+133.33%)
Packettracer
SIMD-accelerated ray tracing in C#, powered by the Intel hardware intrinsics of .NET Core.
Stars: ✭ 109 (+808.33%)
heyoka.py
Python library for ODE integration via Taylor's method and LLVM
Stars: ✭ 45 (+275%)
simdutf8
SIMD-accelerated UTF-8 validation for Rust.
Stars: ✭ 426 (+3450%)
Maskedvbyte
Fast decoder for VByte-compressed integers
Stars: ✭ 91 (+658.33%)
SCNMathExtensions
Math extensions for SCNVector3, SCNQuaternion, SCNMatrix4
Stars: ✭ 32 (+166.67%)
Despacer
C library to remove white space from strings as fast as possible
Stars: ✭ 90 (+650%)
pybase64
Fast Base64 encoding/decoding in Python
Stars: ✭ 84 (+600%)
Ozz Animation
Open source C++ skeletal animation library and toolset
Stars: ✭ 1,250 (+10316.67%)
Base64
Base64 encoding / decoding with SIMD support, also base64Url
Stars: ✭ 44 (+266.67%)
Wide
A crate to help you go wide. By which I mean use SIMD stuff.
Stars: ✭ 72 (+500%)
fecpp
Forward error correction with SIMD optimizations
Stars: ✭ 79 (+558.33%)
Umesimd
UME::SIMD: a library for explicit SIMD vectorization.
Stars: ✭ 66 (+450%)
sliceslice-rs
A fast implementation of single-pattern substring search using SIMD acceleration.
Stars: ✭ 66 (+450%)
Pocket Tensor
Run Keras models from a C++ application on embedded devices
Stars: ✭ 65 (+441.67%)
utf8
Fast UTF-8 validation with range algorithm (NEON+SSE4+AVX2)
Stars: ✭ 60 (+400%)
Str
A SIMD-optimized fixed-length string class along with an adaptive hash table for fast searching
Stars: ✭ 60 (+400%)
memchr
Optimized string search routines for Rust.
Stars: ✭ 474 (+3850%)
Mongoose
Minimalistic Vulkan engine for fast prototyping.
Stars: ✭ 41 (+241.67%)
Singeli
High-level interface for low-level programming
Stars: ✭ 31 (+158.33%)
Kfr
Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Stars: ✭ 985 (+8108.33%)
Turbo-Transpose
Transpose: SIMD Integer+Floating Point Compression Filter
Stars: ✭ 50 (+316.67%)
Xsimd
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, NEON, AVX512)
Stars: ✭ 964 (+7933.33%)
frp
FRP: Fast Random Projections
Stars: ✭ 40 (+233.33%)
patchmap
A fast and memory-efficient hashmap using sorting to resolve collisions
Stars: ✭ 41 (+241.67%)
SIMDxorshift
Fast random number generators: Vectorized (SIMD) version of xorshift128+
Stars: ✭ 84 (+600%)
generic-simd
Generic SIMD abstractions for Rust.
Stars: ✭ 45 (+275%)
simdjson-rs
Rust version of lemire's SimdJson
Stars: ✭ 18 (+50%)
ultra-sort
DSL for SIMD Sorting on AVX2 & AVX512
Stars: ✭ 29 (+141.67%)