recursion-and-dynamic-programmingJulia and Python recursion algorithm, fractal geometry and dynamic programming applications including Edit Distance, Knapsack (Multiple Choice), Stock Trading, Pythagorean Tree, Koch Snowflake, Jerusalem Cross, Sierpiński Carpet, Hilbert Curve, Pascal Triangle, Prime Factorization, Palindrome, Egg Drop, Coin Change, Hanoi Tower, Cantor Set, Fibo…
Stars: ✭ 37 (+68.18%)
processorA compiler, assembler, and processor.
Stars: ✭ 24 (+9.09%)
ternary-logicSupport for ternary logic in SSE, XOP, AVX2 and x86 programs
Stars: ✭ 21 (-4.55%)
Base64simdBase64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Stars: ✭ 115 (+422.73%)
Fastbase64SIMD-accelerated base64 codecs
Stars: ✭ 309 (+1304.55%)
ReedsolomonReed-Solomon Erasure Code engine in Go, could more than 15GB/s per core
Stars: ✭ 203 (+822.73%)
Pocket TensorRun Keras models from a C++ application on embedded devices
Stars: ✭ 65 (+195.45%)
fasterRasterFaster raster processing using GRASS GIS
Stars: ✭ 18 (-18.18%)
StreamvbyteFast integer compression in C using the StreamVByte codec
Stars: ✭ 195 (+786.36%)
uwufastest text uwuifier in the west
Stars: ✭ 1,193 (+5322.73%)
KfrFast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Stars: ✭ 985 (+4377.27%)
SimdjsonParsing gigabytes of JSON per second
Stars: ✭ 15,115 (+68604.55%)
Amplifier.NETAmplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPU/GPU, NVIDIA, AMD without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.
Stars: ✭ 142 (+545.45%)
Quadray EngineRealtime raytracer using SIMD on ARM, MIPS, PPC and x86
Stars: ✭ 13 (-40.91%)
UgmUbpa Graphics Mathematics
Stars: ✭ 178 (+709.09%)
Compressed VecSIMD Floating point and integer compressed vector library
Stars: ✭ 25 (+13.64%)
multiversionEasy function multiversioning for Rust
Stars: ✭ 152 (+590.91%)
EdgeExtreme-scale Discontinuous Galerkin Environment (EDGE)
Stars: ✭ 18 (-18.18%)
Base64 Avx512Code for paper "Base64 encoding and decoding at almost the speed of a memory copy"
Stars: ✭ 158 (+618.18%)
XnnpackHigh-efficiency floating-point neural network inference operators for mobile, server, and Web
Stars: ✭ 808 (+3572.73%)
SCNMathExtensionsMath extensions for SCNVector3, SCNQuaternion, SCNMatrix4
Stars: ✭ 32 (+45.45%)
LinqfasterLinq-like extension functions for Arrays, Span<T>, and List<T> that are faster and allocate less.
Stars: ✭ 615 (+2695.45%)
Ncnnncnn is a high-performance neural network inference framework optimized for the mobile platform
Stars: ✭ 13,376 (+60700%)
LibxsmmLibrary for specialized dense and sparse matrix operations, and deep learning primitives.
Stars: ✭ 518 (+2254.55%)
ThermiteThermite SIMD: Melt your CPU
Stars: ✭ 141 (+540.91%)
KuiBaDBAnother OLAP database
Stars: ✭ 297 (+1250%)
KleinP(R*_{3, 0, 1}) specialized SIMD Geometric Algebra Library
Stars: ✭ 463 (+2004.55%)
NsimdAgenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (+527.27%)
PysimdjsonPython bindings for the simdjson project.
Stars: ✭ 432 (+1863.64%)
Glam RsA simple and fast linear algebra library for games and graphics
Stars: ✭ 406 (+1745.45%)
QreverseA small study in hardware accelerated AoS reversal
Stars: ✭ 97 (+340.91%)
FastorA lightweight high performance tensor algebra framework for modern C++
Stars: ✭ 280 (+1172.73%)
nQuantCppnQuantCpp includes top 6 color quantization algorithms for visual c++ producing high quality optimized images.
Stars: ✭ 83 (+277.27%)
GrapheneA thin layer of graphic data types
Stars: ✭ 268 (+1118.18%)
NnpackAcceleration package for neural networks on multi-core CPUs
Stars: ✭ 1,538 (+6890.91%)
lsp-dsp-libDSP library for signal processing
Stars: ✭ 37 (+68.18%)
BitmagicBitMagic Library
Stars: ✭ 263 (+1095.45%)
Amplifier.netAmplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPU/GPU, NVIDIA, AMD without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.
Stars: ✭ 92 (+318.18%)
QuestdbAn open source SQL database designed to process time series data, faster
Stars: ✭ 7,544 (+34190.91%)
Corrfunc⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Stars: ✭ 114 (+418.18%)
Sse2neonA translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation
Stars: ✭ 316 (+1336.36%)
hpcLearning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )
Stars: ✭ 39 (+77.27%)
HighwayPerformance-portable, length-agnostic SIMD with runtime dispatch
Stars: ✭ 301 (+1268.18%)
PackettracerThe SIMD-accelereted ray tracing in C# powered by Intel hardware intrinsic of .NET Core.
Stars: ✭ 109 (+395.45%)
TsimdFundamental C++ SIMD types for Intel CPUs (sse, avx, avx2, avx512)
Stars: ✭ 290 (+1218.18%)
MippMIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX and AVX-512.
Stars: ✭ 253 (+1050%)
Std Simdstd::experimental::simd for GCC [ISO/IEC TS 19570:2018]
Stars: ✭ 275 (+1150%)
SketchC++ Implementations of sketch data structures with SIMD Parallelism, including Python bindings
Stars: ✭ 96 (+336.36%)
sliceslice-rsA fast implementation of single-pattern substring search using SIMD acceleration.
Stars: ✭ 66 (+200%)
simdutfUnicode routines (UTF8, UTF16): billions of characters per second.
Stars: ✭ 108 (+390.91%)
Jsturbo.js - perform massive parallel computations in your browser with GPGPU.
Stars: ✭ 2,591 (+11677.27%)
FasterSIMD for humans
Stars: ✭ 1,304 (+5827.27%)
MaskedvbyteFast decoder for VByte-compressed integers
Stars: ✭ 91 (+313.64%)
penguinVSimple and fast C++ image processing library with focus on heterogeneous systems
Stars: ✭ 110 (+400%)
SIMDArraySIMD enhanced Array operations
Stars: ✭ 123 (+459.09%)