Quadray Engine: Realtime raytracer using SIMD on ARM, MIPS, PPC and x86
Stars: ✭ 13 (-97.49%)
Laser: The HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers
Stars: ✭ 191 (-63.13%)
Simd: C++ image processing and machine learning library using SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX (Altivec) and VSX (Power7), NEON for ARM.
Stars: ✭ 1,263 (+143.82%)
Umesimd: UME::SIMD, a library for explicit SIMD vectorization.
Stars: ✭ 66 (-87.26%)
Cglm: 📽 Highly Optimized Graphics Math (glm) for C
Stars: ✭ 887 (+71.24%)
Unisimd Assembler: SIMD macro assembler unified for ARM, MIPS, PPC and x86
Stars: ✭ 63 (-87.84%)
Vc: SIMD Vector Classes for C++
Stars: ✭ 985 (+90.15%)
ternary-logic: Support for ternary logic in SSE, XOP, AVX2 and x86 programs
Stars: ✭ 21 (-95.95%)
Simde: Implementations of SIMD instruction sets for systems which don't natively support them.
Stars: ✭ 1,012 (+95.37%)
Xsimd: C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, NEON, AVX512)
Stars: ✭ 964 (+86.1%)
Corrfunc: ⚡️⚡️⚡️ Blazing fast correlation functions on the CPU.
Stars: ✭ 114 (-77.99%)
Directxmath: DirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps
Stars: ✭ 859 (+65.83%)
Nsimd: Agenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (-73.36%)
Libsimdpp: Portable header-only C++ low level SIMD library
Stars: ✭ 914 (+76.45%)
Mipp: MIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX and AVX-512.
Stars: ✭ 253 (-51.16%)
ultra-sort: DSL for SIMD Sorting on AVX2 & AVX512
Stars: ✭ 29 (-94.4%)
cpuwhat: Nim utilities for advanced CPU operations: CPU identification, ISA extension detection, bindings to assorted intrinsics
Stars: ✭ 25 (-95.17%)
Std Simd: std::experimental::simd for GCC [ISO/IEC TS 19570:2018]
Stars: ✭ 275 (-46.91%)
Base64simd: Base64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Stars: ✭ 115 (-77.8%)
Hlslpp: Math library using hlsl syntax with SSE/NEON support
Stars: ✭ 153 (-70.46%)
Turbo-Transpose: Transpose: SIMD Integer+Floating Point Compression Filter
Stars: ✭ 50 (-90.35%)
Tensor: A library and extension that provides objects for scientific computing in PHP.
Stars: ✭ 146 (-71.81%)
Sse Popcount: SIMD (SSE) population count (http://0x80.pl/articles/sse-popcount.html)
Stars: ✭ 226 (-56.37%)
mir-glas: [Experimental] LLVM-accelerated Generic Linear Algebra Subprograms
Stars: ✭ 99 (-80.89%)
penguinV: Simple and fast C++ image processing library with focus on heterogeneous systems
Stars: ✭ 110 (-78.76%)
Toys: Storage for my snippets, toy programs, etc.
Stars: ✭ 187 (-63.9%)
oversimple: A library for audio oversampling, which tries to offer a simple API while wrapping HIIR, by Laurent De Soras, for minimum phase antialiasing, and r8brain-free-src, by Aleksey Vaneev, for linear phase antialiasing.
Stars: ✭ 25 (-95.17%)
Sse4 Strstr: SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) implementation of a modification of the Karp-Rabin algorithm
Stars: ✭ 115 (-77.8%)
GenericTensor: The only library that allows creating tensors (an extension of matrices) with custom types
Stars: ✭ 42 (-91.89%)
Kfr: Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Stars: ✭ 985 (+90.15%)
Md5 Simd: Accelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Stars: ✭ 71 (-86.29%)
Despacer: C library to remove white space from strings as fast as possible
Stars: ✭ 90 (-82.63%)
glm: OpenGL Mathematics (GLM)
Stars: ✭ 6,667 (+1187.07%)
Armadillo Code: Armadillo: fast C++ library for linear algebra & scientific computing - http://arma.sourceforge.net
Stars: ✭ 388 (-25.1%)
hlml: vectorized high-level math library
Stars: ✭ 42 (-91.89%)
Cgmath: A linear algebra and mathematics library for computer graphics.
Stars: ✭ 773 (+49.23%)
Fmatvec: A fast vector/matrix library
Stars: ✭ 5 (-99.03%)
Vectorious: Linear algebra in TypeScript.
Stars: ✭ 616 (+18.92%)
positional-popcount: Fast C functions for computing the positional popcount (pospopcnt).
Stars: ✭ 47 (-90.93%)
Ugm: Ubpa Graphics Mathematics
Stars: ✭ 178 (-65.64%)
Blasjs: Pure JavaScript, manually written 👌 implementation of BLAS. Many numerical software applications use BLAS computations, including Armadillo, LAPACK, LINPACK, GNU Octave, Mathematica, MATLAB, NumPy, R, and Julia.
Stars: ✭ 241 (-53.47%)
Sha256 Simd: Accelerate SHA256 computations in pure Go using AVX512, SHA Extensions for x86 and ARM64 for ARM. On AVX512 it provides an up to 8x improvement (over 3 GB/s per core). SHA Extensions give a performance boost of close to 4x over native.
Stars: ✭ 657 (+26.83%)
Lacaml: OCaml bindings for BLAS/LAPACK (high-performance linear algebra Fortran libraries)
Stars: ✭ 101 (-80.5%)
SCNMathExtensions: Math extensions for SCNVector3, SCNQuaternion, SCNMatrix4
Stars: ✭ 32 (-93.82%)
Highway: Performance-portable, length-agnostic SIMD with runtime dispatch
Stars: ✭ 301 (-41.89%)
Graphene: A thin layer of graphic data types
Stars: ✭ 268 (-48.26%)
Guided Missile Simulation: Guided Missile, Radar and Infrared EOS Simulation Framework written in Fortran.
Stars: ✭ 33 (-93.63%)
Osaca: Open Source Architecture Code Analyzer
Stars: ✭ 162 (-68.73%)
sparse: Sparse matrix formats for linear algebra supporting scientific and machine learning applications
Stars: ✭ 136 (-73.75%)
hpc: Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc.)
Stars: ✭ 39 (-92.47%)
Sleef: SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Stars: ✭ 353 (-31.85%)