LibsimdppPortable header-only C++ low level SIMD library
Stars: ✭ 914 (+694.78%)
SimdC++ image processing and machine learning library with using of SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX(Altivec) and VSX(Power7), NEON for ARM.
Stars: ✭ 1,263 (+998.26%)
Quadray EngineRealtime raytracer using SIMD on ARM, MIPS, PPC and x86
Stars: ✭ 13 (-88.7%)
Base64simdBase64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Stars: ✭ 115 (+0%)
Unisimd AssemblerSIMD macro assembler unified for ARM, MIPS, PPC and x86
Stars: ✭ 63 (-45.22%)
SimdeImplementations of SIMD instruction sets for systems which don't natively support them.
Stars: ✭ 1,012 (+780%)
VcSIMD Vector Classes for C++
Stars: ✭ 985 (+756.52%)
XsimdC++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, NEON, AVX512)
Stars: ✭ 964 (+738.26%)
ToysStorage for my snippets, toy programs, etc.
Stars: ✭ 187 (+62.61%)
UmesimdUME::SIMD A library for explicit simd vectorization.
Stars: ✭ 66 (-42.61%)
DirectxmathDirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps
Stars: ✭ 859 (+646.96%)
ternary-logicSupport for ternary logic in SSE, XOP, AVX2 and x86 programs
Stars: ✭ 21 (-81.74%)
HighwayPerformance-portable, length-agnostic SIMD with runtime dispatch
Stars: ✭ 301 (+161.74%)
NsimdAgenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (+20%)
Std Simdstd::experimental::simd for GCC [ISO/IEC TS 19570:2018]
Stars: ✭ 275 (+139.13%)
Libpopcnt🚀 Fast C/C++ bit population count library
Stars: ✭ 219 (+90.43%)
LibxsmmLibrary for specialized dense and sparse matrix operations, and deep learning primitives.
Stars: ✭ 518 (+350.43%)
Sse PopcountSIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html
Stars: ✭ 226 (+96.52%)
simdutf8SIMD-accelerated UTF-8 validation for Rust.
Stars: ✭ 426 (+270.43%)
SleefSIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Stars: ✭ 353 (+206.96%)
Turbo-TransposeTranspose: SIMD Integer+Floating Point Compression Filter
Stars: ✭ 50 (-56.52%)
Md5 SimdAccelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Stars: ✭ 71 (-38.26%)
OsacaOpen Source Architecture Code Analyzer
Stars: ✭ 162 (+40.87%)
utf8Fast UTF-8 validation with range algorithm (NEON+SSE4+AVX2)
Stars: ✭ 60 (-47.83%)
OnednnoneAPI Deep Neural Network Library (oneDNN)
Stars: ✭ 2,600 (+2160.87%)
SimdjsonParsing gigabytes of JSON per second
Stars: ✭ 15,115 (+13043.48%)
Tensorflow Optimized WheelsTensorFlow wheels built for latest CUDA/CuDNN and enabled performance flags: SSE, AVX, FMA; XLA
Stars: ✭ 118 (+2.61%)
HlslppMath library using hlsl syntax with SSE/NEON support
Stars: ✭ 153 (+33.04%)
oversimpleA library for audio oversampling, which tries to offer a simple api while wrapping HIIR, by Laurent De Soras, for minimum phase antialiasing, and r8brain-free-src, by Aleksey Vaneev, for linear phase antialiasing.
Stars: ✭ 25 (-78.26%)
SoftLightA shader-based Software Renderer Using The LightSky Framework.
Stars: ✭ 2 (-98.26%)
Cglm📽 Highly Optimized Graphics Math (glm) for C
Stars: ✭ 887 (+671.3%)
cpuwhatNim utilities for advanced CPU operations: CPU identification, ISA extension detection, bindings to assorted intrinsics
Stars: ✭ 25 (-78.26%)
positional-popcountFast C functions for the computing the positional popcount (pospopcnt).
Stars: ✭ 47 (-59.13%)
Sse2neonA translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation
Stars: ✭ 316 (+174.78%)
HighwayhashNative Go version of HighwayHash with optimized assembly implementations on Intel and ARM. Able to process over 10 GB/sec on a single core on Intel CPUs - https://en.wikipedia.org/wiki/HighwayHash
Stars: ✭ 670 (+482.61%)
ultra-sortDSL for SIMD Sorting on AVX2 & AVX512
Stars: ✭ 29 (-74.78%)
MippMIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX and AVX-512.
Stars: ✭ 253 (+120%)
simdutfUnicode routines (UTF8, UTF16): billions of characters per second.
Stars: ✭ 108 (-6.09%)
Asm DudeVisual Studio extension for assembly syntax highlighting and code completion in assembly files and the disassembly window
Stars: ✭ 3,898 (+3289.57%)
Corrfunc⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Stars: ✭ 114 (-0.87%)
KfrFast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Stars: ✭ 985 (+756.52%)
Univdisasmx86 Disassembler and Analyzer
Stars: ✭ 74 (-35.65%)
Strman JavaA Java 8 string manipulation library.
Stars: ✭ 1,362 (+1084.35%)
Left Pad⬅️ String left pad -- deprecated, use String.prototype.padStart()
Stars: ✭ 1,179 (+925.22%)
Shell FunctoolsFunctional programming tools for the shell
Stars: ✭ 971 (+744.35%)
Cookbook🎶 Cookbook for Nette Framework (@nette) & Contributte (@contributte). Read it while its HOT!
Stars: ✭ 30 (-73.91%)
EventsourceSSE Swiss Army Knife for Go
Stars: ✭ 29 (-74.78%)
StrtkC++ String Toolkit Library
Stars: ✭ 113 (-1.74%)
RaySmall pathtracing library with GPU and CPU backends
Stars: ✭ 95 (-17.39%)
MightystringMaking Ruby Strings Powerful
Stars: ✭ 28 (-75.65%)
BetaAn open source reimplementation of Benny Brodda's BETA in Python
Stars: ✭ 65 (-43.48%)
OpensseOpen Sketch Search Engine- 3D object retrieval based on sketch image as input
Stars: ✭ 883 (+667.83%)