SimdC++ image processing and machine learning library with using of SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX(Altivec) and VSX(Power7), NEON for ARM.
Stars: ✭ 1,263 (+24.8%)
Unisimd AssemblerSIMD macro assembler unified for ARM, MIPS, PPC and x86
Stars: ✭ 63 (-93.77%)
VcSIMD Vector Classes for C++
Stars: ✭ 985 (-2.67%)
SleefSIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Stars: ✭ 353 (-65.12%)
XsimdC++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, NEON, AVX512)
Stars: ✭ 964 (-4.74%)
UmesimdUME::SIMD A library for explicit simd vectorization.
Stars: ✭ 66 (-93.48%)
Quadray EngineRealtime raytracer using SIMD on ARM, MIPS, PPC and x86
Stars: ✭ 13 (-98.72%)
NsimdAgenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (-86.36%)
cpuwhatNim utilities for advanced CPU operations: CPU identification, ISA extension detection, bindings to assorted intrinsics
Stars: ✭ 25 (-97.53%)
Base64simdBase64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Stars: ✭ 115 (-88.64%)
LibsimdppPortable header-only C++ low level SIMD library
Stars: ✭ 914 (-9.68%)
Std Simdstd::experimental::simd for GCC [ISO/IEC TS 19570:2018]
Stars: ✭ 275 (-72.83%)
ternary-logicSupport for ternary logic in SSE, XOP, AVX2 and x86 programs
Stars: ✭ 21 (-97.92%)
LibxsmmLibrary for specialized dense and sparse matrix operations, and deep learning primitives.
Stars: ✭ 518 (-48.81%)
SimdjsonParsing gigabytes of JSON per second
Stars: ✭ 15,115 (+1393.58%)
Sse2neonA translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation
Stars: ✭ 316 (-68.77%)
DirectxmathDirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps
Stars: ✭ 859 (-15.12%)
Sse4 StrstrSIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) of Karp-Rabin algorithm's modification
Stars: ✭ 115 (-88.64%)
utf8Fast UTF-8 validation with range algorithm (NEON+SSE4+AVX2)
Stars: ✭ 60 (-94.07%)
Corrfunc⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Stars: ✭ 114 (-88.74%)
MippMIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX and AVX-512.
Stars: ✭ 253 (-75%)
HighwayPerformance-portable, length-agnostic SIMD with runtime dispatch
Stars: ✭ 301 (-70.26%)
Guided Missile SimulationGuided Missile, Radar and Infrared EOS Simulation Framework written in Fortran.
Stars: ✭ 33 (-96.74%)
Cglm📽 Highly Optimized Graphics Math (glm) for C
Stars: ✭ 887 (-12.35%)
simdutf8SIMD-accelerated UTF-8 validation for Rust.
Stars: ✭ 426 (-57.91%)
ultra-sortDSL for SIMD Sorting on AVX2 & AVX512
Stars: ✭ 29 (-97.13%)
oversimpleA library for audio oversampling, which tries to offer a simple api while wrapping HIIR, by Laurent De Soras, for minimum phase antialiasing, and r8brain-free-src, by Aleksey Vaneev, for linear phase antialiasing.
Stars: ✭ 25 (-97.53%)
Libpopcnt🚀 Fast C/C++ bit population count library
Stars: ✭ 219 (-78.36%)
OsacaOpen Source Architecture Code Analyzer
Stars: ✭ 162 (-83.99%)
HlslppMath library using hlsl syntax with SSE/NEON support
Stars: ✭ 153 (-84.88%)
KeypatchMulti-architecture assembler for IDA Pro. Powered by Keystone Engine.
Stars: ✭ 939 (-7.21%)
Sse PopcountSIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html
Stars: ✭ 226 (-77.67%)
ToysStorage for my snippets, toy programs, etc.
Stars: ✭ 187 (-81.52%)
Md5 SimdAccelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Stars: ✭ 71 (-92.98%)
KfrFast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Stars: ✭ 985 (-2.67%)
DespacerC library to remove white space from strings as fast as possible
Stars: ✭ 90 (-91.11%)
positional-popcountFast C functions for the computing the positional popcount (pospopcnt).
Stars: ✭ 47 (-95.36%)
ComputelibraryThe Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
Stars: ✭ 2,123 (+109.78%)
simdutfUnicode routines (UTF8, UTF16): billions of characters per second.
Stars: ✭ 108 (-89.33%)
Turbo-TransposeTranspose: SIMD Integer+Floating Point Compression Filter
Stars: ✭ 50 (-95.06%)
HLMLAuto-generated maths library for C and C++ based on HLSL/Cg
Stars: ✭ 23 (-97.73%)
hpcLearning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )
Stars: ✭ 39 (-96.15%)
UnicornUnicorn CPU emulator framework (ARM, AArch64, M68K, Mips, Sparc, PowerPC, RiscV, X86)
Stars: ✭ 4,934 (+387.55%)
CapstoneCapstone disassembly/disassembler framework: Core (Arm, Arm64, BPF, EVM, M68K, M680X, MOS65xx, Mips, PPC, RISCV, Sparc, SystemZ, TMS320C64x, Web Assembly, X86, X86_64, XCore) + bindings.
Stars: ✭ 5,374 (+431.03%)
Sha256 SimdAccelerate SHA256 computations in pure Go using Accelerate SHA256 computations in pure Go using AVX512, SHA Extensions for x86 and ARM64 for ARM. On AVX512 it provides an up to 8x improvement (over 3 GB/s per core). SHA Extensions give a performance boost of close to 4x over native.
Stars: ✭ 657 (-35.08%)
KeystoneKeystone assembler framework: Core (Arm, Arm64, Hexagon, Mips, PowerPC, Sparc, SystemZ & X86) + bindings
Stars: ✭ 1,654 (+63.44%)
penguinVSimple and fast C++ image processing library with focus on heterogeneous systems
Stars: ✭ 110 (-89.13%)
SoftLightA shader-based Software Renderer Using The LightSky Framework.
Stars: ✭ 2 (-99.8%)
volumizrSimple kubernetes storage solution using Minio
Stars: ✭ 20 (-98.02%)
pedalevitePédale Vite — DIY multi-FX pedalboard for guitar/bass/etc.
Stars: ✭ 68 (-93.28%)
tor-relay-dockerTor relay Docker images for x86-64, armhf & arm64 (from source)
Stars: ✭ 32 (-96.84%)
CwergA light-weight compiler backend
Stars: ✭ 207 (-79.55%)