NsimdAgenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (+94.37%)
UmesimdUME::SIMD A library for explicit simd vectorization.
Stars: ✭ 66 (-7.04%)
Base64simdBase64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Stars: ✭ 115 (+61.97%)
OnednnoneAPI Deep Neural Network Library (oneDNN)
Stars: ✭ 2,600 (+3561.97%)
LibxsmmLibrary for specialized dense and sparse matrix operations, and deep learning primitives.
Stars: ✭ 518 (+629.58%)
Quadray EngineRealtime raytracer using SIMD on ARM, MIPS, PPC and x86
Stars: ✭ 13 (-81.69%)
ternary-logicSupport for ternary logic in SSE, XOP, AVX2 and x86 programs
Stars: ✭ 21 (-70.42%)
Unisimd AssemblerSIMD macro assembler unified for ARM, MIPS, PPC and x86
Stars: ✭ 63 (-11.27%)
LibsimdppPortable header-only C++ low level SIMD library
Stars: ✭ 914 (+1187.32%)
positional-popcountFast C functions for the computing the positional popcount (pospopcnt).
Stars: ✭ 47 (-33.8%)
SimdC++ image processing and machine learning library with using of SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX(Altivec) and VSX(Power7), NEON for ARM.
Stars: ✭ 1,263 (+1678.87%)
Corrfunc⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Stars: ✭ 114 (+60.56%)
VcSIMD Vector Classes for C++
Stars: ✭ 985 (+1287.32%)
ultra-sortDSL for SIMD Sorting on AVX2 & AVX512
Stars: ✭ 29 (-59.15%)
HighwayPerformance-portable, length-agnostic SIMD with runtime dispatch
Stars: ✭ 301 (+323.94%)
SimdeImplementations of SIMD instruction sets for systems which don't natively support them.
Stars: ✭ 1,012 (+1325.35%)
ToysStorage for my snippets, toy programs, etc.
Stars: ✭ 187 (+163.38%)
Sse PopcountSIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html
Stars: ✭ 226 (+218.31%)
Base64 Avx512Code for paper "Base64 encoding and decoding at almost the speed of a memory copy"
Stars: ✭ 158 (+122.54%)
SimdjsonParsing gigabytes of JSON per second
Stars: ✭ 15,115 (+21188.73%)
Guided Missile SimulationGuided Missile, Radar and Infrared EOS Simulation Framework written in Fortran.
Stars: ✭ 33 (-53.52%)
Turbo-TransposeTranspose: SIMD Integer+Floating Point Compression Filter
Stars: ✭ 50 (-29.58%)
sliceslice-rsA fast implementation of single-pattern substring search using SIMD acceleration.
Stars: ✭ 66 (-7.04%)
OsacaOpen Source Architecture Code Analyzer
Stars: ✭ 162 (+128.17%)
Sse4 StrstrSIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) of Karp-Rabin algorithm's modification
Stars: ✭ 115 (+61.97%)
Libpopcnt🚀 Fast C/C++ bit population count library
Stars: ✭ 219 (+208.45%)
HashratHashing tool supporting md5,sha1,sha256,sha512,whirlpool,jh and hmac versions of these. Includes recursive file hashing and other features.
Stars: ✭ 46 (-35.21%)
cpuwhatNim utilities for advanced CPU operations: CPU identification, ISA extension detection, bindings to assorted intrinsics
Stars: ✭ 25 (-64.79%)
simdutf8SIMD-accelerated UTF-8 validation for Rust.
Stars: ✭ 426 (+500%)
block-alignerSIMD-accelerated library for computing global and X-drop affine gap penalty sequence-to-sequence or sequence-to-profile alignments using an adaptive block-based algorithm.
Stars: ✭ 58 (-18.31%)
awesome-simdA curated list of awesome SIMD frameworks, libraries and software
Stars: ✭ 39 (-45.07%)
Std Simdstd::experimental::simd for GCC [ISO/IEC TS 19570:2018]
Stars: ✭ 275 (+287.32%)
SleefSIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Stars: ✭ 353 (+397.18%)
T1haOne of the fastest hash functions
Stars: ✭ 302 (+325.35%)
Asm DudeVisual Studio extension for assembly syntax highlighting and code completion in assembly files and the disassembly window
Stars: ✭ 3,898 (+5390.14%)
hash-checkerFast and simple application that allows you to generate and compare hashes from files and text
Stars: ✭ 72 (+1.41%)
DirectxmathDirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps
Stars: ✭ 859 (+1109.86%)
simdutfUnicode routines (UTF8, UTF16): billions of characters per second.
Stars: ✭ 108 (+52.11%)
utf8Fast UTF-8 validation with range algorithm (NEON+SSE4+AVX2)
Stars: ✭ 60 (-15.49%)
Fastbase64SIMD-accelerated base64 codecs
Stars: ✭ 309 (+335.21%)
XsimdC++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, NEON, AVX512)
Stars: ✭ 964 (+1257.75%)
simdjson-rsRust version of lemire's SimdJson
Stars: ✭ 18 (-74.65%)
md5-optimisationThe fastest MD5 implementation using x86 assembly
Stars: ✭ 45 (-36.62%)
SimdjsonsharpC# bindings for lemire/simdjson (and full C# port)
Stars: ✭ 506 (+612.68%)
Parallel XxhashCompute xxHash hash codes for 8 keys in parallel
Stars: ✭ 36 (-49.3%)
KfrFast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Stars: ✭ 985 (+1287.32%)
Gl vs vkComparison of OpenGL and Vulkan API in terms of performance.
Stars: ✭ 65 (-8.45%)
Kirby3 AutoidAutomatic unique ID for Pages, Files and Structures including performant helpers to retrieve them. Bonus: Tiny-URL.
Stars: ✭ 58 (-18.31%)
Pc Optimization Hubcollection of various resources devoted to performance and input lag optimization
Stars: ✭ 55 (-22.54%)
Go TdigestA T-Digest implementation in golang
Stars: ✭ 67 (-5.63%)