Sse4 StrstrSIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) of Karp-Rabin algorithm's modification
Stars: ✭ 115 (-47.49%)
SimdC++ image processing and machine learning library with using of SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX(Altivec) and VSX(Power7), NEON for ARM.
Stars: ✭ 1,263 (+476.71%)
SimdeImplementations of SIMD instruction sets for systems which don't natively support them.
Stars: ✭ 1,012 (+362.1%)
LibsimdppPortable header-only C++ low level SIMD library
Stars: ✭ 914 (+317.35%)
HighwayPerformance-portable, length-agnostic SIMD with runtime dispatch
Stars: ✭ 301 (+37.44%)
NsimdAgenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (-36.99%)
Quadray EngineRealtime raytracer using SIMD on ARM, MIPS, PPC and x86
Stars: ✭ 13 (-94.06%)
UmesimdUME::SIMD A library for explicit simd vectorization.
Stars: ✭ 66 (-69.86%)
Base64simdBase64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Stars: ✭ 115 (-47.49%)
Unisimd AssemblerSIMD macro assembler unified for ARM, MIPS, PPC and x86
Stars: ✭ 63 (-71.23%)
VcSIMD Vector Classes for C++
Stars: ✭ 985 (+349.77%)
LibxsmmLibrary for specialized dense and sparse matrix operations, and deep learning primitives.
Stars: ✭ 518 (+136.53%)
XsimdC++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, NEON, AVX512)
Stars: ✭ 964 (+340.18%)
Corrfunc⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Stars: ✭ 114 (-47.95%)
DirectxmathDirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps
Stars: ✭ 859 (+292.24%)
ToysStorage for my snippets, toy programs, etc.
Stars: ✭ 187 (-14.61%)
OsacaOpen Source Architecture Code Analyzer
Stars: ✭ 162 (-26.03%)
ultra-sortDSL for SIMD Sorting on AVX2 & AVX512
Stars: ✭ 29 (-86.76%)
HighwayhashNative Go version of HighwayHash with optimized assembly implementations on Intel and ARM. Able to process over 10 GB/sec on a single core on Intel CPUs - https://en.wikipedia.org/wiki/HighwayHash
Stars: ✭ 670 (+205.94%)
ternary-logicSupport for ternary logic in SSE, XOP, AVX2 and x86 programs
Stars: ✭ 21 (-90.41%)
OnednnoneAPI Deep Neural Network Library (oneDNN)
Stars: ✭ 2,600 (+1087.21%)
SimdjsonParsing gigabytes of JSON per second
Stars: ✭ 15,115 (+6801.83%)
Md5 SimdAccelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Stars: ✭ 71 (-67.58%)
Sse PopcountSIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html
Stars: ✭ 226 (+3.2%)
Asm DudeVisual Studio extension for assembly syntax highlighting and code completion in assembly files and the disassembly window
Stars: ✭ 3,898 (+1679.91%)
SleefSIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Stars: ✭ 353 (+61.19%)
simdutf8SIMD-accelerated UTF-8 validation for Rust.
Stars: ✭ 426 (+94.52%)
utf8Fast UTF-8 validation with range algorithm (NEON+SSE4+AVX2)
Stars: ✭ 60 (-72.6%)
Std Simdstd::experimental::simd for GCC [ISO/IEC TS 19570:2018]
Stars: ✭ 275 (+25.57%)
positional-popcountFast C functions for the computing the positional popcount (pospopcnt).
Stars: ✭ 47 (-78.54%)
simdutfUnicode routines (UTF8, UTF16): billions of characters per second.
Stars: ✭ 108 (-50.68%)
Tensorflow Optimized WheelsTensorFlow wheels built for latest CUDA/CuDNN and enabled performance flags: SSE, AVX, FMA; XLA
Stars: ✭ 118 (-46.12%)
KfrFast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Stars: ✭ 985 (+349.77%)
ComputelibraryThe Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
Stars: ✭ 2,123 (+869.41%)
Cookbook🎶 Cookbook for Nette Framework (@nette) & Contributte (@contributte). Read it while its HOT!
Stars: ✭ 30 (-86.3%)
KsimThe little simulator that could.
Stars: ✭ 11 (-94.98%)
Arm VoEfficient monocular visual odometry for ground vehicles on ARM processors
Stars: ✭ 115 (-47.49%)
MigrifyFuturistic Grinder for Legacy Code with Effortles Confidence
Stars: ✭ 25 (-88.58%)
WheelsPerformance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+306.85%)
Cglm📽 Highly Optimized Graphics Math (glm) for C
Stars: ✭ 887 (+305.02%)
Neon Color SchemeA colorful bright-on-black color scheme for Sublime Text and TextMate. Its aim is to make as many languages as possible look as good as possible. Includes extended support for Python, Ruby, Clojure, JavaScript/JSON, C/C++, diff, HTML/XML, Markdown, PHP, CSS/SCSS/SASS, GitGutter, Find In Files, PackageDev, Regex, SublimeLinter, and much more.
Stars: ✭ 159 (-27.4%)
Blake3BLAKE3 hashing for JavaScript: native Node bindings (where available) and WebAssembly
Stars: ✭ 100 (-54.34%)
Neon🍸 Encodes and decodes NEON file format.
Stars: ✭ 674 (+207.76%)
Sha256 SimdAccelerate SHA256 computations in pure Go using Accelerate SHA256 computations in pure Go using AVX512, SHA Extensions for x86 and ARM64 for ARM. On AVX512 it provides an up to 8x improvement (over 3 GB/s per core). SHA Extensions give a performance boost of close to 4x over native.
Stars: ✭ 657 (+200%)
Napi RsA minimal library for building compiled Node.js add-ons in Rust
Stars: ✭ 539 (+146.12%)
Base64 Avx512Code for paper "Base64 encoding and decoding at almost the speed of a memory copy"
Stars: ✭ 158 (-27.85%)
Univdisasmx86 Disassembler and Analyzer
Stars: ✭ 74 (-66.21%)
SimdjsonsharpC# bindings for lemire/simdjson (and full C# port)
Stars: ✭ 506 (+131.05%)
HlslppMath library using hlsl syntax with SSE/NEON support
Stars: ✭ 153 (-30.14%)
Firefox Sweet Theme🍬 A dark and modern theme for firefox with vibrant colors
Stars: ✭ 496 (+126.48%)
MaceMACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.
Stars: ✭ 4,536 (+1971.23%)
NeonIntel® Nervana™ reference deep learning framework committed to best performance on all hardware
Stars: ✭ 3,855 (+1660.27%)
Ctranslate2Fast inference engine for OpenNMT models
Stars: ✭ 140 (-36.07%)
Op rbfOptimized Recursive Bilateral Filter
Stars: ✭ 47 (-78.54%)