Directxmath: DirectXMath is an all-inline SIMD C++ linear algebra library for use in games and graphics apps
Stars: ✭ 859 (+1076.71%)
Fastbase64: SIMD-accelerated base64 codecs
Stars: ✭ 309 (+323.29%)
Nsimd: Agenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (+89.04%)
Vc: SIMD Vector Classes for C++
Stars: ✭ 985 (+1249.32%)
utf8: Fast UTF-8 validation with range algorithm (NEON+SSE4+AVX2)
Stars: ✭ 60 (-17.81%)
SiaFpgaMiner: VHDL FPGA design of an optimized Blake2b pipeline to mine Siacoin
Stars: ✭ 58 (-20.55%)
awesome-simd: A curated list of awesome SIMD frameworks, libraries and software
Stars: ✭ 39 (-46.58%)
Base64simd: Base64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Stars: ✭ 115 (+57.53%)
Simde: Implementations of SIMD instruction sets for systems which don't natively support them.
Stars: ✭ 1,012 (+1286.3%)
ultra-sort: DSL for SIMD Sorting on AVX2 & AVX512
Stars: ✭ 29 (-60.27%)
Toys: Storage for my snippets, toy programs, etc.
Stars: ✭ 187 (+156.16%)
Quadray Engine: Realtime raytracer using SIMD on ARM, MIPS, PPC and x86
Stars: ✭ 13 (-82.19%)
monte: The bare minimum for high performance, fully-encrypted bidirectional RPC over TCP in Go with zero memory allocations.
Stars: ✭ 103 (+41.1%)
Highwayhash: Native Go version of HighwayHash with optimized assembly implementations on Intel and ARM. Able to process over 10 GB/sec on a single core on Intel CPUs - https://en.wikipedia.org/wiki/HighwayHash
Stars: ✭ 670 (+817.81%)
Osaca: Open Source Architecture Code Analyzer
Stars: ✭ 162 (+121.92%)
Simdjsonsharp: C# bindings for lemire/simdjson (and full C# port)
Stars: ✭ 506 (+593.15%)
Guided Missile Simulation: Guided Missile, Radar and Infrared EOS Simulation Framework written in Fortran.
Stars: ✭ 33 (-54.79%)
Sse4 Strstr: SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) implementations of a modification of the Karp-Rabin algorithm
Stars: ✭ 115 (+57.53%)
positional-popcount: Fast C functions for computing the positional popcount (pospopcnt).
Stars: ✭ 47 (-35.62%)
Sse Popcount: SIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html
Stars: ✭ 226 (+209.59%)
simdutf8: SIMD-accelerated UTF-8 validation for Rust.
Stars: ✭ 426 (+483.56%)
Md5 Simd: Accelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Stars: ✭ 71 (-2.74%)
Op rbf: Optimized Recursive Bilateral Filter
Stars: ✭ 47 (-35.62%)
Onednn: oneAPI Deep Neural Network Library (oneDNN)
Stars: ✭ 2,600 (+3461.64%)
Sixtyfour: How fast can we brute force a 64-bit comparison?
Stars: ✭ 41 (-43.84%)
Libsimdpp: Portable header-only C++ low-level SIMD library
Stars: ✭ 914 (+1152.05%)
Simdjson: Parsing gigabytes of JSON per second
Stars: ✭ 15,115 (+20605.48%)
Ksim: The little simulator that could.
Stars: ✭ 11 (-84.93%)
Wheels: Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+1120.55%)
Highwayhash: Node.js implementation of HighwayHash, Google's fast and strong hash function
Stars: ✭ 183 (+150.68%)
Kryptor: A simple, modern, and secure encryption and signing tool that aims to be a better version of age and Minisign.
Stars: ✭ 267 (+265.75%)
Libxsmm: Library for specialized dense and sparse matrix operations, and deep learning primitives.
Stars: ✭ 518 (+609.59%)
Ctranslate2: Fast inference engine for OpenNMT models
Stars: ✭ 140 (+91.78%)
Asm Dude: Visual Studio extension for assembly syntax highlighting and code completion in assembly files and the disassembly window
Stars: ✭ 3,898 (+5239.73%)
neopo: A lightweight solution for local Particle development.
Stars: ✭ 19 (-73.97%)
Highway: Performance-portable, length-agnostic SIMD with runtime dispatch
Stars: ✭ 301 (+312.33%)
Tensorflow Optimized Wheels: TensorFlow wheels built for the latest CUDA/CuDNN with performance flags enabled: SSE, AVX, FMA, XLA
Stars: ✭ 118 (+61.64%)
simdutf: Unicode routines (UTF8, UTF16): billions of characters per second.
Stars: ✭ 108 (+47.95%)
block-aligner: SIMD-accelerated library for computing global and X-drop affine gap penalty sequence-to-sequence or sequence-to-profile alignments using an adaptive block-based algorithm.
Stars: ✭ 58 (-20.55%)
Corrfunc: ⚡️⚡️⚡️ Blazing fast correlation functions on the CPU.
Stars: ✭ 114 (+56.16%)
argon-dashboard-asp-net: Start your development with a Bootstrap 4 Admin Dashboard built for ASP.NET Core framework, the newest go-to technology from Microsoft for top companies.
Stars: ✭ 176 (+141.1%)
simdjson-rs: Rust version of lemire's SimdJson
Stars: ✭ 18 (-75.34%)
Simd: C++ image processing and machine learning library using SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX (Altivec) and VSX (Power7), NEON for ARM.
Stars: ✭ 1,263 (+1630.14%)
cpuwhat: Nim utilities for advanced CPU operations: CPU identification, ISA extension detection, bindings to assorted intrinsics
Stars: ✭ 25 (-65.75%)
Libpopcnt: 🚀 Fast C/C++ bit population count library
Stars: ✭ 219 (+200%)
Umesimd: UME::SIMD, a library for explicit SIMD vectorization.
Stars: ✭ 66 (-9.59%)
argon-aframe: Glue to use aframe to author argon applications
Stars: ✭ 45 (-38.36%)
Turbo-Transpose: Transpose: SIMD Integer+Floating Point Compression Filter
Stars: ✭ 50 (-31.51%)
ternary-logic: Support for ternary logic in SSE, XOP, AVX2 and x86 programs
Stars: ✭ 21 (-71.23%)
Unisimd Assembler: SIMD macro assembler unified for ARM, MIPS, PPC and x86
Stars: ✭ 63 (-13.7%)