VcSIMD Vector Classes for C++
Stars: ✭ 985 (+426.74%)
Base64simdBase64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Stars: ✭ 115 (-38.5%)
Quadray EngineRealtime raytracer using SIMD on ARM, MIPS, PPC and x86
Stars: ✭ 13 (-93.05%)
ternary-logicSupport for ternary logic in SSE, XOP, AVX2 and x86 programs
Stars: ✭ 21 (-88.77%)
Sse PopcountSIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html
Stars: ✭ 226 (+20.86%)
Sse4 StrstrSIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) of Karp-Rabin algorithm's modification
Stars: ✭ 115 (-38.5%)
SimdeImplementations of SIMD instruction sets for systems which don't natively support them.
Stars: ✭ 1,012 (+441.18%)
LibsimdppPortable header-only C++ low level SIMD library
Stars: ✭ 914 (+388.77%)
LibxsmmLibrary for specialized dense and sparse matrix operations, and deep learning primitives.
Stars: ✭ 518 (+177.01%)
SimdC++ image processing and machine learning library with using of SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX(Altivec) and VSX(Power7), NEON for ARM.
Stars: ✭ 1,263 (+575.4%)
Unisimd AssemblerSIMD macro assembler unified for ARM, MIPS, PPC and x86
Stars: ✭ 63 (-66.31%)
Tensorflow Optimized WheelsTensorFlow wheels built for latest CUDA/CuDNN and enabled performance flags: SSE, AVX, FMA; XLA
Stars: ✭ 118 (-36.9%)
OsacaOpen Source Architecture Code Analyzer
Stars: ✭ 162 (-13.37%)
Corrfunc⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Stars: ✭ 114 (-39.04%)
XsimdC++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, NEON, AVX512)
Stars: ✭ 964 (+415.51%)
HighwayPerformance-portable, length-agnostic SIMD with runtime dispatch
Stars: ✭ 301 (+60.96%)
DirectxmathDirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps
Stars: ✭ 859 (+359.36%)
positional-popcountFast C functions for the computing the positional popcount (pospopcnt).
Stars: ✭ 47 (-74.87%)
OnednnoneAPI Deep Neural Network Library (oneDNN)
Stars: ✭ 2,600 (+1290.37%)
Libpopcnt🚀 Fast C/C++ bit population count library
Stars: ✭ 219 (+17.11%)
Turbo-TransposeTranspose: SIMD Integer+Floating Point Compression Filter
Stars: ✭ 50 (-73.26%)
ultra-sortDSL for SIMD Sorting on AVX2 & AVX512
Stars: ✭ 29 (-84.49%)
Md5 SimdAccelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Stars: ✭ 71 (-62.03%)
Std Simdstd::experimental::simd for GCC [ISO/IEC TS 19570:2018]
Stars: ✭ 275 (+47.06%)
UmesimdUME::SIMD A library for explicit simd vectorization.
Stars: ✭ 66 (-64.71%)
cpuwhatNim utilities for advanced CPU operations: CPU identification, ISA extension detection, bindings to assorted intrinsics
Stars: ✭ 25 (-86.63%)
Asm DudeVisual Studio extension for assembly syntax highlighting and code completion in assembly files and the disassembly window
Stars: ✭ 3,898 (+1984.49%)
NsimdAgenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (-26.2%)
Op rbfOptimized Recursive Bilateral Filter
Stars: ✭ 47 (-74.87%)
SixtyfourHow fast can we brute force a 64-bit comparison?
Stars: ✭ 41 (-78.07%)
Reddit sse streamA Server Side Event stream to deliver Reddit comments and submissions in near real-time to a client.
Stars: ✭ 39 (-79.14%)
Pulsar BeamPulsar Beam is a streaming service via HTTP built on Apache Pulsar.
Stars: ✭ 37 (-80.21%)
UgmUbpa Graphics Mathematics
Stars: ✭ 178 (-4.81%)
ShotgunFor the times you need more than just a gun.
Stars: ✭ 158 (-15.51%)
KfrFast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Stars: ✭ 985 (+426.74%)
Demo Spring Sse'Server-Sent Events (SSE) in Spring 5 with Web MVC and Web Flux' article and source code.
Stars: ✭ 102 (-45.45%)
EventsourceSSE Swiss Army Knife for Go
Stars: ✭ 29 (-84.49%)
Base64 Avx512Code for paper "Base64 encoding and decoding at almost the speed of a memory copy"
Stars: ✭ 158 (-15.51%)
RaySmall pathtracing library with GPU and CPU backends
Stars: ✭ 95 (-49.2%)
OpensseOpen Sketch Search Engine- 3D object retrieval based on sketch image as input
Stars: ✭ 883 (+372.19%)
DespacerC library to remove white space from strings as fast as possible
Stars: ✭ 90 (-51.87%)
KsimThe little simulator that could.
Stars: ✭ 11 (-94.12%)
Golang Sse Todogolang server sent events (sse) example
Stars: ✭ 23 (-87.7%)
Server Push Hooks🔥 React hooks for Socket.io, SEE, WebSockets and more to come
Stars: ✭ 176 (-5.88%)
HlslppMath library using hlsl syntax with SSE/NEON support
Stars: ✭ 153 (-18.18%)
Spring 5 ExamplesThis repository is contains spring-boot 2 / spring framework 5 project examples. Using reactive programming model / paradigm and Kotlin
Stars: ✭ 87 (-53.48%)
WheelsPerformance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+376.47%)
Cglm📽 Highly Optimized Graphics Math (glm) for C
Stars: ✭ 887 (+374.33%)
Iodineiodine - HTTP / WebSockets Server for Ruby with Pub/Sub support
Stars: ✭ 720 (+285.03%)
AgooA High Performance HTTP Server for Ruby
Stars: ✭ 679 (+263.1%)
Ctranslate2Fast inference engine for OpenNMT models
Stars: ✭ 140 (-25.13%)
Ozz AnimationOpen source c++ skeletal animation library and toolset
Stars: ✭ 1,250 (+568.45%)
HighwayhashNative Go version of HighwayHash with optimized assembly implementations on Intel and ARM. Able to process over 10 GB/sec on a single core on Intel CPUs - https://en.wikipedia.org/wiki/HighwayHash
Stars: ✭ 670 (+258.29%)