LibxsmmLibrary for specialized dense and sparse matrix operations, and deep learning primitives.
Stars: ✭ 518 (+936%)
LibsimdppPortable header-only C++ low level SIMD library
Stars: ✭ 914 (+1728%)
Quadray EngineRealtime raytracer using SIMD on ARM, MIPS, PPC and x86
Stars: ✭ 13 (-74%)
hlmlvectorized high-level math library
Stars: ✭ 42 (-16%)
ternary-logicSupport for ternary logic in SSE, XOP, AVX2 and x86 programs
Stars: ✭ 21 (-58%)
SimdC++ image processing and machine learning library with using of SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX(Altivec) and VSX(Power7), NEON for ARM.
Stars: ✭ 1,263 (+2426%)
SimdeImplementations of SIMD instruction sets for systems which don't natively support them.
Stars: ✭ 1,012 (+1924%)
VcSIMD Vector Classes for C++
Stars: ✭ 985 (+1870%)
DirectxmathDirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps
Stars: ✭ 859 (+1618%)
Unisimd AssemblerSIMD macro assembler unified for ARM, MIPS, PPC and x86
Stars: ✭ 63 (+26%)
UgmUbpa Graphics Mathematics
Stars: ✭ 178 (+256%)
cpuwhatNim utilities for advanced CPU operations: CPU identification, ISA extension detection, bindings to assorted intrinsics
Stars: ✭ 25 (-50%)
ndzipA High-Throughput Parallel Lossless Compressor for Scientific Data
Stars: ✭ 19 (-62%)
Base64simdBase64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Stars: ✭ 115 (+130%)
Cglm📽 Highly Optimized Graphics Math (glm) for C
Stars: ✭ 887 (+1674%)
SoftLightA shader-based Software Renderer Using The LightSky Framework.
Stars: ✭ 2 (-96%)
Std Simdstd::experimental::simd for GCC [ISO/IEC TS 19570:2018]
Stars: ✭ 275 (+450%)
Sse2neonA translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation
Stars: ✭ 316 (+532%)
KleinP(R*_{3, 0, 1}) specialized SIMD Geometric Algebra Library
Stars: ✭ 463 (+826%)
oversimpleA library for audio oversampling, which tries to offer a simple api while wrapping HIIR, by Laurent De Soras, for minimum phase antialiasing, and r8brain-free-src, by Aleksey Vaneev, for linear phase antialiasing.
Stars: ✭ 25 (-50%)
Ozz AnimationOpen source c++ skeletal animation library and toolset
Stars: ✭ 1,250 (+2400%)
DespacerC library to remove white space from strings as fast as possible
Stars: ✭ 90 (+80%)
penguinVSimple and fast C++ image processing library with focus on heterogeneous systems
Stars: ✭ 110 (+120%)
XsimdC++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, NEON, AVX512)
Stars: ✭ 964 (+1828%)
Sse4 StrstrSIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) of Karp-Rabin algorithm's modification
Stars: ✭ 115 (+130%)
ToysStorage for my snippets, toy programs, etc.
Stars: ✭ 187 (+274%)
glmOpenGL Mathematics (GLM)
Stars: ✭ 6,667 (+13234%)
GrapheneA thin layer of graphic data types
Stars: ✭ 268 (+436%)
CgmathA linear algebra and mathematics library for computer graphics.
Stars: ✭ 773 (+1446%)
Tensorflow Optimized WheelsTensorFlow wheels built for latest CUDA/CuDNN and enabled performance flags: SSE, AVX, FMA; XLA
Stars: ✭ 118 (+136%)
hpcLearning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )
Stars: ✭ 39 (-22%)
DecomposedCATransform3D manipulation made easy.
Stars: ✭ 184 (+268%)
MippMIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX and AVX-512.
Stars: ✭ 253 (+406%)
kanzi-cppLossless data compression in C++
Stars: ✭ 60 (+20%)
fpzipCython bindings for fpzip, a floating point image compression algorithm.
Stars: ✭ 24 (-52%)
mir-glas[Experimental] LLVM-accelerated Generic Linear Algebra Subprograms
Stars: ✭ 99 (+98%)
SCNMathExtensionsMath extensions for SCNVector3, SCNQuaternion, SCNMatrix4
Stars: ✭ 32 (-36%)
StreamvbyteFast integer compression in C using the StreamVByte codec
Stars: ✭ 195 (+290%)
HlslppMath library using hlsl syntax with SSE/NEON support
Stars: ✭ 153 (+206%)
Mangomango fun framework
Stars: ✭ 343 (+586%)
decompressPure OCaml implementation of Zlib.
Stars: ✭ 103 (+106%)
lzbase62LZ77(LZSS) based compression algorithm in base62 for JavaScript.
Stars: ✭ 38 (-24%)
SimdcompA simple C library for compressing lists of integers using binary packing
Stars: ✭ 331 (+562%)
fpzipLossless compressor of multidimensional floating-point arrays
Stars: ✭ 58 (+16%)
SimdjsonParsing gigabytes of JSON per second
Stars: ✭ 15,115 (+30130%)
HLMLAuto-generated maths library for C and C++ based on HLSL/Cg
Stars: ✭ 23 (-54%)
Sse PopcountSIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html
Stars: ✭ 226 (+352%)
zpackervery simple LZ77-based compression
Stars: ✭ 15 (-70%)
Compressed VecSIMD Floating point and integer compressed vector library
Stars: ✭ 25 (-50%)
MaskedvbyteFast decoder for VByte-compressed integers
Stars: ✭ 91 (+82%)