C++ image processing and machine learning library using SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX (Altivec) and VSX (Power7), NEON for ARM.

Stars: ✭ 1,263 (+38.18%)

Mutual labels: simd, sse, neon, avx2, avx512

Simde

Implementations of SIMD instruction sets for systems which don't natively support them.

Stars: ✭ 1,012 (+10.72%)

Mutual labels: simd, sse, neon, avx2, avx512

ternary-logic

Support for ternary logic in SSE, XOP, AVX2 and x86 programs

Stars: ✭ 21 (-97.7%)

Mutual labels: sse, simd, avx2, avx512

Libxsmm

Library for specialized dense and sparse matrix operations, and deep learning primitives.

Stars: ✭ 518 (-43.33%)

Mutual labels: simd, sse, avx2, avx512

Directxmath

DirectXMath is an all-inline SIMD C++ linear algebra library for use in games and graphics apps.

Stars: ✭ 859 (-6.02%)

Mutual labels: simd, sse, neon, avx2

Fastnoisesimd

C++ SIMD Noise Library

Stars: ✭ 542 (-40.7%)

Mutual labels: simd, sse, neon, avx2

Umesimd

UME::SIMD: a library for explicit SIMD vectorization.

Stars: ✭ 66 (-92.78%)

Mutual labels: simd, neon, avx2, avx512

Highway

Performance-portable, length-agnostic SIMD with runtime dispatch

Stars: ✭ 301 (-67.07%)

Mutual labels: simd, neon, avx2, avx512

simd-byte-lookup

SIMDized check of which bytes are in a set

Stars: ✭ 23 (-97.48%)

Mutual labels: sse, simd, avx2, avx512

Std Simd

std::experimental::simd for GCC [ISO/IEC TS 19570:2018]

Stars: ✭ 275 (-69.91%)

Mutual labels: simd, sse, neon, avx512

Nsimd

Agenium Scale vectorization library for CPUs and GPUs

Stars: ✭ 138 (-84.9%)

Mutual labels: simd, neon, avx2, avx512

Sse4 Strstr

SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) implementations of a modified Karp-Rabin algorithm

Stars: ✭ 115 (-87.42%)

Mutual labels: sse, neon, avx2, avx512

Xsimd

C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, NEON, AVX512)

Stars: ✭ 964 (+5.47%)

Mutual labels: simd, sse, neon, avx512

Md5 Simd

Accelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.

Stars: ✭ 71 (-92.23%)

Mutual labels: simd, avx2, avx512

Corrfunc

⚡️⚡️⚡️Blazing fast correlation functions on the CPU.

Stars: ✭ 114 (-87.53%)

Mutual labels: simd, avx2, avx512

Turbo Run Length Encoding

TurboRLE: Fastest Run Length Encoding

Stars: ✭ 212 (-76.81%)

Mutual labels: simd, sse, avx2

Sleef

SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT

Stars: ✭ 353 (-61.38%)

Mutual labels: simd, neon, avx512

Mipp

MIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX and AVX-512.

Stars: ✭ 253 (-72.32%)

Mutual labels: simd, sse, neon

Turbo-Histogram

Fastest Histogram Construction

Stars: ✭ 44 (-95.19%)

Mutual labels: sse, simd, avx2

Turbo-Transpose

Transpose: SIMD Integer+Floating Point Compression Filter

Stars: ✭ 50 (-94.53%)

Mutual labels: sse, simd, avx2

Cglm

📽 Highly Optimized Graphics Math (glm) for C

Stars: ✭ 887 (-2.95%)

Mutual labels: simd, sse, neon

std find simd

SIMD version of std::find

Stars: ✭ 19 (-97.92%)

Mutual labels: simd, avx2, avx512

cpuwhat

Nim utilities for advanced CPU operations: CPU identification, ISA extension detection, bindings to assorted intrinsics

Stars: ✭ 25 (-97.26%)

Mutual labels: sse, simd, avx2

Sse2neon

A translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation

Stars: ✭ 316 (-65.43%)

Mutual labels: simd, sse, neon

oversimple

A library for audio oversampling that aims to offer a simple API while wrapping HIIR, by Laurent De Soras, for minimum phase antialiasing, and r8brain-free-src, by Aleksey Vaneev, for linear phase antialiasing.

Stars: ✭ 25 (-97.26%)

Mutual labels: neon, sse, simd

ultra-sort

DSL for SIMD Sorting on AVX2 & AVX512

Stars: ✭ 29 (-96.83%)

Mutual labels: simd, avx2, avx512

simdutf8

SIMD-accelerated UTF-8 validation for Rust.

Stars: ✭ 426 (-53.39%)

Mutual labels: neon, simd, avx2

SoftLight

A shader-based Software Renderer Using The LightSky Framework.

Stars: ✭ 2 (-99.78%)

Mutual labels: neon, sse, simd

utf8

Fast UTF-8 validation with range algorithm (NEON+SSE4+AVX2)

Stars: ✭ 60 (-93.44%)

Mutual labels: neon, simd, avx2

Simdjson

Parsing gigabytes of JSON per second

Stars: ✭ 15,115 (+1553.72%)

Mutual labels: simd, neon, avx2

Toys

Storage for my snippets, toy programs, etc.

Stars: ✭ 187 (-79.54%)

Mutual labels: sse, avx2, avx512

positional-popcount

Fast C functions for computing the positional popcount (pospopcnt).

Stars: ✭ 47 (-94.86%)

Mutual labels: simd, avx2, avx512

Libpopcnt

🚀 Fast C/C++ bit population count library

Stars: ✭ 219 (-76.04%)

Mutual labels: neon, avx2, avx512

Sse Popcount

SIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html

Stars: ✭ 226 (-75.27%)

Mutual labels: sse, avx2, avx512

simdutf

Unicode routines (UTF8, UTF16): billions of characters per second.

Stars: ✭ 108 (-88.18%)

Mutual labels: neon, simd, avx2

Klein

P(R*_{3, 0, 1}) specialized SIMD Geometric Algebra Library

Stars: ✭ 463 (-49.34%)

Mutual labels: simd, sse

Ozz Animation

Open source C++ skeletal animation library and toolset

Stars: ✭ 1,250 (+36.76%)

Mutual labels: simd, sse

Highwayhash

Native Go version of HighwayHash with optimized assembly implementations on Intel and ARM. Able to process over 10 GB/sec on a single core on Intel CPUs - https://en.wikipedia.org/wiki/HighwayHash

Stars: ✭ 670 (-26.7%)

Mutual labels: neon, avx2

Despacer

C library to remove white space from strings as fast as possible

Stars: ✭ 90 (-90.15%)

Mutual labels: simd, sse

Asm Dude

Visual Studio extension for assembly syntax highlighting and code completion in assembly files and the disassembly window

Stars: ✭ 3,898 (+326.48%)

Mutual labels: avx2, avx512

Computelibrary

The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.