C++ image processing and machine learning library with using of SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX(Altivec) and VSX(Power7), NEON for ARM.

Stars: ✭ 1,263 (+31.02%)

Mutual labels: simd, sse, neon, avx512, avx

Quadray Engine

Realtime raytracer using SIMD on ARM, MIPS, PPC and x86

Stars: ✭ 13 (-98.65%)

Mutual labels: simd, sse, neon, avx512, avx

Unisimd Assembler

SIMD macro assembler unified for ARM, MIPS, PPC and x86

Stars: ✭ 63 (-93.46%)

Mutual labels: simd, sse, neon, avx512, avx

Base64simd

Base64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)

Stars: ✭ 115 (-88.07%)

Mutual labels: simd, sse, neon, avx512

Nsimd

Agenium Scale vectorization library for CPUs and GPUs

Stars: ✭ 138 (-85.68%)

Mutual labels: simd, neon, avx512, avx

oversimple

A library for audio oversampling, which tries to offer a simple api while wrapping HIIR, by Laurent De Soras, for minimum phase antialiasing, and r8brain-free-src, by Aleksey Vaneev, for linear phase antialiasing.

Stars: ✭ 25 (-97.41%)

Mutual labels: neon, avx, sse, simd

Libxsmm

Library for specialized dense and sparse matrix operations, and deep learning primitives.

Stars: ✭ 518 (-46.27%)

Mutual labels: simd, sse, avx512, avx

Mipp

MIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX and AVX-512.

Stars: ✭ 253 (-73.76%)

Mutual labels: simd, sse, neon, avx

ternary-logic

Support for ternary logic in SSE, XOP, AVX2 and x86 programs

Stars: ✭ 21 (-97.82%)

Mutual labels: avx, sse, simd, avx512

Cglm

📽 Highly Optimized Graphics Math (glm) for C

Stars: ✭ 887 (-7.99%)

Mutual labels: simd, sse, neon, avx

Directxmath

DirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps

Stars: ✭ 859 (-10.89%)

Mutual labels: simd, sse, neon, avx

Libsimdpp

Portable header-only C++ low level SIMD library

Stars: ✭ 914 (-5.19%)

Mutual labels: simd, sse, neon, avx512

Despacer

C library to remove white space from strings as fast as possible

Stars: ✭ 90 (-90.66%)

Mutual labels: simd, sse, avx

HLML

Auto-generated maths library for C and C++ based on HLSL/Cg

Stars: ✭ 23 (-97.61%)

Mutual labels: sse, simd, vectorization

Corrfunc

⚡️⚡️⚡️Blazing fast correlation functions on the CPU.

Stars: ✭ 114 (-88.17%)

Mutual labels: simd, avx512, avx

std find simd

std::find simd version

Stars: ✭ 19 (-98.03%)

Mutual labels: simd, vectorization, avx512

sse-avx-rasterization

Triangle rasterization routines accelerated by SSE and AVX

Stars: ✭ 53 (-94.5%)

Mutual labels: avx, sse, simd

Guided Missile Simulation

Guided Missile, Radar and Infrared EOS Simulation Framework written in Fortran.

Stars: ✭ 33 (-96.58%)

Mutual labels: avx, simd, vectorization

hpc

Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )

Stars: ✭ 39 (-95.95%)

Mutual labels: avx, sse, simd

simd-byte-lookup

SIMDized check which bytes are in a set

Stars: ✭ 23 (-97.61%)

Mutual labels: sse, simd, avx512

penguinV

Simple and fast C++ image processing library with focus on heterogeneous systems

Stars: ✭ 110 (-88.59%)

Mutual labels: avx, sse, simd

ultra-sort

DSL for SIMD Sorting on AVX2 & AVX512

Stars: ✭ 29 (-96.99%)

Mutual labels: simd, vectorization, avx512

SoftLight

A shader-based Software Renderer Using The LightSky Framework.

Stars: ✭ 2 (-99.79%)

Mutual labels: neon, sse, simd

cpuwhat

Nim utilities for advanced CPU operations: CPU identification, ISA extension detection, bindings to assorted intrinsics

Stars: ✭ 25 (-97.41%)

Mutual labels: avx, sse, simd

Fastnoisesimd

C++ SIMD Noise Library

Stars: ✭ 542 (-43.78%)

Mutual labels: simd, sse, neon

Sse2neon

A translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation

Stars: ✭ 316 (-67.22%)

Mutual labels: simd, sse, neon

Sse4 Strstr

SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) of Karp-Rabin algorithm's modification

Stars: ✭ 115 (-88.07%)

Mutual labels: sse, neon, avx512

Highway

Performance-portable, length-agnostic SIMD with runtime dispatch

Stars: ✭ 301 (-68.78%)

Mutual labels: simd, neon, avx512

Hybridizer Basic Samples

Examples of C# code compiled to GPU by hybridizer

Stars: ✭ 186 (-80.71%)

Mutual labels: vectorization, avx512, avx

Hlslpp

Math library using hlsl syntax with SSE/NEON support

Stars: ✭ 153 (-84.13%)

Mutual labels: sse, neon, avx

Kfr

Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)

Stars: ✭ 985 (+2.18%)

Mutual labels: simd, avx512, avx

Md5 Simd

Accelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.

Stars: ✭ 71 (-92.63%)

Mutual labels: simd, avx512

Ozz Animation

Open source c++ skeletal animation library and toolset

Stars: ✭ 1,250 (+29.67%)

Mutual labels: simd, sse

Packettracer

The SIMD-accelereted ray tracing in C# powered by Intel hardware intrinsic of .NET Core.

Stars: ✭ 109 (-88.69%)

Mutual labels: simd, avx

Impala

An imperative and functional programming language

Stars: ✭ 118 (-87.76%)

Mutual labels: simd, vectorization

Thorin

The Higher-Order Intermediate Representation

Stars: ✭ 116 (-87.97%)

Mutual labels: simd, vectorization

Fastapprox

Approximate and vectorized versions of common mathematical functions

Stars: ✭ 128 (-86.72%)

Mutual labels: simd, vectorization

Klein

P(R*_{3, 0, 1}) specialized SIMD Geometric Algebra Library

Stars: ✭ 463 (-51.97%)

Mutual labels: simd, sse

Base64 Avx512

Code for paper "Base64 encoding and decoding at almost the speed of a memory copy"

Stars: ✭ 158 (-83.61%)

Mutual labels: simd, avx512

Ugm

Ubpa Graphics Mathematics

Stars: ✭ 178 (-81.54%)

Mutual labels: simd, sse

Simdjson

Parsing gigabytes of JSON per second

Stars: ✭ 15,115 (+1467.95%)

Mutual labels: simd, neon

Turbo Run Length Encoding

TurboRLE-Fastest Run Length Encoding

Stars: ✭ 212 (-78.01%)

Mutual labels: simd, sse

Computelibrary

The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.

Stars: ✭ 2,123 (+120.23%)

Mutual labels: simd, neon

Turbo-Transpose

Transpose: SIMD Integer+Floating Point Compression Filter

Stars: ✭ 50 (-94.81%)

Mutual labels: sse, simd

Sha256 Simd

Accelerate SHA256 computations in pure Go using Accelerate SHA256 computations in pure Go using AVX512, SHA Extensions for x86 and ARM64 for ARM. On AVX512 it provides an up to 8x improvement (over 3 GB/s per core). SHA Extensions give a performance boost of close to 4x over native.

Stars: ✭ 657 (-31.85%)

Mutual labels: avx512, avx

FFmpegPlayer

Simple FFmpeg video player

Stars: ✭ 72 (-92.53%)

Mutual labels: sse, simd

utf8

Fast UTF-8 validation with range algorithm (NEON+SSE4+AVX2)

Stars: ✭ 60 (-93.78%)

Mutual labels: neon, simd

runtime

AnyDSL Runtime Library