All Projects → Nsimd → Similar Projects or Alternatives

892 Open source projects that are alternatives of or similar to Nsimd

Boost.simd
Boost SIMD
Stars: ✭ 238 (+72.46%)
Mutual labels:  simd, aarch64, neon, avx2, avx512, avx
Sleef
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Stars: ✭ 353 (+155.8%)
Mutual labels:  simd, aarch64, neon, cuda, avx512, avx
Unisimd Assembler
SIMD macro assembler unified for ARM, MIPS, PPC and x86
Stars: ✭ 63 (-54.35%)
Mutual labels:  simd, aarch64, neon, avx2, avx512, avx
Simde
Implementations of SIMD instruction sets for systems which don't natively support them.
Stars: ✭ 1,012 (+633.33%)
Mutual labels:  simd, neon, avx2, avx512, avx
Quadray Engine
Realtime raytracer using SIMD on ARM, MIPS, PPC and x86
Stars: ✭ 13 (-90.58%)
Mutual labels:  simd, neon, avx2, avx512, avx
Vc
SIMD Vector Classes for C++
Stars: ✭ 985 (+613.77%)
Mutual labels:  simd, neon, avx2, avx512, avx
Umesimd
UME::SIMD A library for explicit simd vectorization.
Stars: ✭ 66 (-52.17%)
Mutual labels:  simd, neon, avx2, avx512, avx
Simd
C++ image processing and machine learning library with using of SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX(Altivec) and VSX(Power7), NEON for ARM.
Stars: ✭ 1,263 (+815.22%)
Mutual labels:  simd, neon, avx2, avx512, avx
Simdjson
Parsing gigabytes of JSON per second
Stars: ✭ 15,115 (+10852.9%)
Mutual labels:  simd, aarch64, neon, avx2
Directxmath
DirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps
Stars: ✭ 859 (+522.46%)
Mutual labels:  simd, neon, avx2, avx
Corrfunc
⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Stars: ✭ 114 (-17.39%)
Mutual labels:  simd, avx2, avx512, avx
Libsimdpp
Portable header-only C++ low level SIMD library
Stars: ✭ 914 (+562.32%)
Mutual labels:  simd, neon, avx2, avx512
Hybridizer Basic Samples
Examples of C# code compiled to GPU by hybridizer
Stars: ✭ 186 (+34.78%)
Mutual labels:  cuda, avx2, avx512, avx
Base64simd
Base64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Stars: ✭ 115 (-16.67%)
Mutual labels:  simd, neon, avx2, avx512
ternary-logic
Support for ternary logic in SSE, XOP, AVX2 and x86 programs
Stars: ✭ 21 (-84.78%)
Mutual labels:  avx, simd, avx2, avx512
simdutf8
SIMD-accelerated UTF-8 validation for Rust.
Stars: ✭ 426 (+208.7%)
Mutual labels:  neon, simd, avx2, aarch64
Libxsmm
Library for specialized dense and sparse matrix operations, and deep learning primitives.
Stars: ✭ 518 (+275.36%)
Mutual labels:  simd, avx2, avx512, avx
Xsimd
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, NEON, AVX512)
Stars: ✭ 964 (+598.55%)
Mutual labels:  simd, neon, avx512, avx
Highway
Performance-portable, length-agnostic SIMD with runtime dispatch
Stars: ✭ 301 (+118.12%)
Mutual labels:  simd, neon, avx2, avx512
Std Simd
std::experimental::simd for GCC [ISO/IEC TS 19570:2018]
Stars: ✭ 275 (+99.28%)
Mutual labels:  simd, neon, avx512, avx
Osaca
Open Source Architecture Code Analyzer
Stars: ✭ 162 (+17.39%)
Mutual labels:  hpc, avx2, avx512, avx
Mipp
MIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX and AVX-512.
Stars: ✭ 253 (+83.33%)
Mutual labels:  simd, neon, avx
Wheels
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+545.65%)
Mutual labels:  cuda, avx2, avx
positional-popcount
Fast C functions for the computing the positional popcount (pospopcnt).
Stars: ✭ 47 (-65.94%)
Mutual labels:  simd, avx2, avx512
Sse4 Strstr
SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) of Karp-Rabin algorithm's modification
Stars: ✭ 115 (-16.67%)
Mutual labels:  neon, avx2, avx512
Ctranslate2
Fast inference engine for OpenNMT models
Stars: ✭ 140 (+1.45%)
Mutual labels:  cuda, avx2, avx
simdutf
Unicode routines (UTF8, UTF16): billions of characters per second.
Stars: ✭ 108 (-21.74%)
Mutual labels:  neon, simd, avx2
Sse Popcount
SIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html
Stars: ✭ 226 (+63.77%)
Mutual labels:  aarch64, avx2, avx512
std find simd
std::find simd version
Stars: ✭ 19 (-86.23%)
Mutual labels:  simd, avx2, avx512
Libpopcnt
🚀 Fast C/C++ bit population count library
Stars: ✭ 219 (+58.7%)
Mutual labels:  neon, avx2, avx512
utf8
Fast UTF-8 validation with range algorithm (NEON+SSE4+AVX2)
Stars: ✭ 60 (-56.52%)
Mutual labels:  neon, simd, avx2
Sse2neon
A translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation
Stars: ✭ 316 (+128.99%)
Mutual labels:  simd, aarch64, neon
Fastnoisesimd
C++ SIMD Noise Library
Stars: ✭ 542 (+292.75%)
Mutual labels:  simd, neon, avx2
Guided Missile Simulation
Guided Missile, Radar and Infrared EOS Simulation Framework written in Fortran.
Stars: ✭ 33 (-76.09%)
Mutual labels:  avx, simd, avx2
Cglm
📽 Highly Optimized Graphics Math (glm) for C
Stars: ✭ 887 (+542.75%)
Mutual labels:  simd, neon, avx
simd-byte-lookup
SIMDized check which bytes are in a set
Stars: ✭ 23 (-83.33%)
Mutual labels:  simd, avx2, avx512
Kfr
Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Stars: ✭ 985 (+613.77%)
Mutual labels:  simd, avx512, avx
ultra-sort
DSL for SIMD Sorting on AVX2 & AVX512
Stars: ✭ 29 (-78.99%)
Mutual labels:  simd, avx2, avx512
Md5 Simd
Accelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Stars: ✭ 71 (-48.55%)
Mutual labels:  simd, avx2, avx512
Onednn
oneAPI Deep Neural Network Library (oneDNN)
Stars: ✭ 2,600 (+1784.06%)
Mutual labels:  aarch64, avx2, avx512
Computelibrary
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
Stars: ✭ 2,123 (+1438.41%)
Mutual labels:  simd, aarch64, neon
hpc
Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )
Stars: ✭ 39 (-71.74%)
Mutual labels:  hpc, avx, simd
oversimple
A library for audio oversampling, which tries to offer a simple api while wrapping HIIR, by Laurent De Soras, for minimum phase antialiasing, and r8brain-free-src, by Aleksey Vaneev, for linear phase antialiasing.
Stars: ✭ 25 (-81.88%)
Mutual labels:  neon, avx, simd
cpuwhat
Nim utilities for advanced CPU operations: CPU identification, ISA extension detection, bindings to assorted intrinsics
Stars: ✭ 25 (-81.88%)
Mutual labels:  avx, simd, avx2
peakperf
Achieve peak performance on x86 CPUs and NVIDIA GPUs
Stars: ✭ 33 (-76.09%)
Mutual labels:  cuda, avx
allgebra
Base container for developing C++ and Fortran HPC applications
Stars: ✭ 14 (-89.86%)
Mutual labels:  hpc, cuda
Turbo-Histogram
Fastest Histogram Construction
Stars: ✭ 44 (-68.12%)
Mutual labels:  simd, avx2
monolish
monolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
Stars: ✭ 166 (+20.29%)
Mutual labels:  hpc, cuda
HiSpatialCluster
Clustering spatial points with algorithm of Fast Search, high performace computing implements of CUDA or parallel in CPU, and runnable implements on python standalone or arcgis.
Stars: ✭ 31 (-77.54%)
Mutual labels:  hpc, cuda
gpubootcamp
This repository consists for gpu bootcamp material for HPC and AI
Stars: ✭ 227 (+64.49%)
Mutual labels:  hpc, cuda
Tensorflow Optimized Wheels
TensorFlow wheels built for latest CUDA/CuDNN and enabled performance flags: SSE, AVX, FMA; XLA
Stars: ✭ 118 (-14.49%)
Mutual labels:  cuda, avx2
block-aligner
SIMD-accelerated library for computing global and X-drop affine gap penalty sequence-to-sequence or sequence-to-profile alignments using an adaptive block-based algorithm.
Stars: ✭ 58 (-57.97%)
Mutual labels:  simd, avx2
dbcsr
DBCSR: Distributed Block Compressed Sparse Row matrix library
Stars: ✭ 65 (-52.9%)
Mutual labels:  hpc, cuda
cuda memtest
Fork of CUDA GPU memtest 👓
Stars: ✭ 68 (-50.72%)
Mutual labels:  hpc, cuda
MatX
An efficient C++17 GPU numerical computing library with Python-like syntax
Stars: ✭ 418 (+202.9%)
Mutual labels:  hpc, cuda
mbsolve
An open-source solver tool for the Maxwell-Bloch equations.
Stars: ✭ 14 (-89.86%)
Mutual labels:  hpc, cuda
Bitmagic
BitMagic Library
Stars: ✭ 263 (+90.58%)
Mutual labels:  simd, avx
Fastor
A lightweight high performance tensor algebra framework for modern C++
Stars: ✭ 280 (+102.9%)
Mutual labels:  simd, hpc
Fastbase64
SIMD-accelerated base64 codecs
Stars: ✭ 309 (+123.91%)
Mutual labels:  simd, avx2
Arrayfire
ArrayFire: a general purpose GPU library.
Stars: ✭ 3,693 (+2576.09%)
Mutual labels:  cuda, hpc
1-60 of 892 similar projects