hpcLearning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )
Stars: ✭ 39 (-64.55%)
SimdC++ image processing and machine learning library with using of SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX(Altivec) and VSX(Power7), NEON for ARM.
Stars: ✭ 1,263 (+1048.18%)
DirectxmathDirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps
Stars: ✭ 859 (+680.91%)
Cglm📽 Highly Optimized Graphics Math (glm) for C
Stars: ✭ 887 (+706.36%)
DespacerC library to remove white space from strings as fast as possible
Stars: ✭ 90 (-18.18%)
oversimpleA library for audio oversampling, which tries to offer a simple api while wrapping HIIR, by Laurent De Soras, for minimum phase antialiasing, and r8brain-free-src, by Aleksey Vaneev, for linear phase antialiasing.
Stars: ✭ 25 (-77.27%)
Quadray EngineRealtime raytracer using SIMD on ARM, MIPS, PPC and x86
Stars: ✭ 13 (-88.18%)
SimdeImplementations of SIMD instruction sets for systems which don't natively support them.
Stars: ✭ 1,012 (+820%)
Std Simdstd::experimental::simd for GCC [ISO/IEC TS 19570:2018]
Stars: ✭ 275 (+150%)
XsimdC++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, NEON, AVX512)
Stars: ✭ 964 (+776.36%)
MippMIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX and AVX-512.
Stars: ✭ 253 (+130%)
cpuwhatNim utilities for advanced CPU operations: CPU identification, ISA extension detection, bindings to assorted intrinsics
Stars: ✭ 25 (-77.27%)
VcSIMD Vector Classes for C++
Stars: ✭ 985 (+795.45%)
LibxsmmLibrary for specialized dense and sparse matrix operations, and deep learning primitives.
Stars: ✭ 518 (+370.91%)
Unisimd AssemblerSIMD macro assembler unified for ARM, MIPS, PPC and x86
Stars: ✭ 63 (-42.73%)
ternary-logicSupport for ternary logic in SSE, XOP, AVX2 and x86 programs
Stars: ✭ 21 (-80.91%)
dcurlHardware-accelerated Multi-threaded IOTA PoW, drop-in replacement for ccurl
Stars: ✭ 39 (-64.55%)
RaySmall pathtracing library with GPU and CPU backends
Stars: ✭ 95 (-13.64%)
SoftLightA shader-based Software Renderer Using The LightSky Framework.
Stars: ✭ 2 (-98.18%)
FFmpegPlayerSimple FFmpeg video player
Stars: ✭ 72 (-34.55%)
Sse2neonA translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation
Stars: ✭ 316 (+187.27%)
UgmUbpa Graphics Mathematics
Stars: ✭ 178 (+61.82%)
Ozz AnimationOpen source c++ skeletal animation library and toolset
Stars: ✭ 1,250 (+1036.36%)
LibsimdppPortable header-only C++ low level SIMD library
Stars: ✭ 914 (+730.91%)
BitmagicBitMagic Library
Stars: ✭ 263 (+139.09%)
Turbo-TransposeTranspose: SIMD Integer+Floating Point Compression Filter
Stars: ✭ 50 (-54.55%)
hlmlvectorized high-level math library
Stars: ✭ 42 (-61.82%)
KleinP(R*_{3, 0, 1}) specialized SIMD Geometric Algebra Library
Stars: ✭ 463 (+320.91%)
HLMLAuto-generated maths library for C and C++ based on HLSL/Cg
Stars: ✭ 23 (-79.09%)
Base64simdBase64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Stars: ✭ 115 (+4.55%)
KfrFast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Stars: ✭ 985 (+795.45%)
UmesimdUME::SIMD A library for explicit simd vectorization.
Stars: ✭ 66 (-40%)
Corrfunc⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Stars: ✭ 114 (+3.64%)
PackettracerThe SIMD-accelereted ray tracing in C# powered by Intel hardware intrinsic of .NET Core.
Stars: ✭ 109 (-0.91%)
NsimdAgenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (+25.45%)
JohnJohn the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs
Stars: ✭ 5,656 (+5041.82%)
Amplifier.netAmplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPU/GPU, NVIDIA, AMD without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.
Stars: ✭ 92 (-16.36%)
CompactcnncascadeA binary library for very fast face detection using compact CNNs.
Stars: ✭ 152 (+38.18%)
SleefSIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Stars: ✭ 353 (+220.91%)
HlslppMath library using hlsl syntax with SSE/NEON support
Stars: ✭ 153 (+39.09%)
ComputelibraryThe Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
Stars: ✭ 2,123 (+1830%)
Amplifier.NETAmplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPU/GPU, NVIDIA, AMD without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.
Stars: ✭ 142 (+29.09%)
opencl-in-action-swiftGenerating OpenCL code using Swift and Grand Central Dispatch's OpenCL integration with Xcode. A direct reimplementation of the source code from the book 'OpenCL in Action' by Matthew Scarpino
Stars: ✭ 15 (-86.36%)
ck-clsmithCollective Knowledge extension to crowdsource bug detection in OpenCL compilers using CLSmith tool from Imperial College London
Stars: ✭ 26 (-76.36%)
hassesHyper's Asynchronous Server Sent event (SSE) notification Server
Stars: ✭ 18 (-83.64%)
CUDAfy.NETCUDAfy .NET allows easy development of high performance GPGPU applications completely from the .NET. It's developed in C#.
Stars: ✭ 56 (-49.09%)
QuickenshteinMaking the quickest and most memory efficient implementation of Levenshtein Distance with SIMD and Threading support
Stars: ✭ 204 (+85.45%)
ParallelQSlimShape Aware Parallel Mesh Simplification Algorithm
Stars: ✭ 84 (-23.64%)
coriander-dnnPartial implementation of NVIDIA® cuDNN API for Coriander, OpenCL 1.2
Stars: ✭ 22 (-80%)
BruteForceA simple brute forcer written in GO for SHA1, SHA256, SHA512, MD5 and bcrypt
Stars: ✭ 49 (-55.45%)
processorA compiler, assembler, and processor.
Stars: ✭ 24 (-78.18%)
sliceslice-rsA fast implementation of single-pattern substring search using SIMD acceleration.
Stars: ✭ 66 (-40%)