articThe AlteRnaTive Impala Compiler
Stars: ✭ 16 (-5.88%)
Guided Missile SimulationGuided Missile, Radar and Infrared EOS Simulation Framework written in Fortran.
Stars: ✭ 33 (+94.12%)
CARECHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as loop fusion capability and a portable interface for many numerical algorithms. It provides all the basics for anyone wanting to write portable code.
Stars: ✭ 22 (+29.41%)
SimdeImplementations of SIMD instruction sets for systems which don't natively support them.
Stars: ✭ 1,012 (+5852.94%)
ultra-sortDSL for SIMD Sorting on AVX2 & AVX512
Stars: ✭ 29 (+70.59%)
HLMLAuto-generated maths library for C and C++ based on HLSL/Cg
Stars: ✭ 23 (+35.29%)
qHilbertqHilbert is a vectorized speedup of Hilbert curve generation using SIMD intrinsics
Stars: ✭ 22 (+29.41%)
DeepnetDeep.Net machine learning framework for F#
Stars: ✭ 99 (+482.35%)
SleefSIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Stars: ✭ 353 (+1976.47%)
CekirdeklerMulti-device OpenCL kernel load balancer and pipeliner API for C#. Uses shared-distributed memory model to keep GPUs updated fast while using same kernel on all devices(for simplicity).
Stars: ✭ 76 (+347.06%)
ClojurecudaClojure library for CUDA development
Stars: ✭ 158 (+829.41%)
GpufitGPU-accelerated Levenberg-Marquardt curve fitting in CUDA
Stars: ✭ 174 (+923.53%)
PysnnEfficient Spiking Neural Network framework, built on top of PyTorch for GPU acceleration
Stars: ✭ 129 (+658.82%)
rbcudaCUDA bindings for Ruby
Stars: ✭ 57 (+235.29%)
gpuvmemGPU Framework for Radio Astronomical Image Synthesis
Stars: ✭ 27 (+58.82%)
gpuhdMassively Parallel Huffman Decoding on GPUs
Stars: ✭ 30 (+76.47%)
HeteroflowConcurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (+235.29%)
ObsidianObsidian Language Repository
Stars: ✭ 38 (+123.53%)
VuhVulkan compute for people
Stars: ✭ 264 (+1452.94%)
BayaderaHigh-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (+1911.76%)
Stdgpustdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+3023.53%)
EmuThe write-once-run-anywhere GPGPU library for Rust
Stars: ✭ 1,350 (+7841.18%)
VcSIMD Vector Classes for C++
Stars: ✭ 985 (+5694.12%)
XsimdC++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, NEON, AVX512)
Stars: ✭ 964 (+5570.59%)
UmesimdUME::SIMD A library for explicit simd vectorization.
Stars: ✭ 66 (+288.24%)
Montecarlomeasurements.jlPropagation of distributions by Monte-Carlo sampling: Real number types with uncertainty represented by samples.
Stars: ✭ 168 (+888.24%)
ImpalaAn imperative and functional programming language
Stars: ✭ 118 (+594.12%)
NeanderthalFast Clojure Matrix Library
Stars: ✭ 927 (+5352.94%)
ThorinThe Higher-Order Intermediate Representation
Stars: ✭ 116 (+582.35%)
FastapproxApproximate and vectorized versions of common mathematical functions
Stars: ✭ 128 (+652.94%)
processorA compiler, assembler, and processor.
Stars: ✭ 24 (+41.18%)
sliceslice-rsA fast implementation of single-pattern substring search using SIMD acceleration.
Stars: ✭ 66 (+288.24%)
KuiBaDBAnother OLAP database
Stars: ✭ 297 (+1647.06%)
gardeniaGARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators
Stars: ✭ 22 (+29.41%)
gpuowlGPU Mersenne primality test.
Stars: ✭ 77 (+352.94%)
PetIBMPetIBM - toolbox and applications of the immersed-boundary method on distributed-memory architectures
Stars: ✭ 80 (+370.59%)
brian2cudaA brian2 extension to simulate spiking neural networks on GPUs
Stars: ✭ 46 (+170.59%)
beatmupBeatmup: image and signal processing library
Stars: ✭ 168 (+888.24%)
memchrOptimized string search routines for Rust.
Stars: ✭ 474 (+2688.24%)
uwufastest text uwuifier in the west
Stars: ✭ 1,193 (+6917.65%)
cef-mixerHigh Performance off-screen rendering (OSR) demo using CEF
Stars: ✭ 183 (+976.47%)
hpcLearning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )
Stars: ✭ 39 (+129.41%)
Turbo-TransposeTranspose: SIMD Integer+Floating Point Compression Filter
Stars: ✭ 50 (+194.12%)
multiversionEasy function multiversioning for Rust
Stars: ✭ 152 (+794.12%)
euler2d cudaFortran2nd order Godunov solver for 2d Euler equations written in CUDA Fortran and stdpar (standard paralelism)
Stars: ✭ 24 (+41.18%)
heyoka.pyPython library for ODE integration via Taylor's method and LLVM
Stars: ✭ 45 (+164.71%)
aes-gcm-siv.NET Core 3.0 implementation of AES-GCM-SIV nonce misuse-resistant authenticated encryption
Stars: ✭ 22 (+29.41%)
notebooksA docker-based starter kit for machine learning via jupyter notebooks. Designed for those who just want a runtime environment and get on with machine learning. Docker tags:
Stars: ✭ 29 (+70.59%)
LvArrayPortable HPC Containers (C++)
Stars: ✭ 37 (+117.65%)
virtual sketchingGeneral Virtual Sketching Framework for Vector Line Art (SIGGRAPH 2021)
Stars: ✭ 111 (+552.94%)
penguinVSimple and fast C++ image processing library with focus on heterogeneous systems
Stars: ✭ 110 (+547.06%)
SIMDArraySIMD enhanced Array operations
Stars: ✭ 123 (+623.53%)
fasterRasterFaster raster processing using GRASS GIS
Stars: ✭ 18 (+5.88%)
opensbliA framework for the automated derivation and parallel execution of finite difference solvers on a range of computer architectures.
Stars: ✭ 56 (+229.41%)