MirheoComputational Microfluidics
DroNetDroNet: Efficient convolutional neural network detector for Real-Time UAV applications
dragonA host-based framework that transparently extends the GPU addressable global memory space beyond the host memory using NVM-backed data pointers
penguinVSimple and fast C++ image processing library with focus on heterogeneous systems
adoradFast, Expressive, & High-Performance Programming Language for those who dare
bandicoot-codeBandicoot: GPU accelerator add-on for the Armadillo C++ linear algebra library
indiciumPortable, advanced system information utility
Apriori-and-Eclat-Frequent-Itemset-MiningImplementation of the Apriori and Eclat algorithms, two of the best-known basic algorithms for mining frequent item sets in a set of transactions, implementation in Python.
ml-testing-acceleratorsTesting framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)
beatmupBeatmup: image and signal processing library
euler2d cudaFortran2nd order Godunov solver for 2d Euler equations written in CUDA Fortran and stdpar (standard paralelism)
gpu-camera-sampleHigh quality Image Processing software on GPU (Windows, Linux, ARM) for real time machine vision camera applications. Performance benchmarks and Glass-to-Glass time measurements. MIPI CSI cameras support. RAW2RGB processing on GPU.
GoldenSunA path tracer based on hardware ray tracing
learn-gpgpuAlgorithms implemented in CUDA + resources about GPGPU
HIBAGR package – HLA Genotype Imputation with Attribute Bagging (development version only)
SIRIUSDomain specific library for electronic structure calculations
mandrakeMandrake 🌿/👨🔬🦆 – Fast visualisation of the population structure of pathogens using Stochastic Cluster Embedding
CARECHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as loop fusion capability and a portable interface for many numerical algorithms. It provides all the basics for anyone wanting to write portable code.
SneakySnakeSneakySnake🐍 is the first and the only pre-alignment filtering algorithm that works efficiently and fast on modern CPU, FPGA, and GPU architectures. It greatly (by more than two orders of magnitude) expedites sequence alignment calculation for both short and long reads. Described in the Bioinformatics (2020) by Alser et al. https://arxiv.org/abs…
nvbenchCUDA Kernel Benchmarking Library
xenticaGPU-accelerated engine for multi-dimensional cellular automata
hpcLearning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )
TigrTransforming Graphs for Efficient Irregular Graph Processing on GPUs
cytonRLreinforcement learning, deep Q-network, double DQN, dueling DQN, prioritized experience replay
heatDistributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
fluctusAn interactive OpenCL wavefront path tracer
GPU-Fog-ParticlesTextureless fog particles using a highly customizable shader to attenuate noise values.
neovideNo Nonsense Neovim Client in Rust
nuanceA tool to run your shaders on the gpu. Also a good demo application for wgpu-rs.
ventVent is a light-weight platform built to automate network collection and analysis pipelines using a flexible set of popular open source tools and technologies. Vent is python-based, extensible, leverages docker containers, and provides both an API and CLI.
VL.FuseA library for visually programming on the GPU, built to enable rapid workflows and modular approaches to accelerated graphics, logic and computation.
asitopPerf monitoring CLI tool for Apple Silicon
hipercHigh Performance Computing Strategies for Boundary Value Problems
WindowsDisplayAPIWindowsDisplayAPI is a .Net wrapper for Windows Display and Windows CCD APIs
HTGSThe Hybrid Task Graph Scheduler API
RgtsvmThe R package for SVM with GPU architecture based on the GTSVM software
arborThe Arbor multi-compartment neural network simulation library.
drm-rsA low-level abstraction of the Direct Rendering Manager API
CUDAfy.NETCUDAfy .NET allows easy development of high performance GPGPU applications completely from the .NET. It's developed in C#.
QUICKQUICK: A GPU-enabled ab intio quantum chemistry software package
coriander-dnnPartial implementation of NVIDIA® cuDNN API for Coriander, OpenCL 1.2
scikit-learn-intelexIntel(R) Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
Theano-MPIMPI Parallel framework for training deep learning models built in Theano
NVOCNo description or website provided.
cucimNo description or website provided.