JampackExperimental parallel compression algorithm
SoliditySHA3MinerAll-in-one mixed multi-GPU (nVidia, AMD, Intel) & CPU miner solves proof of work to mine supported EIP918 tokens in a single instance (with API).
ClothTOPGPU-accelerated Cloth TOP node for TouchDesigner using the NVIDIA Flex physics solver.
Arch-Data-ScienceArchlinux PKGBUILDs for Data Science, Machine Learning, Deep Learning, NLP and Computer Vision
Fat-CloudsGPU Fluid Simulation with Volumetric Rendering
FLAMEGPU2FLAME GPU 2 is a GPU accelerated agent based modelling framework for C++ and Python
monolishmonolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
pyRenderLightweight Cuda Renderer with Python Wrapper.
JetScanJetScan : GPU accelerated portable RGB-D reconstruction system
bifrostA stream processing framework for high-throughput applications.
nBodyGPU-accelerated N-Body particle simulator with visualizer.
NN-CUDA-ExampleSeveral simple examples for popular neural network toolkits calling custom CUDA operators.
k-meansCode accompanying my blog post on k-means in Python, C++ and CUDA
gpubootcampThis repository consists for gpu bootcamp material for HPC and AI
peakperfAchieve peak performance on x86 CPUs and NVIDIA GPUs
GOSHAn ultra-fast, GPU-based large graph embedding algorithm utilizing a novel coarsening algorithm requiring not more than a single GPU.
gproshangeometry processing and shape analysis framework
FGPUNo description or website provided.
allgebraBase container for developing C++ and Fortran HPC applications
NewsMTSCTarget-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k sentences and a state-of-the-art classification model.
ctuning-programsCollective Knowledge extension with unified and customizable benchmarks (with extensible JSON meta information) to be easily integrated with customizable and portable Collective Knowledge workflows. You can easily compile and run these benchmarks using different compilers, environments, hardware and OS (Linux, MacOS, Windows, Android). More info:
HiSpatialClusterClustering spatial points with algorithm of Fast Search, high performace computing implements of CUDA or parallel in CPU, and runnable implements on python standalone or arcgis.
dbcsrDBCSR: Distributed Block Compressed Sparse Row matrix library
euler2d kokkosSimple 2d finite volume solver for Euler equations using c++ kokkos library
nodeGPU-accelerated data science and visualization in node
Social-Distancing-and-Face-Mask-DetectionSocial Distancing and Face Mask Detection using TensorFlow. Install all required Libraries and GPU drivers as well. Refer to README.md or REPORT for know to installation requirement
lane detectionLane detection for the Nvidia Jetson TX2 using OpenCV4Tegra
HeCBenchsoftware.intel.com/content/www/us/en/develop/articles/repo-evaluating-performance-productivity-oneapi.html
OpenPHParallel reduction of boundary matrices for Persistent Homology with CUDA
PySDMPythonic particle-based (super-droplet) warm-rain/aqueous-chemistry cloud microphysics package with box, parcel & 1D/2D prescribed-flow examples in Python, Julia and Matlab
gpufetchSimple yet fancy GPU architecture fetching tool
GOMCGOMC - GPU Optimized Monte Carlo is a parallel molecular simulation code designed for high-performance simulation of large systems
doptA numerical optimisation and deep learning framework for D.
ImplicitGlobalGrid.jlAlmost trivial distributed parallelization of stencil-based GPU and CPU applications on a regular staggered grid
cuda spatial deformA fast tool to do image augmentation on GPU(especially elastic_deform), can be helpful to research on Medical Image.
vs-mlrtEfficient ML Filter Runtimes for VapourSynth (with built-in support for waifu2x, DPIR, RealESRGANv2, and Real-CUGAN)
alienALIEN is a CUDA-powered artificial life simulation program.
mcxMonte Carlo eXtreme (MCX) - GPU-accelerated photon transport simulator
Hetero-MarkA Benchmark Suite for Heterogeneous System Computation
openpose-dockerA docker build file for CMU openpose with Python API support
GVProfGVProf: A Value Profiler for GPU-based Clusters
pytorch-softdtw-cudaFast CUDA implementation of (differentiable) soft dynamic time warping for PyTorch using Numba
gender classifierDeep learning, Face detection, CNN, Tensorflow, Keras, OpenCV, Python crawler
pymdeMinimum-distortion embedding with PyTorch
ecudaSTL-like containers (array, vector, matrix, cube) useable in device code.