Stdgpustdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+1670%)
monolishmonolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
Stars: ✭ 166 (+453.33%)
gpubootcampThis repository consists for gpu bootcamp material for HPC and AI
Stars: ✭ 227 (+656.67%)
OccaJIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal
Stars: ✭ 230 (+666.67%)
allgebraBase container for developing C++ and Fortran HPC applications
Stars: ✭ 14 (-53.33%)
PycudaCUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (+3606.67%)
ParenchymaAn extensible HPC framework for CUDA, OpenCL and native CPU.
Stars: ✭ 71 (+136.67%)
Cuda Design PatternsSome CUDA design patterns and a bit of template magic for CUDA
Stars: ✭ 78 (+160%)
PynvvlA Python wrapper of NVIDIA Video Loader (NVVL) with CuPy for fast video loading with Python
Stars: ✭ 95 (+216.67%)
DeepnetDeep.Net machine learning framework for F#
Stars: ✭ 99 (+230%)
OnemkloneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (+306.67%)
LibcudacxxThe C++ Standard Library for your entire system.
Stars: ✭ 1,861 (+6103.33%)
Carlsim3CARLsim is an efficient, easy-to-use, GPU-accelerated software framework for simulating large-scale spiking neural network (SNN) models with a high degree of biological detail.
Stars: ✭ 52 (+73.33%)
HeteroflowConcurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (+90%)
HeCBenchsoftware.intel.com/content/www/us/en/develop/articles/repo-evaluating-performance-productivity-oneapi.html
Stars: ✭ 85 (+183.33%)
Tsne CudaGPU Accelerated t-SNE for CUDA with Python bindings
Stars: ✭ 1,120 (+3633.33%)
PlotoptixData visualisation in Python based on OptiX 7.2 ray tracing framework.
Stars: ✭ 252 (+740%)
Deeppipe2Deep Learning library using GPU(CUDA/cuBLAS)
Stars: ✭ 90 (+200%)
Tensorflow Object Detection TutorialThe purpose of this tutorial is to learn how to install and prepare TensorFlow framework to train your own convolutional neural network object detection classifier for multiple objects, starting from scratch
Stars: ✭ 113 (+276.67%)
MprReference implementation for "Massively Parallel Rendering of Complex Closed-Form Implicit Surfaces" (SIGGRAPH 2020)
Stars: ✭ 84 (+180%)
Cumf alsCUDA Matrix Factorization Library with Alternating Least Square (ALS)
Stars: ✭ 154 (+413.33%)
Optix PathtracerSimple physically based path tracer based on Nvidia's Optix Ray Tracing Engine
Stars: ✭ 231 (+670%)
CupochRobotics with GPU computing
Stars: ✭ 225 (+650%)
Xmrminer🐜 A CUDA based miner for Monero
Stars: ✭ 158 (+426.67%)
JcudaJCuda - Java bindings for CUDA
Stars: ✭ 165 (+450%)
Nvidia Modded InfModified nVidia .inf files to run drivers on all video cards, research & telemetry free drivers
Stars: ✭ 227 (+656.67%)
Qualia2.0Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.
Stars: ✭ 41 (+36.67%)
Nvidia libs testTests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (+20%)
CudaExperiments with CUDA and Rust
Stars: ✭ 31 (+3.33%)
ArboretumGradient Boosting powered by GPU(NVIDIA CUDA)
Stars: ✭ 64 (+113.33%)
GgnnGGNN: State of the Art Graph-based GPU Nearest Neighbor Search
Stars: ✭ 63 (+110%)
Cudart.jlJulia wrapper for CUDA runtime API
Stars: ✭ 75 (+150%)
CubCooperative primitives for CUDA C++.
Stars: ✭ 883 (+2843.33%)
ThundersvmThunderSVM: A Fast SVM Library on GPUs and CPUs
Stars: ✭ 1,282 (+4173.33%)
NumerNumeric Erlang - vector and matrix operations with CUDA. Heavily inspired by Pteracuda - https://github.com/kevsmith/pteracuda
Stars: ✭ 91 (+203.33%)
Training MaterialA collection of code examples as well as presentations for training purposes
Stars: ✭ 85 (+183.33%)
Futhark💥💻💥 A data-parallel functional programming language
Stars: ✭ 1,641 (+5370%)
PygraphistryPyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer
Stars: ✭ 1,365 (+4450%)
MixbenchA GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL)
Stars: ✭ 130 (+333.33%)
GraphviteGraphVite: A General and High-performance Graph Embedding System
Stars: ✭ 865 (+2783.33%)
Optical Flow FilterA real time optical flow algorithm implemented on GPU
Stars: ✭ 146 (+386.67%)
euler2d kokkosSimple 2d finite volume solver for Euler equations using c++ kokkos library
Stars: ✭ 27 (-10%)
RemoterySingle C file, Realtime CPU/GPU Profiler with Remote Web Viewer
Stars: ✭ 1,908 (+6260%)
PrimitivA Neural Network Toolkit.
Stars: ✭ 164 (+446.67%)
KhivaAn open-source library of algorithms to analyse time series in GPU and CPU.
Stars: ✭ 161 (+436.67%)
QudaQUDA is a library for performing calculations in lattice QCD on GPUs.
Stars: ✭ 166 (+453.33%)
Hoomd BlueMolecular dynamics and Monte Carlo soft matter simulation on GPUs.
Stars: ✭ 143 (+376.67%)
Ssd Gpu DmaBuild userspace NVMe drivers and storage applications with CUDA support
Stars: ✭ 172 (+473.33%)
CumlcuML - RAPIDS Machine Learning Library
Stars: ✭ 2,504 (+8246.67%)
GenomeworksSDK for GPU accelerated genome assembly and analysis
Stars: ✭ 215 (+616.67%)
Nvidia DockerBuild and run Docker containers leveraging NVIDIA GPUs
Stars: ✭ 13,961 (+46436.67%)
Gmonitorgmonitor is a GPU monitor (Nvidia only at the moment)
Stars: ✭ 169 (+463.33%)
Macos Egpu Cuda GuideSet up CUDA for machine learning (and gaming) on macOS using a NVIDIA eGPU
Stars: ✭ 187 (+523.33%)
WheelsPerformance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+2870%)
NeanderthalFast Clojure Matrix Library
Stars: ✭ 927 (+2990%)
ForwardA library for high performance deep learning inference on NVIDIA GPUs.
Stars: ✭ 136 (+353.33%)