GinkgoNumerical linear algebra software package
Stars: ✭ 149 (-27.32%)
Libgdf[ARCHIVED] C GPU DataFrame Library
Stars: ✭ 142 (-30.73%)
DsmnetDomain-invariant Stereo Matching Networks
Stars: ✭ 153 (-25.37%)
Deformable KernelsDeforming kernels to adapt towards object deformation. In ICLR 2020.
Stars: ✭ 166 (-19.02%)
Marian DevFast Neural Machine Translation in C++ - development repository
Stars: ✭ 136 (-33.66%)
PrimitivA Neural Network Toolkit.
Stars: ✭ 164 (-20%)
RmmRAPIDS Memory Manager
Stars: ✭ 154 (-24.88%)
AgencyExecution primitives for C++
Stars: ✭ 127 (-38.05%)
Cuda programmingCode from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch
Stars: ✭ 169 (-17.56%)
JetsonHelmut Hoffer von Ankershoffen experimenting with arm64 based NVIDIA Jetson (Nano and AGX Xavier) edge devices running Kubernetes (K8s) for machine learning (ML) including Jupyter Notebooks, TensorFlow Training and TensorFlow Serving using CUDA for smart IoT.
Stars: ✭ 151 (-26.34%)
Macos Egpu Cuda GuideSet up CUDA for machine learning (and gaming) on macOS using a NVIDIA eGPU
Stars: ✭ 187 (-8.78%)
SketchgraphsA dataset of 15 million CAD sketches with geometric constraint graphs.
Stars: ✭ 148 (-27.8%)
QudaQUDA is a library for performing calculations in lattice QCD on GPUs.
Stars: ✭ 166 (-19.02%)
RemoterySingle C file, Realtime CPU/GPU Profiler with Remote Web Viewer
Stars: ✭ 1,908 (+830.73%)
ViseronSelf-hosted NVR with object detection
Stars: ✭ 192 (-6.34%)
Ctranslate2Fast inference engine for OpenNMT models
Stars: ✭ 140 (-31.71%)
Partial Order PruningPartial Order Pruning: for Best Speed/Accuracy Trade-off in Neural Architecture Search
Stars: ✭ 135 (-34.15%)
Ssd Gpu DmaBuild userspace NVMe drivers and storage applications with CUDA support
Stars: ✭ 172 (-16.1%)
LibcudacxxThe C++ Standard Library for your entire system.
Stars: ✭ 1,861 (+807.8%)
KhivaAn open-source library of algorithms to analyse time series in GPU and CPU.
Stars: ✭ 161 (-21.46%)
Cumf alsCUDA Matrix Factorization Library with Alternating Least Square (ALS)
Stars: ✭ 154 (-24.88%)
Pytorch Spynet a reimplementation of Optical Flow Estimation using a Spatial Pyramid Network in PyTorch
Stars: ✭ 190 (-7.32%)
CompactcnncascadeA binary library for very fast face detection using compact CNNs.
Stars: ✭ 152 (-25.85%)
DragonDragon: A Computation Graph Virtual Machine Based Deep Learning Framework.
Stars: ✭ 168 (-18.05%)
Cuda CnnCNN accelerated by cuda. Test on mnist and finilly get 99.76%
Stars: ✭ 148 (-27.8%)
FloorA C++ Compute/Graphics Library and Toolchain enabling same-source CUDA/Host/Metal/OpenCL/Vulkan C++ programming and execution.
Stars: ✭ 166 (-19.02%)
Optical Flow FilterA real time optical flow algorithm implemented on GPU
Stars: ✭ 146 (-28.78%)
Nvidia DockerBuild and run Docker containers leveraging NVIDIA GPUs
Stars: ✭ 13,961 (+6710.24%)
GpurirPython library for Room Impulse Response (RIR) simulation with GPU acceleration
Stars: ✭ 145 (-29.27%)
SporcoSparse Optimisation Research Code
Stars: ✭ 164 (-20%)
Hoomd BlueMolecular dynamics and Monte Carlo soft matter simulation on GPUs.
Stars: ✭ 143 (-30.24%)
Pine🌲 Aimbot powered by real-time object detection with neural networks, GPU accelerated with Nvidia. Optimized for use with CS:GO.
Stars: ✭ 202 (-1.46%)
ForwardA library for high performance deep learning inference on NVIDIA GPUs.
Stars: ✭ 136 (-33.66%)
JcudaJCuda - Java bindings for CUDA
Stars: ✭ 165 (-19.51%)
NsimdAgenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (-32.68%)
CumlcuML - RAPIDS Machine Learning Library
Stars: ✭ 2,504 (+1121.46%)
SpanetSpatial Attentive Single-Image Deraining with a High Quality Real Rain Dataset (CVPR'19)
Stars: ✭ 136 (-33.66%)
Multi Gpu Programming ModelsExamples demonstrating available options to program multiple GPUs in a single node or a cluster
Stars: ✭ 165 (-19.51%)
TimemoryModular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template API is essentially a framework to creating tools: it is designed to provide a unifying interface for recording various performance measurements alongside data logging and interfaces to other tools.
Stars: ✭ 192 (-6.34%)
NnvmNo description or website provided.
Stars: ✭ 1,639 (+699.51%)
Cx db8a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (Bert, Universal Sentence Encoder, Flair)
Stars: ✭ 164 (-20%)
MixbenchA GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL)
Stars: ✭ 130 (-36.59%)
Gmonitorgmonitor is a GPU monitor (Nvidia only at the moment)
Stars: ✭ 169 (-17.56%)
ClojurecudaClojure library for CUDA development
Stars: ✭ 158 (-22.93%)
OneflowOneFlow is a performance-centered and open-source deep learning framework.
Stars: ✭ 2,868 (+1299.02%)
SimplegpuhashtableA simple GPU hash table implemented in CUDA using lock free techniques
Stars: ✭ 198 (-3.41%)
Ck CaffeCollective Knowledge workflow for Caffe to automate installation across diverse platforms and to collaboratively evaluate and optimize Caffe-based workloads across diverse hardware, software and data sets (compilers, libraries, tools, models, inputs):
Stars: ✭ 192 (-6.34%)
CreepminerBurstcoin C++ CPU and GPU Miner
Stars: ✭ 169 (-17.56%)
Xmrminer🐜 A CUDA based miner for Monero
Stars: ✭ 158 (-22.93%)