WheelsPerformance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+190.23%)
CubCooperative primitives for CUDA C++.
Stars: ✭ 883 (+187.62%)
Scikit CudaPython interface to GPU-powered libraries
Stars: ✭ 803 (+161.56%)
Qualia2.0Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.
Stars: ✭ 41 (-86.64%)
Nvidia libs testTests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (-88.27%)
PycudaCUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (+262.21%)
PyopenclOpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+157.33%)
Cuda Design PatternsSome CUDA design patterns and a bit of template magic for CUDA
Stars: ✭ 78 (-74.59%)
Cudart.jlJulia wrapper for CUDA runtime API
Stars: ✭ 75 (-75.57%)
ThundersvmThunderSVM: A Fast SVM Library on GPUs and CPUs
Stars: ✭ 1,282 (+317.59%)
ParenchymaAn extensible HPC framework for CUDA, OpenCL and native CPU.
Stars: ✭ 71 (-76.87%)
DeepnetDeep.Net machine learning framework for F#
Stars: ✭ 99 (-67.75%)
PynvvlA Python wrapper of NVIDIA Video Loader (NVVL) with CuPy for fast video loading with Python
Stars: ✭ 95 (-69.06%)
Tensorflow Object Detection TutorialThe purpose of this tutorial is to learn how to install and prepare TensorFlow framework to train your own convolutional neural network object detection classifier for multiple objects, starting from scratch
Stars: ✭ 113 (-63.19%)
MarianFast Neural Machine Translation in C++
Stars: ✭ 777 (+153.09%)
RemoterySingle C file, Realtime CPU/GPU Profiler with Remote Web Viewer
Stars: ✭ 1,908 (+521.5%)
Hoomd BlueMolecular dynamics and Monte Carlo soft matter simulation on GPUs.
Stars: ✭ 143 (-53.42%)
Cumf alsCUDA Matrix Factorization Library with Alternating Least Square (ALS)
Stars: ✭ 154 (-49.84%)
ForwardA library for high performance deep learning inference on NVIDIA GPUs.
Stars: ✭ 136 (-55.7%)
PrimitivA Neural Network Toolkit.
Stars: ✭ 164 (-46.58%)
KhivaAn open-source library of algorithms to analyse time series in GPU and CPU.
Stars: ✭ 161 (-47.56%)
CreepminerBurstcoin C++ CPU and GPU Miner
Stars: ✭ 169 (-44.95%)
LibcudacxxThe C++ Standard Library for your entire system.
Stars: ✭ 1,861 (+506.19%)
CumlcuML - RAPIDS Machine Learning Library
Stars: ✭ 2,504 (+715.64%)
Macos Egpu Cuda GuideSet up CUDA for machine learning (and gaming) on macOS using a NVIDIA eGPU
Stars: ✭ 187 (-39.09%)
Ssd Gpu DmaBuild userspace NVMe drivers and storage applications with CUDA support
Stars: ✭ 172 (-43.97%)
GenomeworksSDK for GPU accelerated genome assembly and analysis
Stars: ✭ 215 (-29.97%)
BohriumAutomatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Stars: ✭ 209 (-31.92%)
GunrockHigh-Performance Graph Primitives on GPUs
Stars: ✭ 718 (+133.88%)
tiny-cuda-nnLightning fast & tiny C++/CUDA neural network framework
Stars: ✭ 908 (+195.77%)
PclPoint Cloud Library (PCL)
Stars: ✭ 6,897 (+2146.58%)
Unsupervisedrr[CVPR 2021 - Oral] UnsupervisedR&R: Unsupervised Point Cloud Registration via Differentiable Rendering
Stars: ✭ 43 (-85.99%)
Depth clustering🚕 Fast and robust clustering of point clouds generated with a Velodyne sensor.
Stars: ✭ 657 (+114.01%)
Lidar camera calibrationLight-weight camera LiDAR calibration package for ROS using OpenCV and PCL (PnP + LM optimization)
Stars: ✭ 133 (-56.68%)
QPT[内测中]前向式Python环境快捷封装工具,快速将Python打包为EXE并添加CUDA、NoAVX等支持。
Stars: ✭ 308 (+0.33%)
CilantroA lean C++ library for working with point cloud data
Stars: ✭ 577 (+87.95%)
TaskflowA General-purpose Parallel and Heterogeneous Task Programming System
Stars: ✭ 6,128 (+1896.09%)
PbfVsImplementation of Macklin, Miles, and Matthias Müller. "Position based fluids.". Visual Studio 2015 + CUDA 8.0
Stars: ✭ 100 (-67.43%)
Fat-CloudsGPU Fluid Simulation with Volumetric Rendering
Stars: ✭ 81 (-73.62%)
Deep DiamondA fast Clojure Tensor & Deep Learning library
Stars: ✭ 288 (-6.19%)
Kinectfusionlib Implementation of the KinectFusion approach in modern C++14 and CUDA
Stars: ✭ 261 (-14.98%)
VqengineDirectX 11 Renderer written in C++11
Stars: ✭ 250 (-18.57%)
PlotoptixData visualisation in Python based on OptiX 7.2 ray tracing framework.
Stars: ✭ 252 (-17.92%)
euler2d kokkosSimple 2d finite volume solver for Euler equations using c++ kokkos library
Stars: ✭ 27 (-91.21%)
OverlapPredator[CVPR 2021, Oral] PREDATOR: Registration of 3D Point Clouds with Low Overlap.
Stars: ✭ 293 (-4.56%)
FGPUNo description or website provided.
Stars: ✭ 30 (-90.23%)
briefmatchBriefMatch real-time GPU optical flow
Stars: ✭ 36 (-88.27%)
gpubootcampThis repository consists for gpu bootcamp material for HPC and AI
Stars: ✭ 227 (-26.06%)
MatXAn efficient C++17 GPU numerical computing library with Python-like syntax
Stars: ✭ 418 (+36.16%)
DeepI2PDeepI2P: Image-to-Point Cloud Registration via Deep Classification. CVPR 2021
Stars: ✭ 130 (-57.65%)
PycpdPure Numpy Implementation of the Coherent Point Drift Algorithm
Stars: ✭ 255 (-16.94%)
SpeedtorchLibrary for faster pinned CPU <-> GPU transfer in Pytorch
Stars: ✭ 615 (+100.33%)
ChainerA flexible framework of neural networks for deep learning
Stars: ✭ 5,656 (+1742.35%)
SenetSqueeze-and-Excitation Networks
Stars: ✭ 2,850 (+828.34%)
pcl-edge-detectionEdge-detection application with PointCloud Library
Stars: ✭ 32 (-89.58%)
cuda memtestFork of CUDA GPU memtest 👓
Stars: ✭ 68 (-77.85%)
warpcontinuous energy monte carlo neutron transport in general geometries on GPUs
Stars: ✭ 27 (-91.21%)