Set of Python bindings to C++ libraries which provides full HW acceleration for video decoding, encoding and GPU-accelerated color space and pixel format conversions

✭ 519

68. Runx

Deep Learning Experiment Management

✭ 519

python

69. Gvdb Voxels

Sparse volume compute and rendering on NVIDIA GPUs

✭ 467

70. Digits

Deep Learning GPU Training System

✭ 4,056

HTML python lua javascript shell CSS deep-learning machine-learning gpu caffe torch

71. Dataset synthesizer

NVIDIA Deep learning Dataset Synthesizer (NDDS)

✭ 417

deep-learning computer-vision object-detection pose-estimation

72. Gpu Rest Engine

A REST API for Caffe using Docker and Go

✭ 412

deep-learning docker gpu caffe inference

73. Gdrcopy

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

✭ 407

c linux nvidia libraries

74. Jetson Gpio

A Python library that enables the use of Jetson's GPIOs

✭ 398

python

75. Nvpipe

NVIDIA-accelerated zero latency video compression library for interactive remoting applications

✭ 376

cuda

76. Aistore

AIStore: scalable storage for AI applications

✭ 367

go deep-learning high-performance etl object-storage

77. Hugectr

HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training

✭ 364

78. Libglvnd

The GL Vendor-Neutral Dispatch library

✭ 364

79. Tensorrt

TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.

✭ 4,644

C++python Jupyter Notebook Cuda CMake Dockerfile deep-learning nvidia tensorrt

80. Jitify

A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).

✭ 314

cpp cuda

81. Thrust

The C++ parallel algorithms library.

✭ 3,595

C++Cuda c CMake python shell algorithms gpu cuda nvidia thrust cpp20 cxx11 cxx14 cxx17 cxx20 nvidia-hpc-sdk

82. Pyprof

A GPU performance profiling tool for PyTorch models

✭ 313

python

83. Nvtabular

NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.

✭ 305

python

84. Libnvidia Container

NVIDIA container runtime library

✭ 307

85. Dali

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

86. Gpu Operator

NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes

✭ 295

87. Mdl Sdk

NVIDIA Material Definition Language SDK

✭ 284

88. Spark Rapids

Spark RAPIDS plugin - accelerate Apache Spark with GPUs

✭ 265

scala

89. Hpc Container Maker

HPC Container Maker

✭ 260

python docker containers hpc

90. cudnn-frontend

cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it

✭ 86

C++CMake

91. Torch-TensorRT

PyTorch/TorchScript compiler for NVIDIA GPUs using TensorRT

✭ 1,216

Jupyter Notebook C++python Starlark c Dockerfile machine-learning deep-learning cuda pytorch nvidia jetson tensorrt libtorch

92. GPUStressTest

GPU Stress Test is a tool to stress the compute engine of NVIDIA Tesla GPU’s by running a BLAS matrix multiply using different data types. It can be compiled and run on both Linux and Windows.

✭ 16

C++Cuda c CMake shell

93. vdisc

VDisc is a tool for creating and mounting virtual CD-ROM images backed by object storage

✭ 21

go python shell Cap'n Proto

94. MatX

An efficient C++17 GPU numerical computing library with Python-like syntax

✭ 418

C++Cuda Jupyter Notebook CMake python shell hpc gpu cuda gpgpu gpu-computing

95. ngc-container-replicator

NGC Container Replicator