51. CubCooperative primitives for CUDA C++.
54. Q2rtxNVIDIA’s implementation of RTX ray-tracing in Quake II
56. Nv WavenetReference implementation of real-time autoregressive wavenet inference
59. NvvlA library that uses hardware acceleration to load sequences of video frames to facilitate machine learning training
61. Pix2pixhdSynthesizing and manipulating 2048x1024 images with conditional GANs
62. ApexA PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
63. TrtorchPyTorch/TorchScript compiler for NVIDIA GPUs using TensorRT
64. MellotronMellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
65. FlowtronFlowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer
67. VideoprocessingframeworkSet of Python bindings to C++ libraries which provides full HW acceleration for video decoding, encoding and GPU-accelerated color space and pixel format conversions
68. RunxDeep Learning Experiment Management
73. GdrcopyA fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
75. NvpipeNVIDIA-accelerated zero latency video compression library for interactive remoting applications
76. AistoreAIStore: scalable storage for AI applications
77. HugectrHugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training
79. TensorrtTensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.
80. JitifyA single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).
82. PyprofA GPU performance profiling tool for PyTorch models
83. NvtabularNVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.
85. DaliA GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
86. Gpu OperatorNVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes
90. cudnn-frontendcudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it
92. GPUStressTestGPU Stress Test is a tool to stress the compute engine of NVIDIA Tesla GPU’s by running a BLAS matrix multiply using different data types. It can be compiled and run on both Linux and Windows.
93. vdiscVDisc is a tool for creating and mounting virtual CD-ROM images backed by object storage
94. MatXAn efficient C++17 GPU numerical computing library with Python-like syntax
99. NVFlareNVIDIA Federated Learning Application Runtime Environment