@ICCV2017: For exploiting second-order statistics, we propose Matrix Power Normalized Covariance pooling (MPN-COV) ConvNets, different from and outperforming those using global average pooling.

Stars: ✭ 63 (-58.82%)

Mutual labels: cuda

Libcudacxx

The C++ Standard Library for your entire system.

Stars: ✭ 1,861 (+1116.34%)

Mutual labels: cuda

Ggnn

GGNN: State of the Art Graph-based GPU Nearest Neighbor Search

Stars: ✭ 63 (-58.82%)

Mutual labels: cuda

Dace

DaCe - Data Centric Parallel Programming

Stars: ✭ 106 (-30.72%)

Mutual labels: cuda

Gdax Orderbook Ml

Application of machine learning to the Coinbase (GDAX) orderbook

Stars: ✭ 60 (-60.78%)

Mutual labels: cuda

Jetson

Helmut Hoffer von Ankershoffen experimenting with arm64-based NVIDIA Jetson (Nano and AGX Xavier) edge devices running Kubernetes (K8s) for machine learning (ML), including Jupyter Notebooks, TensorFlow Training, and TensorFlow Serving, using CUDA for smart IoT.

Stars: ✭ 151 (-1.31%)

Mutual labels: cuda

Pycuda

CUDA integration for Python, plus shiny features

Stars: ✭ 1,112 (+626.8%)

Mutual labels: cuda

Cuda Winograd

Fast CUDA Kernels for ResNet Inference.

Stars: ✭ 104 (-32.03%)

Mutual labels: cuda

Pytorch Baidu Ctc

PyTorch bindings for Baidu's Warp-CTC

Stars: ✭ 61 (-60.13%)

Mutual labels: cuda

Agency

Execution primitives for C++

Stars: ✭ 127 (-16.99%)

Mutual labels: cuda

Flattened Cnn

Flattened convolutional neural networks (1D convolution modules for Torch nn)

Stars: ✭ 59 (-61.44%)

Mutual labels: cuda

Deepnet

Deep.Net machine learning framework for F#

Stars: ✭ 99 (-35.29%)

Mutual labels: cuda

Dokai

Collection of Docker images for ML/DL and video processing projects

Stars: ✭ 58 (-62.09%)

Mutual labels: cuda

Libgdf

[ARCHIVED] C GPU DataFrame Library

Stars: ✭ 142 (-7.19%)

Mutual labels: cuda

Heteroflow

Concurrent CPU-GPU Programming using Task Models

Stars: ✭ 57 (-62.75%)

Mutual labels: cuda

Extending Jax

Extending JAX with custom C++ and CUDA code

Stars: ✭ 98 (-35.95%)

Mutual labels: cuda

Nvbio Gpl

NVBIO is a library of reusable components designed to accelerate bioinformatics applications using CUDA.

Stars: ✭ 56 (-63.4%)

Mutual labels: cuda

Py Faster Rcnn Windows

py-faster-rcnn that compiles directly on Windows

Stars: ✭ 126 (-17.65%)

Mutual labels: cuda

Dink

Point cloud deep learning framework

Stars: ✭ 56 (-63.4%)

Mutual labels: cuda

Pynvvl

A Python wrapper of NVIDIA Video Loader (NVVL) with CuPy for fast video loading with Python

Stars: ✭ 95 (-37.91%)

Mutual labels: cuda

Deformable conv2d pytorch

deformable_conv2d layer implemented in PyTorch

Stars: ✭ 53 (-65.36%)

Mutual labels: cuda

Sketchgraphs

A dataset of 15 million CAD sketches with geometric constraint graphs.

Stars: ✭ 148 (-3.27%)

Mutual labels: cuda

Carlsim3

CARLsim is an efficient, easy-to-use, GPU-accelerated software framework for simulating large-scale spiking neural network (SNN) models with a high degree of biological detail.

Stars: ✭ 52 (-66.01%)

Mutual labels: cuda

Fbtt Embedding

A Tensor Train based compression library for the sparse embedding tables used in large-scale machine learning models, such as recommendation and natural language processing models. We showed that this library can reduce the total model size by up to 100x in Facebook's open-sourced DLRM model while achieving the same model quality. Our implementation is faster than the state-of-the-art implementations. The existing state-of-the-art libraries also decompress whole embedding tables on the fly, so they do not reduce memory use at training time. Our library decompresses only the requested rows, and can therefore provide a 10,000x memory footprint reduction per embedding table. The library also includes a software cache that stores a portion of the table entries in decompressed form for faster lookup and processing.

Stars: ✭ 92 (-39.87%)

Mutual labels: cuda

Cs344

Introduction to Parallel Programming class code

Stars: ✭ 1,051 (+586.93%)

Mutual labels: cuda

Waveglow inference in cuda

C++ code to run WaveGlow inference in CUDA

Stars: ✭ 125 (-18.3%)

Mutual labels: cuda

Singularity Tutorial

Tutorial for using Singularity containers