Training MaterialA collection of code examples as well as presentations for training purposes
Stars: ✭ 85 (-41.78%)
CudfcuDF - GPU DataFrame Library
Stars: ✭ 4,370 (+2893.15%)
PytorchPyTorch tutorials A to Z
Stars: ✭ 87 (-40.41%)
ThundersvmThunderSVM: A Fast SVM Library on GPUs and CPUs
Stars: ✭ 1,282 (+778.08%)
OdasODAS: Open embeddeD Audition System
Stars: ✭ 435 (+197.95%)
Poppy HumanoidPoppy Humanoid is an open-source and 3D printed humanoid robot. Optimized for research and education purposes, its modularity allows for a wide range of applications and experimentations.
Stars: ✭ 419 (+186.99%)
PynvvlA Python wrapper of NVIDIA Video Loader (NVVL) with CuPy for fast video loading with Python
Stars: ✭ 95 (-34.93%)
RustacudaRusty wrapper for the CUDA Driver API
Stars: ✭ 511 (+250%)
Cuda Design PatternsSome CUDA design patterns and a bit of template magic for CUDA
Stars: ✭ 78 (-46.58%)
XpediteA non-sampling profiler purpose built to measure and optimize performance of ultra low latency/real time systems
Stars: ✭ 89 (-39.04%)
SupraSUPRA: Software Defined Ultrasound Processing for Real-Time Applications - An Open Source 2D and 3D Pipeline from Beamforming to B-Mode
Stars: ✭ 96 (-34.25%)
Compute RuntimeIntel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver
Stars: ✭ 593 (+306.16%)
ThundergbmThunderGBM: Fast GBDTs and Random Forests on GPUs
Stars: ✭ 586 (+301.37%)
VexclVexCL is a C++ vector expression template library for OpenCL/CUDA/OpenMP
Stars: ✭ 626 (+328.77%)
TaskflowA General-purpose Parallel and Heterogeneous Task Programming System
Stars: ✭ 6,128 (+4097.26%)
GunrockHigh-Performance Graph Primitives on GPUs
Stars: ✭ 718 (+391.78%)
MarianFast Neural Machine Translation in C++
Stars: ✭ 777 (+432.19%)
TrtorchPyTorch/TorchScript compiler for NVIDIA GPUs using TensorRT
Stars: ✭ 583 (+299.32%)
WheelsPerformance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+510.27%)
HashcatWorld's fastest and most advanced password recovery utility
Stars: ✭ 11,014 (+7443.84%)
Scikit CudaPython interface to GPU-powered libraries
Stars: ✭ 803 (+450%)
Nvidia libs testTests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (-75.34%)
CudaExperiments with CUDA and Rust
Stars: ✭ 31 (-78.77%)
Qualia2.0Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.
Stars: ✭ 41 (-71.92%)
ThrustThe C++ parallel algorithms library.
Stars: ✭ 3,595 (+2362.33%)
PycudaCUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (+661.64%)
Tsne CudaGPU Accelerated t-SNE for CUDA with Python bindings
Stars: ✭ 1,120 (+667.12%)
ComputeA C++ GPU Computing Library for OpenCL
Stars: ✭ 1,192 (+716.44%)
Tensorflow Object Detection TutorialThe purpose of this tutorial is to learn how to install and prepare TensorFlow framework to train your own convolutional neural network object detection classifier for multiple objects, starting from scratch
Stars: ✭ 113 (-22.6%)
SpocStream Processing with OCaml
Stars: ✭ 115 (-21.23%)
CekirdeklerMulti-device OpenCL kernel load balancer and pipeliner API for C#. Uses shared-distributed memory model to keep GPUs updated fast while using same kernel on all devices(for simplicity).
Stars: ✭ 76 (-47.95%)
HeteroflowConcurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-60.96%)
Glove As A Tensorflow Embedding LayerTaking a pretrained GloVe model, and using it as a TensorFlow embedding weight layer **inside the GPU**. Therefore, you only need to send the index of the words through the GPU data transfer bus, reducing data transfer overhead.
Stars: ✭ 85 (-41.78%)
MprReference implementation for "Massively Parallel Rendering of Complex Closed-Form Implicit Surfaces" (SIGGRAPH 2020)
Stars: ✭ 84 (-42.47%)
Aardvark.renderingThe dependency-aware, high-performance aardvark rendering engine. This repo is part of aardvark - an open-source platform for visual computing, real-time graphics and visualization.
Stars: ✭ 79 (-45.89%)
NumerNumeric Erlang - vector and matrix operations with CUDA. Heavily inspired by Pteracuda - https://github.com/kevsmith/pteracuda
Stars: ✭ 91 (-37.67%)
Deeppipe2Deep Learning library using GPU(CUDA/cuBLAS)
Stars: ✭ 90 (-38.36%)
EmuThe write-once-run-anywhere GPGPU library for Rust
Stars: ✭ 1,350 (+824.66%)
Carlsim3CARLsim is an efficient, easy-to-use, GPU-accelerated software framework for simulating large-scale spiking neural network (SNN) models with a high degree of biological detail.
Stars: ✭ 52 (-64.38%)
GrlRobotics tools in C++11. Implements soft real time arm drivers for Kuka LBR iiwa plus V-REP, ROS, Constrained Optimization based planning, Hand Eye Calibration and Inverse Kinematics integration.
Stars: ✭ 105 (-28.08%)
PygraphistryPyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer
Stars: ✭ 1,365 (+834.93%)
CuxfilterGPU accelerated cross filtering with cuDF.
Stars: ✭ 128 (-12.33%)
OnemkloneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (-16.44%)
BabelstreamSTREAM, for lots of devices written in many programming models
Stars: ✭ 121 (-17.12%)
Curved Lane Linesdetect curved lane lines using HSV filtering and sliding window search.
Stars: ✭ 100 (-31.51%)
ContactposeLarge dataset of hand-object contact, hand- and object-pose, and 2.9 M RGB-D grasp images.
Stars: ✭ 129 (-11.64%)
Ds bowl 2018Kaggle Data Science Bowl 2018
Stars: ✭ 116 (-20.55%)
Ipyexperimentsjupyter/ipython experiment containers for GPU and general RAM re-use
Stars: ✭ 128 (-12.33%)
MixbenchA GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL)
Stars: ✭ 130 (-10.96%)
SpanetSpatial Attentive Single-Image Deraining with a High Quality Real Rain Dataset (CVPR'19)
Stars: ✭ 136 (-6.85%)
Poppy Ergo Jr🤖 Poppy Ergo Jr is an open-source robotic arm based on modular 3D printed conception and low-cost XL-320 motors.
Stars: ✭ 133 (-8.9%)
Multi Task RefinenetMulti-Task (Joint Segmentation / Depth / Surface Normas) Real-Time Light-Weight RefineNet
Stars: ✭ 139 (-4.79%)
RaspberryturkThe Raspberry Turk is a robot that can play chess—it's entirely open source, based on Raspberry Pi, and inspired by the 18th century chess playing machine, the Mechanical Turk.
Stars: ✭ 140 (-4.11%)