Warp Ctc: Fast parallel CTC.
Stars: ✭ 3,954 (+6078.13%)
Compute Runtime: Intel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver
Stars: ✭ 593 (+826.56%)
Vuh: Vulkan compute for people
Stars: ✭ 264 (+312.5%)
Cs344: Introduction to Parallel Programming class code
Stars: ✭ 1,051 (+1542.19%)
Ergo: 🧠 A tool that makes AI easier.
Stars: ✭ 264 (+312.5%)
Ocbarrage: OCBarrage, a high-performance barrage (bullet-comment) rendering engine for iOS. It renders 5,000 barrages at once without stuttering and is lightweight, extensible, highly customizable in its animations, and easy to use.
Stars: ✭ 589 (+820.31%)
Gocv: Go package for computer vision using OpenCV 4 and beyond.
Stars: ✭ 4,511 (+6948.44%)
Atlas: An Open Source, Self-Hosted Platform For Applied Deep Learning Development
Stars: ✭ 259 (+304.69%)
Taskflow: A General-purpose Parallel and Heterogeneous Task Programming System
Stars: ✭ 6,128 (+9475%)
Go Transcode: Live on-demand transcoding in Go using ffmpeg, also with NVIDIA GPU hardware acceleration.
Stars: ✭ 39 (-39.06%)
Jetsonjs: Embed a JavaScript/WebGL application on an NVIDIA Jetson TX2 and stream the results through WebSockets. It does not rely on CUDA/JetPack. HDMI touchscreen, virtual keyboard, GPIO control, and Wi-Fi configuration are included.
Stars: ✭ 18 (-71.87%)
Serving: A flexible, high-performance carrier for machine learning models (PaddlePaddle serving deployment framework)
Stars: ✭ 403 (+529.69%)
Hetu: A high-performance distributed deep learning system targeting large-scale and automated distributed training.
Stars: ✭ 78 (+21.88%)
Scanner: Efficient video analysis at scale
Stars: ✭ 569 (+789.06%)
Torch-TensorRT: PyTorch/TorchScript compiler for NVIDIA GPUs using TensorRT
Stars: ✭ 1,216 (+1800%)
GPU-Jupyterhub: Setting up a JupyterHub Docker container to spawn Jupyter notebooks with GPU support (containing TensorFlow, PyTorch, and Keras)
Stars: ✭ 23 (-64.06%)
Hornet: Hornet data structure for sparse dynamic graphs and matrices
Stars: ✭ 49 (-23.44%)
tt7zcrack: Fast 7zip crack assistant tool that supports GPU/CPU, written in Python.
Stars: ✭ 12 (-81.25%)
Alphapose: Real-Time and Accurate Full-Body Multi-Person Pose Estimation & Tracking System
Stars: ✭ 5,697 (+8801.56%)
Open-GPGPU-FlexGrip-: FlexGripPlus, an open-source GPU model for reliability evaluation and microarchitectural simulation
Stars: ✭ 15 (-76.56%)
cuda2GLcore: Implementation of CUDA to OpenGL rendering
Stars: ✭ 46 (-28.12%)
Pytorch Pwc: a reimplementation of PWC-Net in PyTorch that matches the official Caffe version
Stars: ✭ 402 (+528.13%)
Gpuvideo Android: This library applies video filters when generating an MP4, on ExoPlayer video, and during video recording with Camera2.
Stars: ✭ 403 (+529.69%)
Hyperband: Tuning hyperparams fast with Hyperband
Stars: ✭ 555 (+767.19%)
Cuda Samples: Samples for CUDA developers that demonstrate features in the CUDA Toolkit
Stars: ✭ 1,087 (+1598.44%)
Gmatrix: R package for unleashing the power of NVIDIA GPUs
Stars: ✭ 16 (-75%)
Neuralmonkey: An open-source tool for sequence learning in NLP built on TensorFlow.
Stars: ✭ 400 (+525%)
Quaternions-Revisited: Sample code for a 'Quaternions revisited' article from GPU Pro 5
Stars: ✭ 30 (-53.12%)
Vkquake2: id Software's Quake 2 v3.21 with mission packs and Vulkan support (Windows, Linux, macOS, FreeBSD, Raspberry Pi 4)
Stars: ✭ 543 (+748.44%)
Oceananigans.jl: 🌊 Fast and friendly fluid dynamics on CPUs and GPUs
Stars: ✭ 400 (+525%)
revisiting-sepconv: an implementation of Revisiting Adaptive Convolutions for Video Frame Interpolation using PyTorch
Stars: ✭ 43 (-32.81%)
Nvparse: Fast, GPU-based CSV parser
Stars: ✭ 533 (+732.81%)
Ddsh Tip2018: source code for the paper "Deep Discrete Supervised Hashing"
Stars: ✭ 16 (-75%)
Cubert: Fast implementation of BERT inference directly on NVIDIA (CUDA, CUBLAS) and Intel MKL
Stars: ✭ 395 (+517.19%)
disptools: Generate displacement fields with known volume changes
Stars: ✭ 17 (-73.44%)
Trainyourownyolo: Train a state-of-the-art YOLOv3 object detector from scratch!
Stars: ✭ 399 (+523.44%)
mini-nbody: A simple gravitational N-body simulation in less than 100 lines of C code, with CUDA optimizations.
Stars: ✭ 73 (+14.06%)
Minkowskiengine: Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors
Stars: ✭ 1,110 (+1634.38%)
Gfx: [maintenance mode] A low-overhead Vulkan-like GPU API for Rust.
Stars: ✭ 5,045 (+7782.81%)
Style Feature Reshuffle: Caffe implementation of "Arbitrary Style Transfer with Deep Feature Reshuffle"
Stars: ✭ 38 (-40.62%)
Cudadbclustering: Clustering via graphics processor, using the NVIDIA CUDA SDK to perform database clustering on the massively parallel graphics card processor
Stars: ✭ 6 (-90.62%)
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python
Stars: ✭ 4,819 (+7429.69%)
SilentXMRMiner: A Silent (Hidden) Monero (XMR) Miner Builder
Stars: ✭ 417 (+551.56%)
cresset: Template repository to build PyTorch projects from source on any version of PyTorch/CUDA/cuDNN.
Stars: ✭ 573 (+795.31%)
Lyra
Stars: ✭ 43 (-32.81%)
Cloud Gpus: This repository contains information about Cloud GPU offerings for Machine Learning practitioners.
Stars: ✭ 395 (+517.19%)
Turbotransformers: a fast and user-friendly runtime for transformer inference (BERT, ALBERT, GPT2, decoders, etc.) on CPU and GPU.
Stars: ✭ 826 (+1190.63%)
Cudanative.jl: Julia support for native CUDA programming
Stars: ✭ 393 (+514.06%)
Nvbio Gpl: NVBIO is a library of reusable components designed to accelerate bioinformatics applications using CUDA.
Stars: ✭ 56 (-12.5%)
Soul Engine: Physically based renderer and simulation engine for real-time applications.
Stars: ✭ 37 (-42.19%)
Tensorflow.jl: A Julia wrapper for TensorFlow
Stars: ✭ 822 (+1184.38%)
Ganet: GA-Net: Guided Aggregation Net for End-to-end Stereo Matching
Stars: ✭ 393 (+514.06%)