HipHIP: C++ Heterogeneous-Compute Interface for Portability
Stars: ✭ 2,609 (+1029.44%)
NsimdAgenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (-40.26%)
SpanetSpatial Attentive Single-Image Deraining with a High Quality Real Rain Dataset (CVPR'19)
Stars: ✭ 136 (-41.13%)
Pedestrian alignmentTCSVT2018 Pedestrian Alignment Network for Large-scale Person Re-identification
Stars: ✭ 223 (-3.46%)
DragonDragon: A Computation Graph Virtual Machine Based Deep Learning Framework.
Stars: ✭ 168 (-27.27%)
NnvmNo description or website provided.
Stars: ✭ 1,639 (+609.52%)
Cunn Stars: ✭ 205 (-11.26%)
MixbenchA GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL)
Stars: ✭ 130 (-43.72%)
FloorA C++ Compute/Graphics Library and Toolchain enabling same-source CUDA/Host/Metal/OpenCL/Vulkan C++ programming and execution.
Stars: ✭ 166 (-28.14%)
Pytorch Heda reimplementation of Holistically-Nested Edge Detection in PyTorch
Stars: ✭ 228 (-1.3%)
Kaldikaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+4727.27%)
SporcoSparse Optimisation Research Code
Stars: ✭ 164 (-29%)
FcisFully Convolutional Instance-aware Semantic Segmentation
Stars: ✭ 1,563 (+576.62%)
Pine🌲 Aimbot powered by real-time object detection with neural networks, GPU accelerated with Nvidia. Optimized for use with CS:GO.
Stars: ✭ 202 (-12.55%)
OnemkloneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (-47.19%)
JcudaJCuda - Java bindings for CUDA
Stars: ✭ 165 (-28.57%)
Knn cudaFast K-Nearest Neighbor search with GPU
Stars: ✭ 119 (-48.48%)
RelionImage-processing software for cryo-electron microscopy
Stars: ✭ 219 (-5.19%)
SpocStream Processing with OCaml
Stars: ✭ 115 (-50.22%)
Multi Gpu Programming ModelsExamples demonstrating available options to program multiple GPUs in a single node or a cluster
Stars: ✭ 165 (-28.57%)
CltuneCLTune: An automatic OpenCL & CUDA kernel tuner
Stars: ✭ 114 (-50.65%)
Tensorflow Object Detection TutorialThe purpose of this tutorial is to learn how to install and prepare TensorFlow framework to train your own convolutional neural network object detection classifier for multiple objects, starting from scratch
Stars: ✭ 113 (-51.08%)
Cx db8a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (Bert, Universal Sentence Encoder, Flair)
Stars: ✭ 164 (-29%)
Adacof PytorchOfficial source code for our paper "AdaCoF: Adaptive Collaboration of Flows for Video Frame Interpolation" (CVPR 2020)
Stars: ✭ 110 (-52.38%)
CuspatialCUDA-accelerated GIS and spatiotemporal algorithms
Stars: ✭ 229 (-0.87%)
CuheCUDA Homomorphic Encryption Library
Stars: ✭ 109 (-52.81%)
ClojurecudaClojure library for CUDA development
Stars: ✭ 158 (-31.6%)
HashcatWorld's fastest and most advanced password recovery utility
Stars: ✭ 11,014 (+4667.97%)
TimemoryModular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template API is essentially a framework to creating tools: it is designed to provide a unifying interface for recording various performance measurements alongside data logging and interfaces to other tools.
Stars: ✭ 192 (-16.88%)
PygraphistryPyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer
Stars: ✭ 1,365 (+490.91%)
TigreTIGRE: Tomographic Iterative GPU-based Reconstruction Toolbox
Stars: ✭ 215 (-6.93%)
DppDetail-Preserving Pooling in Deep Networks (CVPR 2018)
Stars: ✭ 99 (-57.14%)
Cumf alsCUDA Matrix Factorization Library with Alternating Least Square (ALS)
Stars: ✭ 154 (-33.33%)
SupraSUPRA: Software Defined Ultrasound Processing for Real-Time Applications - An Open Source 2D and 3D Pipeline from Beamforming to B-Mode
Stars: ✭ 96 (-58.44%)
Pytorch Spynet a reimplementation of Optical Flow Estimation using a Spatial Pyramid Network in PyTorch
Stars: ✭ 190 (-17.75%)
Region ConvNot All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade
Stars: ✭ 95 (-58.87%)
CompactcnncascadeA binary library for very fast face detection using compact CNNs.
Stars: ✭ 152 (-34.2%)
NumerNumeric Erlang - vector and matrix operations with CUDA. Heavily inspired by Pteracuda - https://github.com/kevsmith/pteracuda
Stars: ✭ 91 (-60.61%)
Nvidia Modded InfModified nVidia .inf files to run drivers on all video cards, research & telemetry free drivers
Stars: ✭ 227 (-1.73%)
ElasticfusionReal-time dense visual SLAM system
Stars: ✭ 1,298 (+461.9%)
AuroraMinimal Deep Learning library is written in Python/Cython/C++ and Numpy/CUDA/cuDNN.
Stars: ✭ 90 (-61.04%)
Nvidia DockerBuild and run Docker containers leveraging NVIDIA GPUs
Stars: ✭ 13,961 (+5943.72%)
HallocA fast and highly scalable GPU dynamic memory allocator
Stars: ✭ 89 (-61.47%)
Cuda CnnCNN accelerated by cuda. Test on mnist and finilly get 99.76%
Stars: ✭ 148 (-35.93%)
HasteHaste: a fast, simple, and open RNN library
Stars: ✭ 214 (-7.36%)
Optix PathtracerSimple physically based path tracer based on Nvidia's Optix Ray Tracing Engine
Stars: ✭ 231 (+0%)
TengineTengine is a lite, high performance, modular inference engine for embedded device
Stars: ✭ 4,012 (+1636.8%)
DeepspeechDeepSpeech neon implementation
Stars: ✭ 223 (-3.46%)
BohriumAutomatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Stars: ✭ 209 (-9.52%)
Ssd Gpu DmaBuild userspace NVMe drivers and storage applications with CUDA support
Stars: ✭ 172 (-25.54%)
RemoterySingle C file, Realtime CPU/GPU Profiler with Remote Web Viewer
Stars: ✭ 1,908 (+725.97%)