Gpu badmm mtBregman ADMM for mass transportation on GPU
Stars: ✭ 10 (-93.9%)
dynamic-occupancy-grid-mapImplementation of A Random Finite Set Approach for Dynamic Occupancy Grid Maps with Real-Time Application
Stars: ✭ 89 (-45.73%)
warpcontinuous energy monte carlo neutron transport in general geometries on GPUs
Stars: ✭ 27 (-83.54%)
PresentationsSlides and demo code for past presentations
Stars: ✭ 7 (-95.73%)
bazel.cmakebazel.cmake mimics the behavior of bazel to simplify the usability of CMake
Stars: ✭ 38 (-76.83%)
JampackExperimental parallel compression algorithm
Stars: ✭ 21 (-87.2%)
ZludaCUDA on Intel GPUs
Stars: ✭ 937 (+471.34%)
SoliditySHA3MinerAll-in-one mixed multi-GPU (nVidia, AMD, Intel) & CPU miner solves proof of work to mine supported EIP918 tokens in a single instance (with API).
Stars: ✭ 28 (-82.93%)
KhivaAn open-source library of algorithms to analyse time series in GPU and CPU.
Stars: ✭ 161 (-1.83%)
Arch-Data-ScienceArchlinux PKGBUILDs for Data Science, Machine Learning, Deep Learning, NLP and Computer Vision
Stars: ✭ 92 (-43.9%)
ThorAtmospheric fluid dynamics solver optimized for GPUs.
Stars: ✭ 23 (-85.98%)
cuda memtestFork of CUDA GPU memtest 👓
Stars: ✭ 68 (-58.54%)
Knn cudapytorch knn [cuda version]
Stars: ✭ 86 (-47.56%)
monolishmonolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
Stars: ✭ 166 (+1.22%)
Sepconv Slomoan implementation of Video Frame Interpolation via Adaptive Separable Convolution using PyTorch
Stars: ✭ 918 (+459.76%)
rbcudaCUDA bindings for Ruby
Stars: ✭ 57 (-65.24%)
Warp RnntCUDA-Warp RNN-Transducer
Stars: ✭ 122 (-25.61%)
JetScanJetScan : GPU accelerated portable RGB-D reconstruction system
Stars: ✭ 77 (-53.05%)
bifrostA stream processing framework for high-throughput applications.
Stars: ✭ 48 (-70.73%)
Pytorch EmdlossPyTorch 1.0 implementation of the approximate Earth Mover's Distance
Stars: ✭ 82 (-50%)
nBodyGPU-accelerated N-Body particle simulator with visualizer.
Stars: ✭ 28 (-82.93%)
WheelsPerformance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+443.29%)
k-meansCode accompanying my blog post on k-means in Python, C++ and CUDA
Stars: ✭ 56 (-65.85%)
RemoterySingle C file, Realtime CPU/GPU Profiler with Remote Web Viewer
Stars: ✭ 1,908 (+1063.41%)
peakperfAchieve peak performance on x86 CPUs and NVIDIA GPUs
Stars: ✭ 33 (-79.88%)
Ddsh Tip2018source code for paper "Deep Discrete Supervised Hashing"
Stars: ✭ 16 (-90.24%)
gproshangeometry processing and shape analysis framework
Stars: ✭ 48 (-70.73%)
Nnabla Ext CudaA CUDA Extension of Neural Network Libraries
Stars: ✭ 79 (-51.83%)
allgebraBase container for developing C++ and Fortran HPC applications
Stars: ✭ 14 (-91.46%)
LibcudarangeAn interval arithmetic and affine arithmetic library for NVIDIA CUDA
Stars: ✭ 5 (-96.95%)
ctuning-programsCollective Knowledge extension with unified and customizable benchmarks (with extensible JSON meta information) to be easily integrated with customizable and portable Collective Knowledge workflows. You can easily compile and run these benchmarks using different compilers, environments, hardware and OS (Linux, MacOS, Windows, Android). More info:
Stars: ✭ 41 (-75%)
BabelstreamSTREAM, for lots of devices written in many programming models
Stars: ✭ 121 (-26.22%)
dbcsrDBCSR: Distributed Block Compressed Sparse Row matrix library
Stars: ✭ 65 (-60.37%)
Scikit CudaPython interface to GPU-powered libraries
Stars: ✭ 803 (+389.63%)
Cuda Design PatternsSome CUDA design patterns and a bit of template magic for CUDA
Stars: ✭ 78 (-52.44%)
nodeGPU-accelerated data science and visualization in node
Stars: ✭ 85 (-48.17%)
ArraymancerA fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+383.54%)
lane detectionLane detection for the Nvidia Jetson TX2 using OpenCV4Tegra
Stars: ✭ 15 (-90.85%)
DsmnetDomain-invariant Stereo Matching Networks
Stars: ✭ 153 (-6.71%)
libelas-gpuImplementation of LIBELAS in cuda.
Stars: ✭ 41 (-75%)
NumbaNumPy aware dynamic Python compiler using LLVM
Stars: ✭ 7,090 (+4223.17%)
AresdbA GPU-powered real-time analytics storage and query engine.
Stars: ✭ 2,814 (+1615.85%)
Cudart.jlJulia wrapper for CUDA runtime API
Stars: ✭ 75 (-54.27%)
DlamiA Deep Learning Amazon Web Service (AWS) AMI that is open, free and works. Run in less than 5 minutes. TensorFlow, Keras, PyTorch, Theano, MXNet, CNTK, Caffe and all dependencies.
Stars: ✭ 239 (+45.73%)
Ethereum nvidia miner💰 USB flash drive ISO image for Ethereum, Zcash and Monero mining with NVIDIA graphics cards and Ubuntu GNU/Linux (headless)
Stars: ✭ 772 (+370.73%)
Tensorflow Optimized WheelsTensorFlow wheels built for latest CUDA/CuDNN and enabled performance flags: SSE, AVX, FMA; XLA
Stars: ✭ 118 (-28.05%)
JuiceThe Hacker's Machine Learning Engine
Stars: ✭ 743 (+353.05%)
Multi Gpu Programming ModelsExamples demonstrating available options to program multiple GPUs in a single node or a cluster
Stars: ✭ 165 (+0.61%)
Cx db8a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (Bert, Universal Sentence Encoder, Flair)
Stars: ✭ 164 (+0%)
Cuda CnnCNN accelerated by cuda. Test on mnist and finilly get 99.76%
Stars: ✭ 148 (-9.76%)
NnvmNo description or website provided.
Stars: ✭ 1,639 (+899.39%)
SupraSUPRA: Software Defined Ultrasound Processing for Real-Time Applications - An Open Source 2D and 3D Pipeline from Beamforming to B-Mode
Stars: ✭ 96 (-41.46%)
Slic cudaSuperpixel SLIC for GPU (CUDA)
Stars: ✭ 45 (-72.56%)
ArrayfireArrayFire: a general purpose GPU library.
Stars: ✭ 3,693 (+2151.83%)