Uammd
A CUDA project for Molecular Dynamics, Brownian Dynamics, Hydrodynamics... intended to simulate a very generic system constructing a simulation with modules.
Stars: ✭ 11 (-93.37%)
mini-nbody
A simple gravitational N-body simulation in less than 100 lines of C code, with CUDA optimizations.
Stars: ✭ 73 (-56.02%)
dynamic-occupancy-grid-map
Implementation of A Random Finite Set Approach for Dynamic Occupancy Grid Maps with Real-Time Application
Stars: ✭ 89 (-46.39%)
Gpu badmm mt
Bregman ADMM for mass transportation on GPU
Stars: ✭ 10 (-93.98%)
warp
Continuous-energy Monte Carlo neutron transport in general geometries on GPUs
Stars: ✭ 27 (-83.73%)
Thundersvm
ThunderSVM: A Fast SVM Library on GPUs and CPUs
Stars: ✭ 1,282 (+672.29%)
bazel.cmake
bazel.cmake mimics the behavior of bazel to simplify the usability of CMake
Stars: ✭ 38 (-77.11%)
Presentations
Slides and demo code for past presentations
Stars: ✭ 7 (-95.78%)
Jampack
Experimental parallel compression algorithm
Stars: ✭ 21 (-87.35%)
Primitiv
A Neural Network Toolkit.
Stars: ✭ 164 (-1.2%)
SoliditySHA3Miner
All-in-one mixed multi-GPU (nVidia, AMD, Intel) & CPU miner solves proof of work to mine supported EIP918 tokens in a single instance (with API).
Stars: ✭ 28 (-83.13%)
Zluda
CUDA on Intel GPUs
Stars: ✭ 937 (+464.46%)
Arch-Data-Science
Archlinux PKGBUILDs for Data Science, Machine Learning, Deep Learning, NLP and Computer Vision
Stars: ✭ 92 (-44.58%)
cuda memtest
Fork of CUDA GPU memtest 👓
Stars: ✭ 68 (-59.04%)
Thor
Atmospheric fluid dynamics solver optimized for GPUs.
Stars: ✭ 23 (-86.14%)
monolish
monolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
Stars: ✭ 166 (+0%)
rbcuda
CUDA bindings for Ruby
Stars: ✭ 57 (-65.66%)
Sepconv Slomo
An implementation of Video Frame Interpolation via Adaptive Separable Convolution using PyTorch
Stars: ✭ 918 (+453.01%)
JetScan
JetScan: GPU accelerated portable RGB-D reconstruction system
Stars: ✭ 77 (-53.61%)
Knn cuda
pytorch knn [cuda version]
Stars: ✭ 86 (-48.19%)
bifrost
A stream processing framework for high-throughput applications.
Stars: ✭ 48 (-71.08%)
nBody
GPU-accelerated N-Body particle simulator with visualizer.
Stars: ✭ 28 (-83.13%)
k-means
Code accompanying my blog post on k-means in Python, C++ and CUDA
Stars: ✭ 56 (-66.27%)
Wheels
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+436.75%)
peakperf
Achieve peak performance on x86 CPUs and NVIDIA GPUs
Stars: ✭ 33 (-80.12%)
Pytorch Emdloss
PyTorch 1.0 implementation of the approximate Earth Mover's Distance
Stars: ✭ 82 (-50.6%)
gproshan
Geometry processing and shape analysis framework
Stars: ✭ 48 (-71.08%)
Ddsh Tip2018
Source code for paper "Deep Discrete Supervised Hashing"
Stars: ✭ 16 (-90.36%)
allgebra
Base container for developing C++ and Fortran HPC applications
Stars: ✭ 14 (-91.57%)
Warp Rnnt
CUDA-Warp RNN-Transducer
Stars: ✭ 122 (-26.51%)
ctuning-programs
Collective Knowledge extension with unified and customizable benchmarks (with extensible JSON meta information) to be easily integrated with customizable and portable Collective Knowledge workflows. You can easily compile and run these benchmarks using different compilers, environments, hardware and OS (Linux, MacOS, Windows, Android). More info:
Stars: ✭ 41 (-75.3%)
Libcudarange
An interval arithmetic and affine arithmetic library for NVIDIA CUDA
Stars: ✭ 5 (-96.99%)
dbcsr
DBCSR: Distributed Block Compressed Sparse Row matrix library
Stars: ✭ 65 (-60.84%)
Nnabla Ext Cuda
A CUDA Extension of Neural Network Libraries
Stars: ✭ 79 (-52.41%)
Scikit Cuda
Python interface to GPU-powered libraries
Stars: ✭ 803 (+383.73%)
node
GPU-accelerated data science and visualization in node
Stars: ✭ 85 (-48.8%)
Rmm
RAPIDS Memory Manager
Stars: ✭ 154 (-7.23%)
lane detection
Lane detection for the Nvidia Jetson TX2 using OpenCV4Tegra
Stars: ✭ 15 (-90.96%)
Arraymancer
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+377.71%)
libelas-gpu
Implementation of LIBELAS in CUDA.
Stars: ✭ 41 (-75.3%)
Cuda Design Patterns
Some CUDA design patterns and a bit of template magic for CUDA
Stars: ✭ 78 (-53.01%)
Aresdb
A GPU-powered real-time analytics storage and query engine.
Stars: ✭ 2,814 (+1595.18%)
Numba
NumPy aware dynamic Python compiler using LLVM
Stars: ✭ 7,090 (+4171.08%)
Dlami
A Deep Learning Amazon Web Service (AWS) AMI that is open, free and works. Run in less than 5 minutes. TensorFlow, Keras, PyTorch, Theano, MXNet, CNTK, Caffe and all dependencies.
Stars: ✭ 239 (+43.98%)
Babelstream
STREAM, for lots of devices written in many programming models
Stars: ✭ 121 (-27.11%)
Ethereum nvidia miner
💰 USB flash drive ISO image for Ethereum, Zcash and Monero mining with NVIDIA graphics cards and Ubuntu GNU/Linux (headless)
Stars: ✭ 772 (+365.06%)
Jcuda
JCuda - Java bindings for CUDA
Stars: ✭ 165 (-0.6%)
Multi Gpu Programming Models
Examples demonstrating available options to program multiple GPUs in a single node or a cluster
Stars: ✭ 165 (-0.6%)
Clojurecuda
Clojure library for CUDA development
Stars: ✭ 158 (-4.82%)
Dpp
Detail-Preserving Pooling in Deep Networks (CVPR 2018)
Stars: ✭ 99 (-40.36%)
Hornet
Hornet data structure for sparse dynamic graphs and matrices
Stars: ✭ 49 (-70.48%)
Bayadera
High-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (+106.02%)