Lyra Stars: ✭ 43 (-79.23%)
PopsiftPopSift is an implementation of the SIFT algorithm in CUDA.
Stars: ✭ 259 (+25.12%)
JetsonHelmut Hoffer von Ankershoffen experimenting with arm64 based NVIDIA Jetson (Nano and AGX Xavier) edge devices running Kubernetes (K8s) for machine learning (ML) including Jupyter Notebooks, TensorFlow Training and TensorFlow Serving using CUDA for smart IoT.
Stars: ✭ 151 (-27.05%)
gpu-monitorScript to remotely check GPU servers for free GPUs
Stars: ✭ 85 (-58.94%)
Futhark💥💻💥 A data-parallel functional programming language
Stars: ✭ 1,641 (+692.75%)
Torch-TensorRTPyTorch/TorchScript compiler for NVIDIA GPUs using TensorRT
Stars: ✭ 1,216 (+487.44%)
SixtyfourHow fast can we brute force a 64-bit comparison?
Stars: ✭ 41 (-80.19%)
crowdsource-video-experiments-on-androidCrowdsourcing video experiments (such as collaborative benchmarking and optimization of DNN algorithms) using Collective Knowledge Framework across diverse Android devices provided by volunteers. Results are continuously aggregated in the open repository:
Stars: ✭ 29 (-85.99%)
SimplegpuhashtableA simple GPU hash table implemented in CUDA using lock free techniques
Stars: ✭ 198 (-4.35%)
desertA fast (?) random sampling drawing library
Stars: ✭ 61 (-70.53%)
NbodyN body gravity attraction problem solver
Stars: ✭ 40 (-80.68%)
CPP-ProgrammingVarious C/C++ examples. DirectX, OpenGL, CUDA, Vulkan, OpenCL.
Stars: ✭ 30 (-85.51%)
hipaccA domain-specific language and compiler for image processing
Stars: ✭ 72 (-65.22%)
Soul EnginePhysically based renderer and simulation engine for real-time applications.
Stars: ✭ 37 (-82.13%)
tensorflow-windowsTensorFlow builds compiled on windows with avx and avx2 extensions
Stars: ✭ 20 (-90.34%)
GinkgoNumerical linear algebra software package
Stars: ✭ 149 (-28.02%)
QPT[内测中]前向式Python环境快捷封装工具,快速将Python打包为EXE并添加CUDA、NoAVX等支持。
Stars: ✭ 308 (+48.79%)
Nvidia libs testTests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (-82.61%)
octotigerAstrophysics program simulating the evolution of star systems based on the fast multipole method on adaptive Octrees
Stars: ✭ 30 (-85.51%)
DaceDaCe - Data Centric Parallel Programming
Stars: ✭ 106 (-48.79%)
ThrustRTCCUDA tool set for non-C++ languages that provides similar functionality like Thrust, with NVRTC at its core.
Stars: ✭ 41 (-80.19%)
Object Detection And Location Realsensed435Use the Intel D435 real-sensing camera to realize target detection based on the Yolov3 framework under the Opencv DNN framework, and realize the 3D positioning of the Objection according to the depth information. Real-time display of the coordinates in the camera coordinate system.ADD--Using Yolov5 By TensorRT model,AGX-Xavier,RealTime Object Detection
Stars: ✭ 36 (-82.61%)
mini-nbodyA simple gravitational N-body simulation in less than 100 lines of C code, with CUDA optimizations.
Stars: ✭ 73 (-64.73%)
Deformable KernelsDeforming kernels to adapt towards object deformation. In ICLR 2020.
Stars: ✭ 166 (-19.81%)
dynamic-occupancy-grid-mapImplementation of A Random Finite Set Approach for Dynamic Occupancy Grid Maps with Real-Time Application
Stars: ✭ 89 (-57%)
warpcontinuous energy monte carlo neutron transport in general geometries on GPUs
Stars: ✭ 27 (-86.96%)
Cuda WinogradFast CUDA Kernels for ResNet Inference.
Stars: ✭ 104 (-49.76%)
bazel.cmakebazel.cmake mimics the behavior of bazel to simplify the usability of CMake
Stars: ✭ 38 (-81.64%)
CudaExperiments with CUDA and Rust
Stars: ✭ 31 (-85.02%)
JampackExperimental parallel compression algorithm
Stars: ✭ 21 (-89.86%)
SketchgraphsA dataset of 15 million CAD sketches with geometric constraint graphs.
Stars: ✭ 148 (-28.5%)
SoliditySHA3MinerAll-in-one mixed multi-GPU (nVidia, AMD, Intel) & CPU miner solves proof of work to mine supported EIP918 tokens in a single instance (with API).
Stars: ✭ 28 (-86.47%)
Arch-Data-ScienceArchlinux PKGBUILDs for Data Science, Machine Learning, Deep Learning, NLP and Computer Vision
Stars: ✭ 92 (-55.56%)
DeepnetDeep.Net machine learning framework for F#
Stars: ✭ 99 (-52.17%)
cuda memtestFork of CUDA GPU memtest 👓
Stars: ✭ 68 (-67.15%)
Des CudaDES cracking using brute force algorithm and CUDA
Stars: ✭ 21 (-89.86%)
monolishmonolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
Stars: ✭ 166 (-19.81%)
Macos Egpu Cuda GuideSet up CUDA for machine learning (and gaming) on macOS using a NVIDIA eGPU
Stars: ✭ 187 (-9.66%)
rbcudaCUDA bindings for Ruby
Stars: ✭ 57 (-72.46%)
JetScanJetScan : GPU accelerated portable RGB-D reconstruction system
Stars: ✭ 77 (-62.8%)
Extending JaxExtending JAX with custom C++ and CUDA code
Stars: ✭ 98 (-52.66%)
bifrostA stream processing framework for high-throughput applications.
Stars: ✭ 48 (-76.81%)
UammdA CUDA project for Molecular Dynamics, Brownian Dynamics, Hydrodynamics... intended to simulate a very generic system constructing a simulation with modules.
Stars: ✭ 11 (-94.69%)
nBodyGPU-accelerated N-Body particle simulator with visualizer.
Stars: ✭ 28 (-86.47%)
Gpu badmm mtBregman ADMM for mass transportation on GPU
Stars: ✭ 10 (-95.17%)
Cunn Stars: ✭ 205 (-0.97%)
Pine🌲 Aimbot powered by real-time object detection with neural networks, GPU accelerated with Nvidia. Optimized for use with CS:GO.
Stars: ✭ 202 (-2.42%)
TimemoryModular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template API is essentially a framework to creating tools: it is designed to provide a unifying interface for recording various performance measurements alongside data logging and interfaces to other tools.
Stars: ✭ 192 (-7.25%)
Gmonitorgmonitor is a GPU monitor (Nvidia only at the moment)
Stars: ✭ 169 (-18.36%)
ClojurecudaClojure library for CUDA development
Stars: ✭ 158 (-23.67%)
Kaldikaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+5286.96%)
CutlassCUDA Templates for Linear Algebra Subroutines
Stars: ✭ 1,123 (+442.51%)