Torch-TensorRTPyTorch/TorchScript compiler for NVIDIA GPUs using TensorRT
Stars: ✭ 1,216 (+1996.55%)
ClothTOPGPU-accelerated Cloth TOP node for TouchDesigner using the NVIDIA Flex physics solver.
Stars: ✭ 33 (-43.1%)
warpcontinuous energy monte carlo neutron transport in general geometries on GPUs
Stars: ✭ 27 (-53.45%)
NN-CUDA-ExampleSeveral simple examples for popular neural network toolkits calling custom CUDA operators.
Stars: ✭ 594 (+924.14%)
monolishmonolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
Stars: ✭ 166 (+186.21%)
MatXAn efficient C++17 GPU numerical computing library with Python-like syntax
Stars: ✭ 418 (+620.69%)
bifrostA stream processing framework for high-throughput applications.
Stars: ✭ 48 (-17.24%)
QPT[内测中]前向式Python环境快捷封装工具,快速将Python打包为EXE并添加CUDA、NoAVX等支持。
Stars: ✭ 308 (+431.03%)
GOSHAn ultra-fast, GPU-based large graph embedding algorithm utilizing a novel coarsening algorithm requiring not more than a single GPU.
Stars: ✭ 12 (-79.31%)
JampackExperimental parallel compression algorithm
Stars: ✭ 21 (-63.79%)
allgebraBase container for developing C++ and Fortran HPC applications
Stars: ✭ 14 (-75.86%)
cuda memtestFork of CUDA GPU memtest 👓
Stars: ✭ 68 (+17.24%)
mini-nbodyA simple gravitational N-body simulation in less than 100 lines of C code, with CUDA optimizations.
Stars: ✭ 73 (+25.86%)
rbcudaCUDA bindings for Ruby
Stars: ✭ 57 (-1.72%)
tensorflow-windowsTensorFlow builds compiled on windows with avx and avx2 extensions
Stars: ✭ 20 (-65.52%)
GPU-PathtracerGPU Raytracer from scratch in C++/CUDA
Stars: ✭ 326 (+462.07%)
cressetTemplate repository to build PyTorch projects from source on any version of PyTorch/CUDA/cuDNN.
Stars: ✭ 573 (+887.93%)
CPP-ProgrammingVarious C/C++ examples. DirectX, OpenGL, CUDA, Vulkan, OpenCL.
Stars: ✭ 30 (-48.28%)
gpubootcampThis repository consists for gpu bootcamp material for HPC and AI
Stars: ✭ 227 (+291.38%)
bazel.cmakebazel.cmake mimics the behavior of bazel to simplify the usability of CMake
Stars: ✭ 38 (-34.48%)
gproshangeometry processing and shape analysis framework
Stars: ✭ 48 (-17.24%)
octotigerAstrophysics program simulating the evolution of star systems based on the fast multipole method on adaptive Octrees
Stars: ✭ 30 (-48.28%)
ctuning-programsCollective Knowledge extension with unified and customizable benchmarks (with extensible JSON meta information) to be easily integrated with customizable and portable Collective Knowledge workflows. You can easily compile and run these benchmarks using different compilers, environments, hardware and OS (Linux, MacOS, Windows, Android). More info:
Stars: ✭ 41 (-29.31%)
SoliditySHA3MinerAll-in-one mixed multi-GPU (nVidia, AMD, Intel) & CPU miner solves proof of work to mine supported EIP918 tokens in a single instance (with API).
Stars: ✭ 28 (-51.72%)
dbcsrDBCSR: Distributed Block Compressed Sparse Row matrix library
Stars: ✭ 65 (+12.07%)
Fat-CloudsGPU Fluid Simulation with Volumetric Rendering
Stars: ✭ 81 (+39.66%)
disptoolsGenerate displacement fields with known volume changes
Stars: ✭ 17 (-70.69%)
FLAMEGPU2FLAME GPU 2 is a GPU accelerated agent based modelling framework for C++ and Python
Stars: ✭ 25 (-56.9%)
tiny-cuda-nnLightning fast & tiny C++/CUDA neural network framework
Stars: ✭ 908 (+1465.52%)
pyRenderLightweight Cuda Renderer with Python Wrapper.
Stars: ✭ 49 (-15.52%)
raytkRaymarching shader toolkit for TouchDesigner
Stars: ✭ 98 (+68.97%)
PhaserCHOP-TD-Summit-TalkProject files associated with http://github.com/dbraun/PhaserCHOP and David Braun's "Quantitative Easing" talk at the 2019 TouchDesigner Summit https://www.youtube.com/watch?v=S4PQW4f34c8
Stars: ✭ 36 (-37.93%)
opencv-cuda-dockerDockerfiles for OpenCV compiled with CUDA, opencv_contrib modules and Python 3 bindings
Stars: ✭ 55 (-5.17%)
JetScanJetScan : GPU accelerated portable RGB-D reconstruction system
Stars: ✭ 77 (+32.76%)
dynamic-occupancy-grid-mapImplementation of A Random Finite Set Approach for Dynamic Occupancy Grid Maps with Real-Time Application
Stars: ✭ 89 (+53.45%)
pytorch-cppJust messing around with PyTorch 1.0's JIT compiler and their new C++ API Libtorch.
Stars: ✭ 17 (-70.69%)
CudaSHA256Simple tool to calculate sha256 on GPU using Cuda
Stars: ✭ 38 (-34.48%)
briefmatchBriefMatch real-time GPU optical flow
Stars: ✭ 36 (-37.93%)
nBodyGPU-accelerated N-Body particle simulator with visualizer.
Stars: ✭ 28 (-51.72%)
cuda2GLcoreImplementation of Cuda to OpenGL rendering
Stars: ✭ 46 (-20.69%)
k-meansCode accompanying my blog post on k-means in Python, C++ and CUDA
Stars: ✭ 56 (-3.45%)
PbfVsImplementation of Macklin, Miles, and Matthias Müller. "Position based fluids.". Visual Studio 2015 + CUDA 8.0
Stars: ✭ 100 (+72.41%)
peakperfAchieve peak performance on x86 CPUs and NVIDIA GPUs
Stars: ✭ 33 (-43.1%)
revisiting-sepconvan implementation of Revisiting Adaptive Convolutions for Video Frame Interpolation using PyTorch
Stars: ✭ 43 (-25.86%)
onnx tensorrt projectSupport Yolov5(4.0)/Yolov5(5.0)/YoloR/YoloX/Yolov4/Yolov3/CenterNet/CenterFace/RetinaFace/Classify/Unet. use darknet/libtorch/pytorch/mxnet to onnx to tensorrt
Stars: ✭ 145 (+150%)
FGPUNo description or website provided.
Stars: ✭ 30 (-48.28%)
mbsolveAn open-source solver tool for the Maxwell-Bloch equations.
Stars: ✭ 14 (-75.86%)
NewsMTSCTarget-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k sentences and a state-of-the-art classification model.
Stars: ✭ 54 (-6.9%)
cuda-toolkitGitHub Action to install CUDA
Stars: ✭ 34 (-41.38%)
HiSpatialClusterClustering spatial points with algorithm of Fast Search, high performace computing implements of CUDA or parallel in CPU, and runnable implements on python standalone or arcgis.
Stars: ✭ 31 (-46.55%)
lbvhan implementation of parallel linear BVH (LBVH) on GPU
Stars: ✭ 67 (+15.52%)
euler2d kokkosSimple 2d finite volume solver for Euler equations using c++ kokkos library
Stars: ✭ 27 (-53.45%)
crowdsource-video-experiments-on-androidCrowdsourcing video experiments (such as collaborative benchmarking and optimization of DNN algorithms) using Collective Knowledge Framework across diverse Android devices provided by volunteers. Results are continuously aggregated in the open repository:
Stars: ✭ 29 (-50%)
desertA fast (?) random sampling drawing library
Stars: ✭ 61 (+5.17%)
hipaccA domain-specific language and compiler for image processing
Stars: ✭ 72 (+24.14%)
ThrustRTCCUDA tool set for non-C++ languages that provides similar functionality like Thrust, with NVRTC at its core.
Stars: ✭ 41 (-29.31%)