RETROCm Machine Learning and HPC Stack installer
Stars: ✭ 28 (-60.56%)
Amplifier.NET: Amplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPU/GPU, NVIDIA, AMD without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.
Stars: ✭ 142 (+100%)
docker-nvidia-glx-desktop: MATE Desktop container designed for Kubernetes supporting OpenGL GLX and Vulkan for NVIDIA GPUs with WebRTC and HTML5, providing an open source remote cloud graphics or game streaming platform. Spawns its own fully isolated X Server instead of using the host X server, therefore not requiring /tmp/.X11-unix host sockets or host configuration.
Stars: ✭ 47 (-33.8%)
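QPT: [In beta] A forward-style quick packaging tool for Python environments that quickly packages Python into an EXE and adds support for CUDA, NoAVX, and more.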
Stars: ✭ 308 (+333.8%)
Arboretum: Gradient Boosting powered by GPU (NVIDIA CUDA)
Stars: ✭ 64 (-9.86%)
CLU: The OpenCL Utility library
Stars: ✭ 18 (-74.65%)
boxtree: Quad/octree building for FMMs in Python and OpenCL
Stars: ✭ 52 (-26.76%)
gardenia: GARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators
Stars: ✭ 22 (-69.01%)
RayTracing: Realtime GPU Path tracer based on OpenCL and OpenGL
Stars: ✭ 120 (+69.01%)
Torch-TensorRT: PyTorch/TorchScript compiler for NVIDIA GPUs using TensorRT
Stars: ✭ 1,216 (+1612.68%)
cruise: User space POSIX-like file system in main memory
Stars: ✭ 27 (-61.97%)
ddcpuid: 🔬 dd's x86 CPU Identification tool
Stars: ✭ 21 (-70.42%)
node: GPU-accelerated data science and visualization in node
Stars: ✭ 85 (+19.72%)
lane detection: Lane detection for the Nvidia Jetson TX2 using OpenCV4Tegra
Stars: ✭ 15 (-78.87%)
HiSpatialCluster: Clustering spatial points with the Fast Search algorithm, with high-performance computing implementations in CUDA or in parallel on CPU, runnable as standalone Python or in ArcGIS.
Stars: ✭ 31 (-56.34%)
pcluster-manager: Manage AWS ParallelCluster through an easy to use web interface
Stars: ✭ 67 (-5.63%)
FGPU: No description or website provided.
Stars: ✭ 30 (-57.75%)
LuisaRender: High-Performance Multiple-Backend Renderer Based on LuisaCompute
Stars: ✭ 47 (-33.8%)
Popsift: PopSift is an implementation of the SIFT algorithm in CUDA.
Stars: ✭ 259 (+264.79%)
Hemi: Simple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.
Stars: ✭ 275 (+287.32%)
slibs: Single file libraries for C/C++
Stars: ✭ 80 (+12.68%)
framework: The Arcane Framework for HPC codes
Stars: ✭ 15 (-78.87%)
coreos-gpu-installer: Scripts to build and use a container to install GPU drivers on CoreOS Container Linux
Stars: ✭ 21 (-70.42%)
warp: Continuous-energy Monte Carlo neutron transport in general geometries on GPUs
Stars: ✭ 27 (-61.97%)
crowdsource-video-experiments-on-android: Crowdsourcing video experiments (such as collaborative benchmarking and optimization of DNN algorithms) using Collective Knowledge Framework across diverse Android devices provided by volunteers. Results are continuously aggregated in the open repository:
Stars: ✭ 29 (-59.15%)
Vuh: Vulkan compute for people
Stars: ✭ 264 (+271.83%)
Graphvite: GraphVite: A General and High-performance Graph Embedding System
Stars: ✭ 865 (+1118.31%)
PyMFEM: Python wrapper for MFEM
Stars: ✭ 91 (+28.17%)
PbfVs: Implementation of Macklin, Miles, and Matthias Müller, "Position based fluids." Visual Studio 2015 + CUDA 8.0
Stars: ✭ 100 (+40.85%)
Fast gicp: A collection of GICP-based fast point cloud registration algorithms
Stars: ✭ 307 (+332.39%)
Jetsonjs: Embed a JavaScript/WebGL application on an NVIDIA Jetson TX2 and stream the results through websockets. It does not rely on CUDA/Jetpack. HDMI touchscreen, virtual keyboard, GPIO control, and Wi-Fi config are included.
Stars: ✭ 18 (-74.65%)
Wheels: Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+1154.93%)
Gpusorting: Implementation of a few sorting algorithms in OpenCL
Stars: ✭ 9 (-87.32%)
ufo-core: GLib-based framework for GPU-based data processing
Stars: ✭ 20 (-71.83%)
Kernels: This is a set of simple programs that can be used to explore the features of a parallel platform.
Stars: ✭ 287 (+304.23%)
Nvptx: How to: Run Rust code on your NVIDIA GPU
Stars: ✭ 335 (+371.83%)
Jug: Parallel programming with Python
Stars: ✭ 337 (+374.65%)
Qualia2.0: Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.
Stars: ✭ 41 (-42.25%)
Clarrays.jl: OpenCL-backed GPU Arrays
Stars: ✭ 58 (-18.31%)
Sycl Dnn: SYCL-DNN is a library implementing neural network algorithms written using SYCL
Stars: ✭ 67 (-5.63%)
GPU-Jupyterhub: Setting up a JupyterHub Docker container to spawn Jupyter Notebooks with GPU support (containing TensorFlow, PyTorch, and Keras)
Stars: ✭ 23 (-67.61%)
gpu-monitor: Script to remotely check GPU servers for free GPUs
Stars: ✭ 85 (+19.72%)
MOT: Multi-threaded Optimization Toolbox
Stars: ✭ 28 (-60.56%)
Cudf: cuDF - GPU DataFrame Library
Stars: ✭ 4,370 (+6054.93%)
H2o4gpu: H2Oai GPU Edition
Stars: ✭ 416 (+485.92%)
Ai Lab: All-in-one AI container for rapid prototyping
Stars: ✭ 406 (+471.83%)
Accel: (Mirror of GitLab) GPGPU Framework for Rust
Stars: ✭ 420 (+491.55%)
Webclgl: GPGPU Javascript library 🐸
Stars: ✭ 313 (+340.85%)
Nvtop: NVIDIA GPUs htop like monitoring tool
Stars: ✭ 3,604 (+4976.06%)
Set Egpu: Display-agnostic acceleration of macOS applications using external GPUs.
Stars: ✭ 429 (+504.23%)
Open3d: Open3D: A Modern Library for 3D Data Processing
Stars: ✭ 5,860 (+8153.52%)
Cuda.jl: CUDA programming in Julia.
Stars: ✭ 370 (+421.13%)
Chlorine: Dead Simple OpenCL
Stars: ✭ 419 (+490.14%)
Caer: High-performance Vision library in Python. Scale your research, not boilerplate.
Stars: ✭ 452 (+536.62%)
Tornadovm: TornadoVM: A practical and efficient heterogeneous programming framework for managed languages
Stars: ✭ 479 (+574.65%)
Regl Cnn: Digit recognition with Convolutional Neural Networks in WebGL
Stars: ✭ 490 (+590.14%)
Rustacuda: Rusty wrapper for the CUDA Driver API
Stars: ✭ 511 (+619.72%)
Soul Engine: Physically based renderer and simulation engine for real-time applications.
Stars: ✭ 37 (-47.89%)