QPT
[Beta] A forward-style Python environment packaging tool that quickly packages Python programs into an EXE and adds support for CUDA, NoAVX, and more.
Stars: ✭ 308 (+82.25%)
octotiger
Astrophysics program simulating the evolution of star systems based on the fast multipole method on adaptive octrees
Stars: ✭ 30 (-82.25%)
Des Cuda
DES cracking using a brute-force algorithm and CUDA
Stars: ✭ 21 (-87.57%)
ThrustRTC
CUDA tool set for non-C++ languages that provides functionality similar to Thrust, with NVRTC at its core.
Stars: ✭ 41 (-75.74%)
Matconvnet
MatConvNet: CNNs for MATLAB
Stars: ✭ 1,299 (+668.64%)
mini-nbody
A simple gravitational N-body simulation in less than 100 lines of C code, with CUDA optimizations.
Stars: ✭ 73 (-56.8%)
dynamic-occupancy-grid-map
Implementation of "A Random Finite Set Approach for Dynamic Occupancy Grid Maps with Real-Time Application"
Stars: ✭ 89 (-47.34%)
Quda
QUDA is a library for performing calculations in lattice QCD on GPUs.
Stars: ✭ 166 (-1.78%)
warp
Continuous-energy Monte Carlo neutron transport in general geometries on GPUs
Stars: ✭ 27 (-84.02%)
Uammd
A CUDA project for Molecular Dynamics, Brownian Dynamics, Hydrodynamics... intended to simulate a very generic system by constructing a simulation from modules.
Stars: ✭ 11 (-93.49%)
bazel.cmake
bazel.cmake mimics the behavior of Bazel to make CMake easier to use
Stars: ✭ 38 (-77.51%)
Deeppipe2
Deep Learning library using GPU (CUDA/cuBLAS)
Stars: ✭ 90 (-46.75%)
Jampack
Experimental parallel compression algorithm
Stars: ✭ 21 (-87.57%)
Gpu badmm mt
Bregman ADMM for mass transportation on GPU
Stars: ✭ 10 (-94.08%)
SoliditySHA3Miner
All-in-one mixed multi-GPU (NVIDIA, AMD, Intel) and CPU miner that solves proof of work to mine supported EIP918 tokens in a single instance (with API).
Stars: ✭ 28 (-83.43%)
Libcudacxx
The C++ Standard Library for your entire system.
Stars: ✭ 1,861 (+1001.18%)
Arch-Data-Science
Arch Linux PKGBUILDs for Data Science, Machine Learning, Deep Learning, NLP and Computer Vision
Stars: ✭ 92 (-45.56%)
Presentations
Slides and demo code for past presentations
Stars: ✭ 7 (-95.86%)
cuda memtest
Fork of CUDA GPU memtest 👓
Stars: ✭ 68 (-59.76%)
monolish
monolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
Stars: ✭ 166 (-1.78%)
Zluda
CUDA on Intel GPUs
Stars: ✭ 937 (+454.44%)
rbcuda
CUDA bindings for Ruby
Stars: ✭ 57 (-66.27%)
Jetson
Helmut Hoffer von Ankershoffen experimenting with arm64-based NVIDIA Jetson (Nano and AGX Xavier) edge devices running Kubernetes (K8s) for machine learning (ML), including Jupyter Notebooks, TensorFlow Training and TensorFlow Serving, using CUDA for smart IoT.
Stars: ✭ 151 (-10.65%)
JetScan
JetScan: GPU-accelerated portable RGB-D reconstruction system
Stars: ✭ 77 (-54.44%)
Thor
Atmospheric fluid dynamics solver optimized for GPUs.
Stars: ✭ 23 (-86.39%)
bifrost
A stream processing framework for high-throughput applications.
Stars: ✭ 48 (-71.6%)
Thundersvm
ThunderSVM: A Fast SVM Library on GPUs and CPUs
Stars: ✭ 1,282 (+658.58%)
nBody
GPU-accelerated N-Body particle simulator with visualizer.
Stars: ✭ 28 (-83.43%)
Sepconv Slomo
An implementation of "Video Frame Interpolation via Adaptive Separable Convolution" using PyTorch
Stars: ✭ 918 (+443.2%)
k-means
Code accompanying my blog post on k-means in Python, C++ and CUDA
Stars: ✭ 56 (-66.86%)
Agency
Execution primitives for C++
Stars: ✭ 127 (-24.85%)
peakperf
Achieve peak performance on x86 CPUs and NVIDIA GPUs
Stars: ✭ 33 (-80.47%)
gproshan
Geometry processing and shape analysis framework
Stars: ✭ 48 (-71.6%)
allgebra
Base container for developing C++ and Fortran HPC applications
Stars: ✭ 14 (-91.72%)
Wheels
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+427.22%)
ctuning-programs
Collective Knowledge extension with unified and customizable benchmarks (with extensible JSON meta information) that integrate easily with customizable and portable Collective Knowledge workflows. You can compile and run these benchmarks using different compilers, environments, hardware and operating systems (Linux, macOS, Windows, Android).
Stars: ✭ 41 (-75.74%)
Primitiv
A Neural Network Toolkit.
Stars: ✭ 164 (-2.96%)
dbcsr
DBCSR: Distributed Block Compressed Sparse Row matrix library
Stars: ✭ 65 (-61.54%)
Ddsh Tip2018
Source code for the paper "Deep Discrete Supervised Hashing"
Stars: ✭ 16 (-90.53%)
Knn cuda
PyTorch KNN [CUDA version]
Stars: ✭ 86 (-49.11%)
node
GPU-accelerated data science and visualization in Node.js
Stars: ✭ 85 (-49.7%)
Libcudarange
An interval arithmetic and affine arithmetic library for NVIDIA CUDA
Stars: ✭ 5 (-97.04%)
lane detection
Lane detection for the NVIDIA Jetson TX2 using OpenCV4Tegra
Stars: ✭ 15 (-91.12%)
Scikit Cuda
Python interface to GPU-powered libraries
Stars: ✭ 803 (+375.15%)
Dragon
Dragon: A Computation Graph Virtual Machine Based Deep Learning Framework.
Stars: ✭ 168 (-0.59%)
Floor
A C++ Compute/Graphics Library and Toolchain enabling same-source CUDA/Host/Metal/OpenCL/Vulkan C++ programming and execution.
Stars: ✭ 166 (-1.78%)
Jcuda
JCuda - Java bindings for CUDA
Stars: ✭ 165 (-2.37%)
Forward
A library for high performance deep learning inference on NVIDIA GPUs.
Stars: ✭ 136 (-19.53%)
Hashcat
World's fastest and most advanced password recovery utility
Stars: ✭ 11,014 (+6417.16%)
3d Ken Burns
An implementation of "3D Ken Burns Effect from a Single Image" using PyTorch
Stars: ✭ 1,073 (+534.91%)
K2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
Stars: ✭ 354 (+109.47%)