Arch-Data-ScienceArchlinux PKGBUILDs for Data Science, Machine Learning, Deep Learning, NLP and Computer Vision
Stars: ✭ 92 (-39.87%)
Pytorch spnExtension package for spatial propagation network in pytorch.
Stars: ✭ 114 (-25.49%)
cuda memtestFork of CUDA GPU memtest 👓
Stars: ✭ 68 (-55.56%)
WheelsPerformance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+482.35%)
monolishmonolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
Stars: ✭ 166 (+8.5%)
Cudart.jlJulia wrapper for CUDA runtime API
Stars: ✭ 75 (-50.98%)
rbcudaCUDA bindings for Ruby
Stars: ✭ 57 (-62.75%)
Ddsh Tip2018source code for paper "Deep Discrete Supervised Hashing"
Stars: ✭ 16 (-89.54%)
JetScanJetScan : GPU accelerated portable RGB-D reconstruction system
Stars: ✭ 77 (-49.67%)
GinkgoNumerical linear algebra software package
Stars: ✭ 149 (-2.61%)
bifrostA stream processing framework for high-throughput applications.
Stars: ✭ 48 (-68.63%)
LibcudarangeAn interval arithmetic and affine arithmetic library for NVIDIA CUDA
Stars: ✭ 5 (-96.73%)
nBodyGPU-accelerated N-Body particle simulator with visualizer.
Stars: ✭ 28 (-81.7%)
ParenchymaAn extensible HPC framework for CUDA, OpenCL and native CPU.
Stars: ✭ 71 (-53.59%)
k-meansCode accompanying my blog post on k-means in Python, C++ and CUDA
Stars: ✭ 56 (-63.4%)
Scikit CudaPython interface to GPU-powered libraries
Stars: ✭ 803 (+424.84%)
peakperfAchieve peak performance on x86 CPUs and NVIDIA GPUs
Stars: ✭ 33 (-78.43%)
Pytorch Unflow a reimplementation of UnFlow in PyTorch that matches the official TensorFlow version
Stars: ✭ 113 (-26.14%)
gproshangeometry processing and shape analysis framework
Stars: ✭ 48 (-68.63%)
ArraymancerA fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+418.3%)
allgebraBase container for developing C++ and Fortran HPC applications
Stars: ✭ 14 (-90.85%)
DeepjointfilterThe source code of ECCV16 'Deep Joint Image Filtering'.
Stars: ✭ 68 (-55.56%)
ctuning-programsCollective Knowledge extension with unified and customizable benchmarks (with extensible JSON meta information) to be easily integrated with customizable and portable Collective Knowledge workflows. You can easily compile and run these benchmarks using different compilers, environments, hardware and OS (Linux, MacOS, Windows, Android). More info:
Stars: ✭ 41 (-73.2%)
NumbaNumPy aware dynamic Python compiler using LLVM
Stars: ✭ 7,090 (+4533.99%)
dbcsrDBCSR: Distributed Block Compressed Sparse Row matrix library
Stars: ✭ 65 (-57.52%)
Ethereum nvidia miner💰 USB flash drive ISO image for Ethereum, Zcash and Monero mining with NVIDIA graphics cards and Ubuntu GNU/Linux (headless)
Stars: ✭ 772 (+404.58%)
nodeGPU-accelerated data science and visualization in node
Stars: ✭ 85 (-44.44%)
AlenkaGPU database engine
Stars: ✭ 1,150 (+651.63%)
lane detectionLane detection for the Nvidia Jetson TX2 using OpenCV4Tegra
Stars: ✭ 15 (-90.2%)
JuiceThe Hacker's Machine Learning Engine
Stars: ✭ 743 (+385.62%)
libelas-gpuImplementation of LIBELAS in cuda.
Stars: ✭ 41 (-73.2%)
Futhark💥💻💥 A data-parallel functional programming language
Stars: ✭ 1,641 (+972.55%)
AresdbA GPU-powered real-time analytics storage and query engine.
Stars: ✭ 2,814 (+1739.22%)
Deep Painterly HarmonizationCode and data for paper "Deep Painterly Harmonization": https://arxiv.org/abs/1804.03189
Stars: ✭ 6,027 (+3839.22%)
DlamiA Deep Learning Amazon Web Service (AWS) AMI that is open, free and works. Run in less than 5 minutes. TensorFlow, Keras, PyTorch, Theano, MXNet, CNTK, Caffe and all dependencies.
Stars: ✭ 239 (+56.21%)
Autodock GpuAutoDock for GPUs and other accelerators
Stars: ✭ 65 (-57.52%)
Cuda Convnet2Automatically exported from code.google.com/p/cuda-convnet2
Stars: ✭ 690 (+350.98%)
OccaJIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal
Stars: ✭ 230 (+50.33%)
RemoterySingle C file, Realtime CPU/GPU Profiler with Remote Web Viewer
Stars: ✭ 1,908 (+1147.06%)
Optix PathtracerSimple physically based path tracer based on Nvidia's Optix Ray Tracing Engine
Stars: ✭ 231 (+50.98%)
Nv WavenetReference implementation of real-time autoregressive wavenet inference
Stars: ✭ 681 (+345.1%)
TengineTengine is a lite, high performance, modular inference engine for embedded device
Stars: ✭ 4,012 (+2522.22%)
Cudadrv.jlA Julia wrapper for the CUDA driver API.
Stars: ✭ 64 (-58.17%)
CupochRobotics with GPU computing
Stars: ✭ 225 (+47.06%)
Mc CnnStereo Matching by Training a Convolutional Neural Network to Compare Image Patches
Stars: ✭ 638 (+316.99%)
DeepspeechDeepSpeech neon implementation
Stars: ✭ 223 (+45.75%)
KmcudaLarge scale K-means and K-nn implementation on NVIDIA GPU / CUDA
Stars: ✭ 627 (+309.8%)
CompactcnncascadeA binary library for very fast face detection using compact CNNs.
Stars: ✭ 152 (-0.65%)
NsimdAgenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (-9.8%)
OnemkloneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (-20.26%)
HallocA fast and highly scalable GPU dynamic memory allocator
Stars: ✭ 89 (-41.83%)
Smallpt Parallel Bvh GpuA GPU implementation of smallpt (http://www.kevinbeason.com/smallpt/) with Bounding Volume Hierarchy (BVH) tree.
Stars: ✭ 36 (-76.47%)
Cuda voxelizerCUDA Voxelizer to convert polygon meshes into annotated voxel grids
Stars: ✭ 299 (+95.42%)