Bayadera
High-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (+63.64%)
Parenchyma
An extensible HPC framework for CUDA, OpenCL and native CPU.
Stars: ✭ 71 (-66.03%)
Pyopencl
OpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+277.99%)
Cekirdekler
Multi-device OpenCL kernel load balancer and pipeliner API for C#. Uses a shared-distributed memory model to keep GPUs updated fast while using the same kernel on all devices (for simplicity).
Stars: ✭ 76 (-63.64%)
Arraymancer
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, CUDA and OpenCL backends
Stars: ✭ 793 (+279.43%)
Pynvvl
A Python wrapper of NVIDIA Video Loader (NVVL) with CuPy for fast video loading with Python
Stars: ✭ 95 (-54.55%)
Mixbench
A GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL)
Stars: ✭ 130 (-37.8%)
Cupy
NumPy & SciPy for GPU
Stars: ✭ 5,625 (+2591.39%)
Hipsycl
Implementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (+80.38%)
Futhark
💥💻💥 A data-parallel functional programming language
Stars: ✭ 1,641 (+685.17%)
Occa
JIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal
Stars: ✭ 230 (+10.05%)
Ilgpu
ILGPU JIT Compiler for high-performance .NET GPU programs
Stars: ✭ 374 (+78.95%)
Khiva
An open-source library of algorithms to analyse time series on GPU and CPU.
Stars: ✭ 161 (-22.97%)
Deepnet
Deep.Net machine learning framework for F#
Stars: ✭ 99 (-52.63%)
Chainer
A flexible framework of neural networks for deep learning
Stars: ✭ 5,656 (+2606.22%)
Onemkl
oneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (-41.63%)
hipacc
A domain-specific language and compiler for image processing
Stars: ✭ 72 (-65.55%)
Arrayfire
ArrayFire: a general purpose GPU library.
Stars: ✭ 3,693 (+1666.99%)
Neanderthal
Fast Clojure Matrix Library
Stars: ✭ 927 (+343.54%)
Stdgpu
stdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+154.07%)
Arrayfire Python
Python bindings for ArrayFire: A general purpose GPU library.
Stars: ✭ 358 (+71.29%)
Bitcracker
BitCracker is the first open source password cracking tool for memory units encrypted with BitLocker
Stars: ✭ 463 (+121.53%)
Heteroflow
Concurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-72.73%)
Primitiv
A Neural Network Toolkit.
Stars: ✭ 164 (-21.53%)
Pine
🌲 Aimbot powered by real-time object detection with neural networks, GPU accelerated with Nvidia. Optimized for use with CS:GO.
Stars: ✭ 202 (-3.35%)
Glove As A Tensorflow Embedding Layer
Taking a pretrained GloVe model and using it as a TensorFlow embedding weight layer inside the GPU. As a result, only the word indices need to be sent over the GPU data transfer bus, reducing data transfer overhead.
Stars: ✭ 85 (-59.33%)
Mpr
Reference implementation for "Massively Parallel Rendering of Complex Closed-Form Implicit Surfaces" (SIGGRAPH 2020)
Stars: ✭ 84 (-59.81%)
Deeppipe2
Deep Learning library using GPU (CUDA/cuBLAS)
Stars: ✭ 90 (-56.94%)
Thundersvm
ThunderSVM: A Fast SVM Library on GPUs and CPUs
Stars: ✭ 1,282 (+513.4%)
Numer
Numeric Erlang - vector and matrix operations with CUDA. Heavily inspired by Pteracuda - https://github.com/kevsmith/pteracuda
Stars: ✭ 91 (-56.46%)
Emu
The write-once-run-anywhere GPGPU library for Rust
Stars: ✭ 1,350 (+545.93%)
Pygraphistry
PyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer
Stars: ✭ 1,365 (+553.11%)
Cuda Design Patterns
Some CUDA design patterns and a bit of template magic for CUDA
Stars: ✭ 78 (-62.68%)
Ck Caffe
Collective Knowledge workflow for Caffe to automate installation across diverse platforms and to collaboratively evaluate and optimize Caffe-based workloads across diverse hardware, software and data sets (compilers, libraries, tools, models, inputs).
Stars: ✭ 192 (-8.13%)
Hashcat
World's fastest and most advanced password recovery utility
Stars: ✭ 11,014 (+5169.86%)
Pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Stars: ✭ 52,811 (+25168.42%)
Cltune
CLTune: An automatic OpenCL & CUDA kernel tuner
Stars: ✭ 114 (-45.45%)
Tensorflow Object Detection Tutorial
The purpose of this tutorial is to learn how to install and prepare the TensorFlow framework to train your own convolutional neural network object detection classifier for multiple objects, starting from scratch
Stars: ✭ 113 (-45.93%)
Compute.scala
Scientific computing with N-dimensional arrays
Stars: ✭ 191 (-8.61%)
Spoc
Stream Processing with OCaml
Stars: ✭ 115 (-44.98%)
Blazingsql
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
Stars: ✭ 1,652 (+690.43%)
Mtensor
A C++ CUDA Tensor Lazy Computing Library
Stars: ✭ 115 (-44.98%)
Ivy
The templated deep learning framework, enabling framework-agnostic functions, layers and libraries.
Stars: ✭ 118 (-43.54%)
Babelstream
STREAM, for lots of devices written in many programming models
Stars: ✭ 121 (-42.11%)
Nnvm
No description or website provided.
Stars: ✭ 1,639 (+684.21%)
Pyhpc Benchmarks
A suite of benchmarks to test the sequential CPU and GPU performance of the most popular high-performance libraries for Python.
Stars: ✭ 119 (-43.06%)
Libcudacxx
The C++ Standard Library for your entire system.
Stars: ✭ 1,861 (+790.43%)
Hoomd Blue
Molecular dynamics and Monte Carlo soft matter simulation on GPUs.
Stars: ✭ 143 (-31.58%)
Forward
A library for high performance deep learning inference on NVIDIA GPUs.
Stars: ✭ 136 (-34.93%)
Remotery
Single C file, Realtime CPU/GPU Profiler with Remote Web Viewer
Stars: ✭ 1,908 (+812.92%)
Macos Egpu Cuda Guide
Set up CUDA for machine learning (and gaming) on macOS using an NVIDIA eGPU
Stars: ✭ 187 (-10.53%)
Ctranslate2
Fast inference engine for OpenNMT models
Stars: ✭ 140 (-33.01%)
Gpurir
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
Stars: ✭ 145 (-30.62%)
Optical Flow Filter
A real-time optical flow algorithm implemented on GPU
Stars: ✭ 146 (-30.14%)
Cumf als
CUDA Matrix Factorization Library with Alternating Least Squares (ALS)
Stars: ✭ 154 (-26.32%)
Simplegpuhashtable
A simple GPU hash table implemented in CUDA using lock-free techniques
Stars: ✭ 198 (-5.26%)