IlgpuILGPU JIT Compiler for high-performance .Net GPU programs
Stars: ✭ 374 (+128.05%)
ParenchymaAn extensible HPC framework for CUDA, OpenCL and native CPU.
Stars: ✭ 71 (-56.71%)
Ck CaffeCollective Knowledge workflow for Caffe to automate installation across diverse platforms and to collaboratively evaluate and optimize Caffe-based workloads across diverse hardware, software and data sets (compilers, libraries, tools, models, inputs):
Stars: ✭ 192 (+17.07%)
Arrayfire PythonPython bindings for ArrayFire: A general purpose GPU library.
Stars: ✭ 358 (+118.29%)
hipaccA domain-specific language and compiler for image processing
Stars: ✭ 72 (-56.1%)
KhivaAn open-source library of algorithms to analyse time series in GPU and CPU.
Stars: ✭ 161 (-1.83%)
NeanderthalFast Clojure Matrix Library
Stars: ✭ 927 (+465.24%)
JuiceThe Hacker's Machine Learning Engine
Stars: ✭ 743 (+353.05%)
BitcrackerBitCracker is the first open source password cracking tool for memory units encrypted with BitLocker
Stars: ✭ 463 (+182.32%)
OccaJIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal
Stars: ✭ 230 (+40.24%)
ArrayfireArrayFire: a general purpose GPU library.
Stars: ✭ 3,693 (+2151.83%)
MixbenchA GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL)
Stars: ✭ 130 (-20.73%)
HipsyclImplementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (+129.88%)
Futhark💥💻💥 A data-parallel functional programming language
Stars: ✭ 1,641 (+900.61%)
BohriumAutomatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Stars: ✭ 209 (+27.44%)
BayaderaHigh-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (+108.54%)
KomputationKomputation is a neural network framework for the Java Virtual Machine written in Kotlin and CUDA C.
Stars: ✭ 295 (+79.88%)
PyopenclOpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+381.71%)
GpusortingImplementation of a few sorting algorithms in OpenCL
Stars: ✭ 9 (-94.51%)
GraphviteGraphVite: A General and High-performance Graph Embedding System
Stars: ✭ 865 (+427.44%)
CudaExperiments with CUDA and Rust
Stars: ✭ 31 (-81.1%)
Qualia2.0Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.
Stars: ✭ 41 (-75%)
KttKernel Tuning Toolkit
Stars: ✭ 33 (-79.88%)
Carlsim3CARLsim is an efficient, easy-to-use, GPU-accelerated software framework for simulating large-scale spiking neural network (SNN) models with a high degree of biological detail.
Stars: ✭ 52 (-68.29%)
Clarrays.jlOpenCL-backed GPU Arrays
Stars: ✭ 58 (-64.63%)
C cpp project framework CMake build system( framework) with kconfig support for C/CPP projects
Stars: ✭ 26 (-84.15%)
GgnnGGNN: State of the Art Graph-based GPU Nearest Neighbor Search
Stars: ✭ 63 (-61.59%)
CubCooperative primitives for CUDA C++.
Stars: ✭ 883 (+438.41%)
WheelsPerformance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+443.29%)
Soul EnginePhysically based renderer and simulation engine for real-time applications.
Stars: ✭ 37 (-77.44%)
Nvidia libs testTests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (-78.05%)
HeteroflowConcurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-65.24%)
Scikit CudaPython interface to GPU-powered libraries
Stars: ✭ 803 (+389.63%)
Xmrminer🐜 A CUDA based miner for Monero
Stars: ✭ 158 (-3.66%)
Tsne CudaGPU Accelerated t-SNE for CUDA with Python bindings
Stars: ✭ 1,120 (+582.93%)
PycudaCUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (+578.05%)
Autodock GpuAutoDock for GPUs and other accelerators
Stars: ✭ 65 (-60.37%)
ArboretumGradient Boosting powered by GPU(NVIDIA CUDA)
Stars: ✭ 64 (-60.98%)
SpiritAtomistic Spin Simulation Framework
Stars: ✭ 67 (-59.15%)
Cudart.jlJulia wrapper for CUDA runtime API
Stars: ✭ 75 (-54.27%)
ComputeA C++ GPU Computing Library for OpenCL
Stars: ✭ 1,192 (+626.83%)
CekirdeklerMulti-device OpenCL kernel load balancer and pipeliner API for C#. Uses shared-distributed memory model to keep GPUs updated fast while using same kernel on all devices(for simplicity).
Stars: ✭ 76 (-53.66%)
TvmOpen deep learning compiler stack for cpu, gpu and specialized accelerators
Stars: ✭ 7,494 (+4469.51%)
MprReference implementation for "Massively Parallel Rendering of Complex Closed-Form Implicit Surfaces" (SIGGRAPH 2020)
Stars: ✭ 84 (-48.78%)
Cuda Design PatternsSome CUDA design patterns and a bit of template magic for CUDA
Stars: ✭ 78 (-52.44%)
Sparkle🎇 A modern particle engine running on GPU, using c++14 and OpenGL 4.4.
Stars: ✭ 162 (-1.22%)
NumerNumeric Erlang - vector and matrix operations with CUDA. Heavily inspired by Pteracuda - https://github.com/kevsmith/pteracuda
Stars: ✭ 91 (-44.51%)
Deeppipe2Deep Learning library using GPU(CUDA/cuBLAS)
Stars: ✭ 90 (-45.12%)
PynvvlA Python wrapper of NVIDIA Video Loader (NVVL) with CuPy for fast video loading with Python
Stars: ✭ 95 (-42.07%)
ThundersvmThunderSVM: A Fast SVM Library on GPUs and CPUs
Stars: ✭ 1,282 (+681.71%)
DeepnetDeep.Net machine learning framework for F#
Stars: ✭ 99 (-39.63%)
PygraphistryPyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer
Stars: ✭ 1,365 (+732.32%)
Tensorflow Object Detection TutorialThe purpose of this tutorial is to learn how to install and prepare TensorFlow framework to train your own convolutional neural network object detection classifier for multiple objects, starting from scratch
Stars: ✭ 113 (-31.1%)
SpocStream Processing with OCaml
Stars: ✭ 115 (-29.88%)
NovuscoreA modern take on WoW emulation
Stars: ✭ 88 (-46.34%)
HashcatWorld's fastest and most advanced password recovery utility
Stars: ✭ 11,014 (+6615.85%)
CltuneCLTune: An automatic OpenCL & CUDA kernel tuner
Stars: ✭ 114 (-30.49%)