ClojurecudaClojure library for CUDA development
Stars: ✭ 158 (-5.95%)
DeepnetDeep.Net machine learning framework for F#
Stars: ✭ 99 (-41.07%)
ObsidianObsidian Language Repository
Stars: ✭ 38 (-77.38%)
EmuThe write-once-run-anywhere GPGPU library for Rust
Stars: ✭ 1,350 (+703.57%)
PysnnEfficient Spiking Neural Network framework, built on top of PyTorch for GPU acceleration
Stars: ✭ 129 (-23.21%)
CekirdeklerMulti-device OpenCL kernel load balancer and pipeliner API for C#. Uses shared-distributed memory model to keep GPUs updated fast while using same kernel on all devices(for simplicity).
Stars: ✭ 76 (-54.76%)
VuhVulkan compute for people
Stars: ✭ 264 (+57.14%)
BayaderaHigh-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (+103.57%)
gpuvmemGPU Framework for Radio Astronomical Image Synthesis
Stars: ✭ 27 (-83.93%)
GpufitGPU-accelerated Levenberg-Marquardt curve fitting in CUDA
Stars: ✭ 174 (+3.57%)
Stdgpustdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+216.07%)
HeteroflowConcurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-66.07%)
CARECHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as loop fusion capability and a portable interface for many numerical algorithms. It provides all the basics for anyone wanting to write portable code.
Stars: ✭ 22 (-86.9%)
runtimeAnyDSL Runtime Library
Stars: ✭ 17 (-89.88%)
gpuhdMassively Parallel Huffman Decoding on GPUs
Stars: ✭ 30 (-82.14%)
rbcudaCUDA bindings for Ruby
Stars: ✭ 57 (-66.07%)
Fractional differencing gpuRapid large-scale fractional differencing with RAPIDS to minimize memory loss while making a time series stationary. 6x-400x speed up over CPU implementation.
Stars: ✭ 38 (-77.38%)
Nvidia libs testTests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (-78.57%)
1833718.337 - Parallel Computing and Scientific Machine Learning
Stars: ✭ 834 (+396.43%)
NeanderthalFast Clojure Matrix Library
Stars: ✭ 927 (+451.79%)
Marian DevFast Neural Machine Translation in C++ - development repository
Stars: ✭ 136 (-19.05%)
NyuziprocessorGPGPU microprocessor architecture
Stars: ✭ 1,351 (+704.17%)
ArraymancerA fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+372.02%)
Kubernetes Gpu GuideThis guide should help fellow researchers and hobbyists to easily automate and accelerate there deep leaning training with their own Kubernetes GPU cluster.
Stars: ✭ 740 (+340.48%)
LuxcoreLuxCore source repository
Stars: ✭ 601 (+257.74%)
SixtyfourHow fast can we brute force a 64-bit comparison?
Stars: ✭ 41 (-75.6%)
OpenclgaA Python Library for Genetic Algorithm on OpenCL
Stars: ✭ 103 (-38.69%)
FastflowFastFlow pattern-based parallel programming framework (formerly on sourceforge)
Stars: ✭ 137 (-18.45%)
PicongpuParticle-in-Cell Simulations for the Exascale Era ✨
Stars: ✭ 452 (+169.05%)
BindsnetSimulation of spiking neural networks (SNNs) using PyTorch.
Stars: ✭ 837 (+398.21%)
GinkgoNumerical linear algebra software package
Stars: ✭ 149 (-11.31%)
Opt einsum⚡️Optimizing einsum functions in NumPy, Tensorflow, Dask, and more with contraction order optimization.
Stars: ✭ 397 (+136.31%)
Neuralnetwork.netA TensorFlow-inspired neural network library built from scratch in C# 7.3 for .NET Standard 2.0, with GPU support through cuDNN
Stars: ✭ 392 (+133.33%)
Anime4kcppA high performance anime upscaler
Stars: ✭ 887 (+427.98%)
ClvkExperimental implementation of OpenCL on Vulkan
Stars: ✭ 158 (-5.95%)
AccelerateEmbedded language for high-performance array computations
Stars: ✭ 751 (+347.02%)
DlwinGPU-accelerated Deep Learning on Windows 10 native
Stars: ✭ 523 (+211.31%)
Glove As A Tensorflow Embedding LayerTaking a pretrained GloVe model, and using it as a TensorFlow embedding weight layer **inside the GPU**. Therefore, you only need to send the index of the words through the GPU data transfer bus, reducing data transfer overhead.
Stars: ✭ 85 (-49.4%)
Tfjs CoreWebGL-accelerated ML // linear algebra // automatic differentiation for JavaScript.
Stars: ✭ 8,514 (+4967.86%)
HipsyclImplementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (+124.4%)
Cuda Api WrappersThin C++-flavored wrappers for the CUDA Runtime API
Stars: ✭ 362 (+115.48%)
TutorialsSome basic programming tutorials
Stars: ✭ 353 (+110.12%)
Hedgehog LabRun, compile and execute JavaScript for Scientific Computing and Data Visualization TOTALLY TOTALLY TOTALLY in your BROWSER! An open source scientific computing environment for JavaScript TOTALLY in your browser, matrix operations with GPU acceleration, TeX support, data visualization and symbolic computation.
Stars: ✭ 1,797 (+969.64%)
SluggishToy CPU and GPU implementations of the Slug rendering algorithm
Stars: ✭ 70 (-58.33%)
Trisycl Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group
Stars: ✭ 354 (+110.71%)
Autodock GpuAutoDock for GPUs and other accelerators
Stars: ✭ 65 (-61.31%)
Vulkan KomputeGeneral purpose GPU compute framework for cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing usecases.
Stars: ✭ 350 (+108.33%)
StitchemVahana VR & VideoStitch Studio: software to create immersive 360° VR video, live and in post-production
Stars: ✭ 147 (-12.5%)
WebclglGPGPU Javascript library 🐸
Stars: ✭ 313 (+86.31%)
PycudaCUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (+561.9%)
BlendluxcoreBlender Integration for LuxCore
Stars: ✭ 287 (+70.83%)
Sushi2Matrix Library for JavaScript
Stars: ✭ 60 (-64.29%)
ClojureclClojureCL is a Clojure library for parallel computations with OpenCL.
Stars: ✭ 266 (+58.33%)
BlazingsqlBlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
Stars: ✭ 1,652 (+883.33%)
MatXAn efficient C++17 GPU numerical computing library with Python-like syntax
Stars: ✭ 418 (+148.81%)