opensbliA framework for the automated derivation and parallel execution of finite difference solvers on a range of computer architectures.
Stars: ✭ 56 (-30%)
ClojurecudaClojure library for CUDA development
Stars: ✭ 158 (+97.5%)
HeteroflowConcurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-28.75%)
ArraymancerA fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+891.25%)
CekirdeklerMulti-device OpenCL kernel load balancer and pipeliner API for C#. Uses shared-distributed memory model to keep GPUs updated fast while using same kernel on all devices(for simplicity).
Stars: ✭ 76 (-5%)
FastA framework for GPU based high-performance medical image processing and visualization
Stars: ✭ 179 (+123.75%)
Nvidia libs testTests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (-55%)
notebooksA docker-based starter kit for machine learning via jupyter notebooks. Designed for those who just want a runtime environment and get on with machine learning. Docker tags:
Stars: ✭ 29 (-63.75%)
Stdgpustdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+563.75%)
TutorialsSome basic programming tutorials
Stars: ✭ 353 (+341.25%)
NyuziprocessorGPGPU microprocessor architecture
Stars: ✭ 1,351 (+1588.75%)
LingvoLingvo
Stars: ✭ 2,361 (+2851.25%)
PycudaCUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (+1290%)
CaNSA code for fast, massively-parallel direct numerical simulations (DNS) of canonical flows
Stars: ✭ 144 (+80%)
SixtyfourHow fast can we brute force a 64-bit comparison?
Stars: ✭ 41 (-48.75%)
Montecarlomeasurements.jlPropagation of distributions by Monte-Carlo sampling: Real number types with uncertainty represented by samples.
Stars: ✭ 168 (+110%)
BindsnetSimulation of spiking neural networks (SNNs) using PyTorch.
Stars: ✭ 837 (+946.25%)
range3Range Software - Finite Element Analysis
Stars: ✭ 31 (-61.25%)
Kubernetes Gpu GuideThis guide should help fellow researchers and hobbyists to easily automate and accelerate there deep leaning training with their own Kubernetes GPU cluster.
Stars: ✭ 740 (+825%)
GinkgoNumerical linear algebra software package
Stars: ✭ 149 (+86.25%)
HipsyclImplementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (+371.25%)
magicMagIC is a high-performance code that solves the magneto-hydrodynamics equations in rotating spherical shells
Stars: ✭ 67 (-16.25%)
PysnnEfficient Spiking Neural Network framework, built on top of PyTorch for GPU acceleration
Stars: ✭ 129 (+61.25%)
Vulkan KomputeGeneral purpose GPU compute framework for cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing usecases.
Stars: ✭ 350 (+337.5%)
WebclglGPGPU Javascript library 🐸
Stars: ✭ 313 (+291.25%)
EmuThe write-once-run-anywhere GPGPU library for Rust
Stars: ✭ 1,350 (+1587.5%)
GpurR interface to use GPU's
Stars: ✭ 208 (+160%)
gpuowlGPU Mersenne primality test.
Stars: ✭ 77 (-3.75%)
Autodock GpuAutoDock for GPUs and other accelerators
Stars: ✭ 65 (-18.75%)
Awesome Webgpu😎 Curated list of awesome things around WebGPU ecosystem.
Stars: ✭ 182 (+127.5%)
Sushi2Matrix Library for JavaScript
Stars: ✭ 60 (-25%)
quagmirePython surface process framework on highly scalable unstructured meshes
Stars: ✭ 13 (-83.75%)
Raspberrypi tempmonRaspberry pi CPU temperature monitor with many functions such as logging, GPIO output, graphing, email, alarm, notifications and stress testing. Python 3.
Stars: ✭ 52 (-35%)
GpufitGPU-accelerated Levenberg-Marquardt curve fitting in CUDA
Stars: ✭ 174 (+117.5%)
Fractional differencing gpuRapid large-scale fractional differencing with RAPIDS to minimize memory loss while making a time series stationary. 6x-400x speed up over CPU implementation.
Stars: ✭ 38 (-52.5%)
gpuvmemGPU Framework for Radio Astronomical Image Synthesis
Stars: ✭ 27 (-66.25%)
1833718.337 - Parallel Computing and Scientific Machine Learning
Stars: ✭ 834 (+942.5%)
PelemayPelemay is a native compiler for Elixir, which generates SIMD instructions. It has a plan to generate for GPU code.
Stars: ✭ 161 (+101.25%)
NeanderthalFast Clojure Matrix Library
Stars: ✭ 927 (+1058.75%)
LvArrayPortable HPC Containers (C++)
Stars: ✭ 37 (-53.75%)
AccelerateEmbedded language for high-performance array computations
Stars: ✭ 751 (+838.75%)
ClvkExperimental implementation of OpenCL on Vulkan
Stars: ✭ 158 (+97.5%)
LuxcoreLuxCore source repository
Stars: ✭ 601 (+651.25%)
PicongpuParticle-in-Cell Simulations for the Exascale Era ✨
Stars: ✭ 452 (+465%)
FastflowFastFlow pattern-based parallel programming framework (formerly on sourceforge)
Stars: ✭ 137 (+71.25%)
Cuda Api WrappersThin C++-flavored wrappers for the CUDA Runtime API
Stars: ✭ 362 (+352.5%)
hipercHigh Performance Computing Strategies for Boundary Value Problems
Stars: ✭ 36 (-55%)
Trisycl Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group
Stars: ✭ 354 (+342.5%)
BayaderaHigh-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (+327.5%)
OptOpt DSL
Stars: ✭ 237 (+196.25%)
OpenclgaA Python Library for Genetic Algorithm on OpenCL
Stars: ✭ 103 (+28.75%)
CARECHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as loop fusion capability and a portable interface for many numerical algorithms. It provides all the basics for anyone wanting to write portable code.
Stars: ✭ 22 (-72.5%)
UCNS3DUnstructured Compressible Navier Stokes 3D code (UCNS3D)
Stars: ✭ 141 (+76.25%)
CUDAfy.NETCUDAfy .NET allows easy development of high performance GPGPU applications completely from the .NET. It's developed in C#.
Stars: ✭ 56 (-30%)
Tf Quant FinanceHigh-performance TensorFlow library for quantitative finance.
Stars: ✭ 2,925 (+3556.25%)
DeepnetDeep.Net machine learning framework for F#
Stars: ✭ 99 (+23.75%)