HipsyclImplementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (+179.26%)
NeanderthalFast Clojure Matrix Library
Stars: ✭ 927 (+586.67%)
GOSHAn ultra-fast, GPU-based large graph embedding algorithm utilizing a novel coarsening algorithm requiring not more than a single GPU.
Stars: ✭ 12 (-91.11%)
BayaderaHigh-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (+153.33%)
rbcudaCUDA bindings for Ruby
Stars: ✭ 57 (-57.78%)
ArraymancerA fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+487.41%)
RelionImage-processing software for cryo-electron microscopy
Stars: ✭ 219 (+62.22%)
DaceDaCe - Data Centric Parallel Programming
Stars: ✭ 106 (-21.48%)
LuxcoreLuxCore source repository
Stars: ✭ 601 (+345.19%)
ClojurecudaClojure library for CUDA development
Stars: ✭ 158 (+17.04%)
opensbliA framework for the automated derivation and parallel execution of finite difference solvers on a range of computer architectures.
Stars: ✭ 56 (-58.52%)
Tf Quant FinanceHigh-performance TensorFlow library for quantitative finance.
Stars: ✭ 2,925 (+2066.67%)
cuda memtestFork of CUDA GPU memtest 👓
Stars: ✭ 68 (-49.63%)
TutorialsSome basic programming tutorials
Stars: ✭ 353 (+161.48%)
Cuda Api WrappersThin C++-flavored wrappers for the CUDA Runtime API
Stars: ✭ 362 (+168.15%)
VuhVulkan compute for people
Stars: ✭ 264 (+95.56%)
DeepnetDeep.Net machine learning framework for F#
Stars: ✭ 99 (-26.67%)
TaskflowA General-purpose Parallel and Heterogeneous Task Programming System
Stars: ✭ 6,128 (+4439.26%)
HeCBenchsoftware.intel.com/content/www/us/en/develop/articles/repo-evaluating-performance-productivity-oneapi.html
Stars: ✭ 85 (-37.04%)
bifrostA stream processing framework for high-throughput applications.
Stars: ✭ 48 (-64.44%)
ClojureclClojureCL is a Clojure library for parallel computations with OpenCL.
Stars: ✭ 266 (+97.04%)
MatXAn efficient C++17 GPU numerical computing library with Python-like syntax
Stars: ✭ 418 (+209.63%)
Nvidia libs testTests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (-73.33%)
AccelerateEmbedded language for high-performance array computations
Stars: ✭ 751 (+456.3%)
SixtyfourHow fast can we brute force a 64-bit comparison?
Stars: ✭ 41 (-69.63%)
PycudaCUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (+723.7%)
GinkgoNumerical linear algebra software package
Stars: ✭ 149 (+10.37%)
Stdgpustdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+293.33%)
HeteroflowConcurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-57.78%)
Autodock GpuAutoDock for GPUs and other accelerators
Stars: ✭ 65 (-51.85%)
Futhark💥💻💥 A data-parallel functional programming language
Stars: ✭ 1,641 (+1115.56%)
BabelstreamSTREAM, for lots of devices written in many programming models
Stars: ✭ 121 (-10.37%)
CuheCUDA Homomorphic Encryption Library
Stars: ✭ 109 (-19.26%)
Surrogates.jlSurrogate modeling and optimization for scientific machine learning (SciML)
Stars: ✭ 121 (-10.37%)
HashcatWorld's fastest and most advanced password recovery utility
Stars: ✭ 11,014 (+8058.52%)
AspectA parallel, extensible finite element code to simulate convection in both 2D and 3D models.
Stars: ✭ 120 (-11.11%)
ClustermqR package to send function calls as jobs on LSF, SGE, Slurm, PBS/Torque, or each via SSH
Stars: ✭ 106 (-21.48%)
LibcudacxxThe C++ Standard Library for your entire system.
Stars: ✭ 1,861 (+1278.52%)
BatchtoolsTools for computation on batch systems
Stars: ✭ 127 (-5.93%)
Pyhpc BenchmarksA suite of benchmarks to test the sequential CPU and GPU performance of most popular high-performance libraries for Python.
Stars: ✭ 119 (-11.85%)
Cuda WinogradFast CUDA Kernels for ResNet Inference.
Stars: ✭ 104 (-22.96%)
OpenclgaA Python Library for Genetic Algorithm on OpenCL
Stars: ✭ 103 (-23.7%)
Knn cudaFast K-Nearest Neighbor search with GPU
Stars: ✭ 119 (-11.85%)
PygraphistryPyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer
Stars: ✭ 1,365 (+911.11%)
NnpackAcceleration package for neural networks on multi-core CPUs
Stars: ✭ 1,538 (+1039.26%)
DppDetail-Preserving Pooling in Deep Networks (CVPR 2018)
Stars: ✭ 99 (-26.67%)
EmuThe write-once-run-anywhere GPGPU library for Rust
Stars: ✭ 1,350 (+900%)
Tensorflow Optimized WheelsTensorFlow wheels built for latest CUDA/CuDNN and enabled performance flags: SSE, AVX, FMA; XLA
Stars: ✭ 118 (-12.59%)
NyuziprocessorGPGPU microprocessor architecture
Stars: ✭ 1,351 (+900.74%)
Extending JaxExtending JAX with custom C++ and CUDA code
Stars: ✭ 98 (-27.41%)
MixbenchA GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL)
Stars: ✭ 130 (-3.7%)
Kaldikaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+8160%)
SpocStream Processing with OCaml
Stars: ✭ 115 (-14.81%)
SupraSUPRA: Software Defined Ultrasound Processing for Real-Time Applications - An Open Source 2D and 3D Pipeline from Beamforming to B-Mode
Stars: ✭ 96 (-28.89%)
PynvvlA Python wrapper of NVIDIA Video Loader (NVVL) with CuPy for fast video loading with Python
Stars: ✭ 95 (-29.63%)