ArraymancerA fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+3504.55%)
KernelsThis is a set of simple programs that can be used to explore the features of a parallel platform.
Stars: ✭ 287 (+1204.55%)
learn-gpgpuAlgorithms implemented in CUDA + resources about GPGPU
Stars: ✭ 37 (+68.18%)
hero-sdk⛔ DEPRECATED ⛔ HERO Software Development Kit
Stars: ✭ 21 (-4.55%)
FastA framework for GPU based high-performance medical image processing and visualization
Stars: ✭ 179 (+713.64%)
hpcLearning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )
Stars: ✭ 39 (+77.27%)
KratosKratos Multiphysics (A.K.A Kratos) is a framework for building parallel multi-disciplinary simulation software. Modularity, extensibility and HPC are the main objectives. Kratos has BSD license and is written in C++ with extensive Python interface.
Stars: ✭ 558 (+2436.36%)
Ctranslate2Fast inference engine for OpenNMT models
Stars: ✭ 140 (+536.36%)
ClojureclClojureCL is a Clojure library for parallel computations with OpenCL.
Stars: ✭ 266 (+1109.09%)
HipsyclImplementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (+1613.64%)
Trisycl Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group
Stars: ✭ 354 (+1509.09%)
AmgclC++ library for solving large sparse linear systems with algebraic multigrid method
Stars: ✭ 390 (+1672.73%)
yaskYASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-difference methods and similar applications.
Stars: ✭ 81 (+268.18%)
HeCBenchsoftware.intel.com/content/www/us/en/develop/articles/repo-evaluating-performance-productivity-oneapi.html
Stars: ✭ 85 (+286.36%)
hipercHigh Performance Computing Strategies for Boundary Value Problems
Stars: ✭ 36 (+63.64%)
monolishmonolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
Stars: ✭ 166 (+654.55%)
vercorsThe VerCors verification toolset for verifying parallel and concurrent software
Stars: ✭ 30 (+36.36%)
crowdsource-video-experiments-on-androidCrowdsourcing video experiments (such as collaborative benchmarking and optimization of DNN algorithms) using Collective Knowledge Framework across diverse Android devices provided by volunteers. Results are continuously aggregated in the open repository:
Stars: ✭ 29 (+31.82%)
BayaderaHigh-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (+1454.55%)
articThe AlteRnaTive Impala Compiler
Stars: ✭ 16 (-27.27%)
Autodock GpuAutoDock for GPUs and other accelerators
Stars: ✭ 65 (+195.45%)
ParenchymaAn extensible HPC framework for CUDA, OpenCL and native CPU.
Stars: ✭ 71 (+222.73%)
OpenclgaA Python Library for Genetic Algorithm on OpenCL
Stars: ✭ 103 (+368.18%)
OnednnoneAPI Deep Neural Network Library (oneDNN)
Stars: ✭ 2,600 (+11718.18%)
BabelstreamSTREAM, for lots of devices written in many programming models
Stars: ✭ 121 (+450%)
BohriumAutomatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Stars: ✭ 209 (+850%)
pgx-samplesApplications using Parallel Graph AnalytiX (PGX) from Oracle Labs
Stars: ✭ 39 (+77.27%)
matrix multiplicationParallel Matrix Multiplication Using OpenMP, Phtreads, and MPI
Stars: ✭ 41 (+86.36%)
AccelerateEmbedded language for high-performance array computations
Stars: ✭ 751 (+3313.64%)
Stdgpustdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+2313.64%)
mbsolveAn open-source solver tool for the Maxwell-Bloch equations.
Stars: ✭ 14 (-36.36%)
TaskflowA General-purpose Parallel and Heterogeneous Task Programming System
Stars: ✭ 6,128 (+27754.55%)
EtalerA flexable HTM (Hierarchical Temporal Memory) framework with full GPU support.
Stars: ✭ 79 (+259.09%)
dlprimitivesDeep Learning Primitives and Mini-Framework for OpenCL
Stars: ✭ 65 (+195.45%)
MOTMulti-threaded Optimization Toolbox
Stars: ✭ 28 (+27.27%)
boxtreeQuad/octree building for FMMs in Python and OpenCL
Stars: ✭ 52 (+136.36%)
CUDAfy.NETCUDAfy .NET allows easy development of high performance GPGPU applications completely from the .NET. It's developed in C#.
Stars: ✭ 56 (+154.55%)
BlendluxcoreBlender Integration for LuxCore
Stars: ✭ 287 (+1204.55%)
Graphs.jlAn optimized graphs package for the Julia programming language
Stars: ✭ 197 (+795.45%)
GapbsGAP Benchmark Suite
Stars: ✭ 165 (+650%)
CRONOA Shared Memory Multithreaded Graph Benchmark Suite for Multicores
Stars: ✭ 21 (-4.55%)
NeanderthalFast Clojure Matrix Library
Stars: ✭ 927 (+4113.64%)
CekirdeklerMulti-device OpenCL kernel load balancer and pipeliner API for C#. Uses shared-distributed memory model to keep GPUs updated fast while using same kernel on all devices(for simplicity).
Stars: ✭ 76 (+245.45%)
PyopenclOpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+3490.91%)
opensbliA framework for the automated derivation and parallel execution of finite difference solvers on a range of computer architectures.
Stars: ✭ 56 (+154.55%)
ClvkExperimental implementation of OpenCL on Vulkan
Stars: ✭ 158 (+618.18%)
OccaJIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal
Stars: ✭ 230 (+945.45%)
LuxcoreLuxCore source repository
Stars: ✭ 601 (+2631.82%)
gpuowlGPU Mersenne primality test.
Stars: ✭ 77 (+250%)
GOSHAn ultra-fast, GPU-based large graph embedding algorithm utilizing a novel coarsening algorithm requiring not more than a single GPU.
Stars: ✭ 12 (-45.45%)
graphsimR package: Simulate Expression data from igraph network using mvtnorm (CRAN; JOSS)
Stars: ✭ 16 (-27.27%)
LightGraphs.jlAn optimized graphs package for the Julia programming language
Stars: ✭ 680 (+2990.91%)
OpenPHParallel reduction of boundary matrices for Persistent Homology with CUDA
Stars: ✭ 14 (-36.36%)
PyMFEMPython wrapper for MFEM
Stars: ✭ 91 (+313.64%)
JohnJohn the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs
Stars: ✭ 5,656 (+25609.09%)
grblasPython wrapper around GraphBLAS
Stars: ✭ 22 (+0%)
fahbenchFolding@home GPU benchmark
Stars: ✭ 32 (+45.45%)
Foundations of HPC 2021This repository collects the materials from the course "Foundations of HPC", 2021, at the Data Science and Scientific Computing Department, University of Trieste
Stars: ✭ 22 (+0%)