gardenia
GARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators
Stars: ✭ 22 (-92.33%)
Arraymancer
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+176.31%)
Foundations of HPC 2021
This repository collects the materials from the course "Foundations of HPC", 2021, at the Data Science and Scientific Computing Department, University of Trieste
Stars: ✭ 22 (-92.33%)
Amgcl
C++ library for solving large sparse linear systems with algebraic multigrid method
Stars: ✭ 390 (+35.89%)
John
John the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs
Stars: ✭ 5,656 (+1870.73%)
libquo
Dynamic execution environments for coupled, thread-heterogeneous MPI+X applications
Stars: ✭ 21 (-92.68%)
Kratos
Kratos Multiphysics (a.k.a. Kratos) is a framework for building parallel multi-disciplinary simulation software. Modularity, extensibility and HPC are the main objectives. Kratos has a BSD license and is written in C++ with an extensive Python interface.
Stars: ✭ 558 (+94.43%)
hpc
Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc.)
Stars: ✭ 39 (-86.41%)
matrix multiplication
Parallel Matrix Multiplication Using OpenMP, Pthreads, and MPI
Stars: ✭ 41 (-85.71%)
crowdsource-video-experiments-on-android
Crowdsourcing video experiments (such as collaborative benchmarking and optimization of DNN algorithms) using Collective Knowledge Framework across diverse Android devices provided by volunteers. Results are continuously aggregated in the open repository:
Stars: ✭ 29 (-89.9%)
Babelstream
STREAM, for lots of devices written in many programming models
Stars: ✭ 121 (-57.84%)
Onednn
oneAPI Deep Neural Network Library (oneDNN)
Stars: ✭ 2,600 (+805.92%)
MOT
Multi-threaded Optimization Toolbox
Stars: ✭ 28 (-90.24%)
Edge
Extreme-scale Discontinuous Galerkin Environment (EDGE)
Stars: ✭ 18 (-93.73%)
vercors
The VerCors verification toolset for verifying parallel and concurrent software
Stars: ✭ 30 (-89.55%)
mbsolve
An open-source solver tool for the Maxwell-Bloch equations.
Stars: ✭ 14 (-95.12%)
Occa
JIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal
Stars: ✭ 230 (-19.86%)
Pyopencl
OpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+175.26%)
Core
Parallel finite element unstructured meshes
Stars: ✭ 124 (-56.79%)
Abyss
🔬 Assemble large genomes using short reads
Stars: ✭ 219 (-23.69%)
Dash
DASH, the C++ Template Library for Distributed Data Structures with Support for Hierarchical Locality for HPC and Data-Driven Science
Stars: ✭ 134 (-53.31%)
Fast
A framework for GPU-based high-performance medical image processing and visualization
Stars: ✭ 179 (-37.63%)
t8code
Parallel algorithms and data structures for tree-based AMR with arbitrary element shapes.
Stars: ✭ 37 (-87.11%)
hp2p
Heavy Peer To Peer: an MPI-based benchmark for network diagnostics
Stars: ✭ 17 (-94.08%)
learn-gpgpu
Algorithms implemented in CUDA + resources about GPGPU
Stars: ✭ 37 (-87.11%)
Faasm
High-performance stateful serverless runtime based on WebAssembly
Stars: ✭ 403 (+40.42%)
Compute
A C++ GPU Computing Library for OpenCL
Stars: ✭ 1,192 (+315.33%)
boxtree
Quad/octree building for FMMs in Python and OpenCL
Stars: ✭ 52 (-81.88%)
Ytk Mp4j
Ytk-mp4j is a fast, user-friendly, cross-platform, multi-process, multi-thread collective message passing Java library which includes gather, scatter, allgather, reduce-scatter, broadcast, reduce, and allreduce communications for distributed machine learning.
Stars: ✭ 102 (-64.46%)
Training Material
A collection of code examples as well as presentations for training purposes
Stars: ✭ 85 (-70.38%)
Primecount
🚀 Fast prime counting function implementations
Stars: ✭ 193 (-32.75%)
Adda
ADDA - light scattering simulator based on the discrete dipole approximation
Stars: ✭ 43 (-85.02%)
Elmerfem
Official git repository of Elmer FEM software
Stars: ✭ 523 (+82.23%)
Mpi4py
Python bindings for MPI
Stars: ✭ 388 (+35.19%)
nbodykit
Analysis kit for large-scale structure datasets, the massively parallel way
Stars: ✭ 93 (-67.6%)
muster
Massively Scalable Clustering
Stars: ✭ 22 (-92.33%)
concurrent-resource
A header-only C++ library for easily creating thread-safe, concurrency-friendly resources.
Stars: ✭ 17 (-94.08%)
Schwimmbad
A common interface to processing pools.
Stars: ✭ 82 (-71.43%)
Parenchyma
An extensible HPC framework for CUDA, OpenCL and native CPU.
Stars: ✭ 71 (-75.26%)
Ctranslate2
Fast inference engine for OpenNMT models
Stars: ✭ 140 (-51.22%)
developkit set
A 2021 roundup of recommended open-source C/C++ frameworks and libraries. Continuously updated.
Stars: ✭ 654 (+127.87%)
EFDCPlus
www.eemodelingsystem.com
Stars: ✭ 9 (-96.86%)
wxparaver
wxParaver is a trace-based visualization and analysis tool designed to study quantitative detailed metrics and obtain qualitative knowledge of the performance of applications, libraries, processors and whole architectures.
Stars: ✭ 23 (-91.99%)
gpubootcamp
This repository consists of GPU bootcamp material for HPC and AI
Stars: ✭ 227 (-20.91%)
hpdbscan
Highly parallel DBSCAN (HPDBSCAN)
Stars: ✭ 19 (-93.38%)
Torsten
Library of C++ functions that support applications of Stan in pharmacometrics
Stars: ✭ 38 (-86.76%)
Joblib
Computing with Python functions.
Stars: ✭ 2,620 (+812.89%)
yask
YASK (Yet Another Stencil Kit): a domain-specific language and framework to create high-performance stencil code for implementing finite-difference methods and similar applications.
Stars: ✭ 81 (-71.78%)
tbslas
A parallel, fast solver for the scalar advection-diffusion and the incompressible Navier-Stokes equations based on a semi-Lagrangian/volume-integral method.
Stars: ✭ 21 (-92.68%)
Bohrium
Automatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Stars: ✭ 209 (-27.18%)
pystella
A code generator for grid-based PDE solving on CPUs and GPUs
Stars: ✭ 18 (-93.73%)
hero-sdk
⛔ DEPRECATED ⛔ HERO Software Development Kit
Stars: ✭ 21 (-92.68%)
pyccel
Python extension language using accelerators
Stars: ✭ 189 (-34.15%)
Travis Buddy
🚀 Seamless integration between TravisCI and GitHub
Stars: ✭ 262 (-8.71%)
Pebble
Multi threading and processing eye-candy.
Stars: ✭ 276 (-3.83%)