Training MaterialA collection of code examples as well as presentations for training purposes
Stars: ✭ 85 (+325%)
Stdgpustdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+2555%)
OccaJIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal
Stars: ✭ 230 (+1050%)
Corrfunc⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Stars: ✭ 114 (+470%)
KernelsThis is a set of simple programs that can be used to explore the features of a parallel platform.
Stars: ✭ 287 (+1335%)
contechThe Contech analysis framework provides the means for generating and analyzing task graphs that enable computer architects and programmers to gain a deeper understanding of parallel programs.
Stars: ✭ 43 (+115%)
EdgeExtreme-scale Discontinuous Galerkin Environment (EDGE)
Stars: ✭ 18 (-10%)
java-cpp-exampleExample of using C++ classes from Java. Showcases SWIG, JNA and JNI
Stars: ✭ 135 (+575%)
AmgclC++ library for solving large sparse linear systems with algebraic multigrid method
Stars: ✭ 390 (+1850%)
OnednnoneAPI Deep Neural Network Library (oneDNN)
Stars: ✭ 2,600 (+12900%)
Ctranslate2Fast inference engine for OpenNMT models
Stars: ✭ 140 (+600%)
mbsolveAn open-source solver tool for the Maxwell-Bloch equations.
Stars: ✭ 14 (-30%)
ForTrilinosForTrilinos provides portable object-oriented Fortran interfaces to Trilinos C++ packages.
Stars: ✭ 24 (+20%)
Ytk Mp4jYtk-mp4j is a fast, user-friendly, cross-platform, multi-process, multi-thread collective message passing java library which includes gather, scatter, allgather, reduce-scatter, broadcast, reduce, allreduce communications for distributed machine learning.
Stars: ✭ 102 (+410%)
canorusCanorus is a free cross-platform music score editor
Stars: ✭ 25 (+25%)
Foundations of HPC 2021This repository collects the materials from the course "Foundations of HPC", 2021, at the Data Science and Scientific Computing Department, University of Trieste
Stars: ✭ 22 (+10%)
KratosKratos Multiphysics (A.K.A Kratos) is a framework for building parallel multi-disciplinary simulation software. Modularity, extensibility and HPC are the main objectives. Kratos has BSD license and is written in C++ with extensive Python interface.
Stars: ✭ 558 (+2690%)
yadomsYadoms is open source, simple, powerfull, flexible and multiplatforms domotic solution.
Stars: ✭ 56 (+180%)
OptimOptimLib: a lightweight C++ library of numerical optimization methods for nonlinear functions
Stars: ✭ 411 (+1955%)
Dive Into Ml SystemDive into machine learning system, start from reinventing the wheel.
Stars: ✭ 220 (+1000%)
WeaveA state-of-the-art multithreading runtime: message-passing based, fast, scalable, ultra-low overhead
Stars: ✭ 305 (+1425%)
cpu-gbfilter♨️ Optimized Gaussian blur filter on CPU.
Stars: ✭ 14 (-30%)
crowdsource-video-experiments-on-androidCrowdsourcing video experiments (such as collaborative benchmarking and optimization of DNN algorithms) using Collective Knowledge Framework across diverse Android devices provided by volunteers. Results are continuously aggregated in the open repository:
Stars: ✭ 29 (+45%)
LaserThe HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers
Stars: ✭ 191 (+855%)
GapbsGAP Benchmark Suite
Stars: ✭ 165 (+725%)
Torstenlibrary of C++ functions that support applications of Stan in Pharmacometrics
Stars: ✭ 38 (+90%)
pyaerLow-level Python APIs for Accessing Neuromorphic Devices.
Stars: ✭ 20 (+0%)
BabelstreamSTREAM, for lots of devices written in many programming models
Stars: ✭ 121 (+505%)
libquoDynamic execution environments for coupled, thread-heterogeneous MPI+X applications
Stars: ✭ 21 (+5%)
Arm VoEfficient monocular visual odometry for ground vehicles on ARM processors
Stars: ✭ 115 (+475%)
swig-srilmSWIG Wrapper for the SRILM toolkit
Stars: ✭ 33 (+65%)
CompactnsearchA C++ library to compute neighborhood information for point clouds within a fixed radius. Suitable for many applications, e.g. neighborhood search for SPH fluid simulations.
Stars: ✭ 93 (+365%)
ByteSlice"Byteslice: Pushing the envelop of main memory data processing with a new storage layout" (SIGMOD'15)
Stars: ✭ 24 (+20%)
NbodyN body gravity attraction problem solver
Stars: ✭ 40 (+100%)
URTFast Unit Root Tests and OLS regression in C++ with wrappers for R and Python
Stars: ✭ 70 (+250%)
ArraymancerA fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+3865%)
EmsExtended Memory Semantics - Persistent shared object memory and parallelism for Node.js and Python
Stars: ✭ 552 (+2660%)
NPB-CPPNAS Parallel Benchmark Kernels in C/C++. The parallel versions are in FastFlow, TBB, and OpenMP.
Stars: ✭ 18 (-10%)
JohnJohn the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs
Stars: ✭ 5,656 (+28180%)
Abyss🔬 Assemble large genomes using short reads
Stars: ✭ 219 (+995%)
FaasmHigh-performance stateful serverless runtime based on WebAssembly
Stars: ✭ 403 (+1915%)
claw-compilerCLAW Compiler for Performance Portability
Stars: ✭ 38 (+90%)
Armadillo CodeArmadillo: fast C++ library for linear algebra & scientific computing - http://arma.sourceforge.net
Stars: ✭ 388 (+1840%)
BvhA modern C++ BVH construction and traversal library
Stars: ✭ 216 (+980%)
StatsA C++ header-only library of statistical distribution functions.
Stars: ✭ 292 (+1360%)
boltOfficial BOLT Repository
Stars: ✭ 19 (-5%)
Primecount🚀 Fast prime counting function implementations
Stars: ✭ 193 (+865%)
vercorsThe VerCors verification toolset for verifying parallel and concurrent software
Stars: ✭ 30 (+50%)
go-snowboyGo wrapper for Kitt-AI's snowboy audio detection library.
Stars: ✭ 40 (+100%)
BkcrackCrack legacy zip encryption with Biham and Kocher's known plaintext attack.
Stars: ✭ 178 (+790%)
nanoxNanos++ is a runtime designed to serve as runtime support in parallel environments. It is mainly used to support OmpSs, a extension to OpenMP developed at BSC.
Stars: ✭ 37 (+85%)
CuVecUnifying Python/C++/CUDA memory: Python buffered array ↔️ `std::vector` ↔️ CUDA managed memory
Stars: ✭ 73 (+265%)
gardeniaGARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators
Stars: ✭ 22 (+10%)
Rawspeedfast raw decoding library
Stars: ✭ 179 (+795%)