claw-compiler
CLAW Compiler for Performance Portability
Stars: ✭ 38 (-35.59%)
nanox
Nanos++ is a runtime designed to serve as runtime support in parallel environments. It is mainly used to support OmpSs, an extension to OpenMP developed at BSC.
Stars: ✭ 37 (-37.29%)
Edge
Extreme-scale Discontinuous Galerkin Environment (EDGE)
Stars: ✭ 18 (-69.49%)
Onednn
oneAPI Deep Neural Network Library (oneDNN)
Stars: ✭ 2,600 (+4306.78%)
Amgcl
C++ library for solving large sparse linear systems with algebraic multigrid method
Stars: ✭ 390 (+561.02%)
Training Material
A collection of code examples as well as presentations for training purposes
Stars: ✭ 85 (+44.07%)
Occa
JIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal
Stars: ✭ 230 (+289.83%)
Stdgpu
stdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+800%)
cpu-gbfilter
♨️ Optimized Gaussian blur filter on CPU.
Stars: ✭ 14 (-76.27%)
Kernels
This is a set of simple programs that can be used to explore the features of a parallel platform.
Stars: ✭ 287 (+386.44%)
Rawspeed
fast raw decoding library
Stars: ✭ 179 (+203.39%)
mbsolve
An open-source solver tool for the Maxwell-Bloch equations.
Stars: ✭ 14 (-76.27%)
Ytk Mp4j
Ytk-mp4j is a fast, user-friendly, cross-platform, multi-process, multi-thread collective message passing java library which includes gather, scatter, allgather, reduce-scatter, broadcast, reduce, allreduce communications for distributed machine learning.
Stars: ✭ 102 (+72.88%)
Foundations of HPC 2021
This repository collects the materials from the course "Foundations of HPC", 2021, at the Data Science and Scientific Computing Department, University of Trieste
Stars: ✭ 22 (-62.71%)
URT
Fast Unit Root Tests and OLS regression in C++ with wrappers for R and Python
Stars: ✭ 70 (+18.64%)
Kratos
Kratos Multiphysics (a.k.a. Kratos) is a framework for building parallel multi-disciplinary simulation software. Modularity, extensibility and HPC are the main objectives. Kratos has a BSD license and is written in C++ with an extensive Python interface.
Stars: ✭ 558 (+845.76%)
Dive Into Ml System
Dive into machine learning system, start from reinventing the wheel.
Stars: ✭ 220 (+272.88%)
Optim
OptimLib: a lightweight C++ library of numerical optimization methods for nonlinear functions
Stars: ✭ 411 (+596.61%)
matrix multiplication
Parallel Matrix Multiplication Using OpenMP, Pthreads, and MPI
Stars: ✭ 41 (-30.51%)
Weave
A state-of-the-art multithreading runtime: message-passing based, fast, scalable, ultra-low overhead
Stars: ✭ 305 (+416.95%)
Laser
The HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers
Stars: ✭ 191 (+223.73%)
crowdsource-video-experiments-on-android
Crowdsourcing video experiments (such as collaborative benchmarking and optimization of DNN algorithms) using Collective Knowledge Framework across diverse Android devices provided by volunteers. Results are continuously aggregated in the open repository:
Stars: ✭ 29 (-50.85%)
gardenia
GARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators
Stars: ✭ 22 (-62.71%)
Ctranslate2
Fast inference engine for OpenNMT models
Stars: ✭ 140 (+137.29%)
pyccel
Python extension language using accelerators
Stars: ✭ 189 (+220.34%)
Arm Vo
Efficient monocular visual odometry for ground vehicles on ARM processors
Stars: ✭ 115 (+94.92%)
clava
C/C++ Source-to-Source Tool based on Clang
Stars: ✭ 55 (-6.78%)
Compactnsearch
A C++ library to compute neighborhood information for point clouds within a fixed radius. Suitable for many applications, e.g. neighborhood search for SPH fluid simulations.
Stars: ✭ 93 (+57.63%)
mython
The Mython extensible variant of the Python programming language.
Stars: ✭ 16 (-72.88%)
Guided Missile Simulation
Guided Missile, Radar and Infrared EOS Simulation Framework written in Fortran.
Stars: ✭ 33 (-44.07%)
Nbody
N body gravity attraction problem solver
Stars: ✭ 40 (-32.2%)
ByteSlice
"Byteslice: Pushing the envelop of main memory data processing with a new storage layout" (SIGMOD'15)
Stars: ✭ 24 (-59.32%)
Arraymancer
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+1244.07%)
Abyss
🔬 Assemble large genomes using short reads
Stars: ✭ 219 (+271.19%)
Ems
Extended Memory Semantics - Persistent shared object memory and parallelism for Node.js and Python
Stars: ✭ 552 (+835.59%)
John
John the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs
Stars: ✭ 5,656 (+9486.44%)
Bvh
A modern C++ BVH construction and traversal library
Stars: ✭ 216 (+266.1%)
Faasm
High-performance stateful serverless runtime based on WebAssembly
Stars: ✭ 403 (+583.05%)
hpdbscan
Highly parallel DBSCAN (HPDBSCAN)
Stars: ✭ 19 (-67.8%)
Armadillo Code
Armadillo: fast C++ library for linear algebra & scientific computing - http://arma.sourceforge.net
Stars: ✭ 388 (+557.63%)
Primecount
🚀 Fast prime counting function implementations
Stars: ✭ 193 (+227.12%)
Stats
A C++ header-only library of statistical distribution functions.
Stars: ✭ 292 (+394.92%)
flextool
C++ compile-time programming (serialization, reflection, code modification, enum to string, better enum, enum to json, extend or parse language, etc.)
Stars: ✭ 32 (-45.76%)
bolt
Official BOLT Repository
Stars: ✭ 19 (-67.8%)
Bkcrack
Crack legacy zip encryption with Biham and Kocher's known plaintext attack.
Stars: ✭ 178 (+201.69%)
vercors
The VerCors verification toolset for verifying parallel and concurrent software
Stars: ✭ 30 (-49.15%)
libquo
Dynamic execution environments for coupled, thread-heterogeneous MPI+X applications
Stars: ✭ 21 (-64.41%)
Torsten
library of C++ functions that support applications of Stan in Pharmacometrics
Stars: ✭ 38 (-35.59%)
Gapbs
GAP Benchmark Suite
Stars: ✭ 165 (+179.66%)
capture3
C++ research project to learn more about cameras, image processing, color spaces, OpenCV and multi‑threading.
Stars: ✭ 17 (-71.19%)
contech
The Contech analysis framework provides the means for generating and analyzing task graphs that enable computer architects and programmers to gain a deeper understanding of parallel programs.
Stars: ✭ 43 (-27.12%)
Babelstream
STREAM, for lots of devices written in many programming models
Stars: ✭ 121 (+105.08%)
FoxNN
Simple neural network
Stars: ✭ 20 (-66.1%)
NPB-CPP
NAS Parallel Benchmark Kernels in C/C++. The parallel versions are in FastFlow, TBB, and OpenMP.
Stars: ✭ 18 (-69.49%)
com2ann
Tool for translating type comments to type annotations in Python
Stars: ✭ 115 (+94.92%)
developkit set
The latest 2021 roundup of recommended open-source C/C++ frameworks and libraries. Continuously updated.
Stars: ✭ 654 (+1008.47%)
Corrfunc
⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Stars: ✭ 114 (+93.22%)