hpcLearning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )
Stars: ✭ 39 (+178.57%)
claw-compilerCLAW Compiler for Performance Portability
Stars: ✭ 38 (+171.43%)
libquoDynamic execution environments for coupled, thread-heterogeneous MPI+X applications
Stars: ✭ 21 (+50%)
cruiseUser space POSIX-like file system in main memory
Stars: ✭ 27 (+92.86%)
KratosKratos Multiphysics (A.K.A Kratos) is a framework for building parallel multi-disciplinary simulation software. Modularity, extensibility and HPC are the main objectives. Kratos has BSD license and is written in C++ with extensive Python interface.
Stars: ✭ 558 (+3885.71%)
NsimdAgenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (+885.71%)
gardeniaGARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators
Stars: ✭ 22 (+57.14%)
matrix multiplicationParallel Matrix Multiplication Using OpenMP, Phtreads, and MPI
Stars: ✭ 41 (+192.86%)
yaskYASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-difference methods and similar applications.
Stars: ✭ 81 (+478.57%)
GPU-PathtracerGPU Raytracer from scratch in C++/CUDA
Stars: ✭ 326 (+2228.57%)
ParaphraseMulti-core suitable Forth-like language
Stars: ✭ 27 (+92.86%)
bifrostA stream processing framework for high-throughput applications.
Stars: ✭ 48 (+242.86%)
briefmatchBriefMatch real-time GPU optical flow
Stars: ✭ 36 (+157.14%)
disptoolsGenerate displacement fields with known volume changes
Stars: ✭ 17 (+21.43%)
qpmadROS-compatible Eigen-based Goldfarb-Idnani quadratic programming solver
Stars: ✭ 41 (+192.86%)
nBodyGPU-accelerated N-Body particle simulator with visualizer.
Stars: ✭ 28 (+100%)
conduitSimplified Data Exchange for HPC Simulations
Stars: ✭ 114 (+714.29%)
PbfVsImplementation of Macklin, Miles, and Matthias Müller. "Position based fluids.". Visual Studio 2015 + CUDA 8.0
Stars: ✭ 100 (+614.29%)
NN-CUDA-ExampleSeveral simple examples for popular neural network toolkits calling custom CUDA operators.
Stars: ✭ 594 (+4142.86%)
QPT[内测中]前向式Python环境快捷封装工具,快速将Python打包为EXE并添加CUDA、NoAVX等支持。
Stars: ✭ 308 (+2100%)
mini-nbodyA simple gravitational N-body simulation in less than 100 lines of C code, with CUDA optimizations.
Stars: ✭ 73 (+421.43%)
argobotsOfficial Argobots Repository
Stars: ✭ 71 (+407.14%)
k-meansCode accompanying my blog post on k-means in Python, C++ and CUDA
Stars: ✭ 56 (+300%)
cereCERE: Codelet Extractor and REplayer
Stars: ✭ 27 (+92.86%)
bazel.cmakebazel.cmake mimics the behavior of bazel to simplify the usability of CMake
Stars: ✭ 38 (+171.43%)
future.batchtools🚀 R package future.batchtools: A Future API for Parallel and Distributed Processing using batchtools
Stars: ✭ 77 (+450%)
Torstenlibrary of C++ functions that support applications of Stan in Pharmacometrics
Stars: ✭ 38 (+171.43%)
ck-envCK repository with components and automation actions to enable portable workflows across diverse platforms including Linux, Windows, MacOS and Android. It includes software detection plugins and meta packages (code, data sets, models, scripts, etc) with the possibility of multiple versions to co-exist in a user or system environment:
Stars: ✭ 67 (+378.57%)
ESAEasy SimAuto (ESA): An easy-to-use Power System Analysis Automation Environment atop PowerWorld Simulator Automation Server (SimAuto)
Stars: ✭ 26 (+85.71%)
peakperfAchieve peak performance on x86 CPUs and NVIDIA GPUs
Stars: ✭ 33 (+135.71%)
articThe AlteRnaTive Impala Compiler
Stars: ✭ 16 (+14.29%)
dtype-nextA Clojure library designed to aid in the implementation of high performance algorithms and systems.
Stars: ✭ 193 (+1278.57%)
libmsrWrapper library for model-specific registers. APIs cover RAPL, performance counters, clocks and turbo.
Stars: ✭ 47 (+235.71%)
tensorflow-windowsTensorFlow builds compiled on windows with avx and avx2 extensions
Stars: ✭ 20 (+42.86%)
revisiting-sepconvan implementation of Revisiting Adaptive Convolutions for Video Frame Interpolation using PyTorch
Stars: ✭ 43 (+207.14%)
GOSHAn ultra-fast, GPU-based large graph embedding algorithm utilizing a novel coarsening algorithm requiring not more than a single GPU.
Stars: ✭ 12 (-14.29%)
stiff3Adaptive solver for stiff systems of ODEs using semi-implicit Runge-Kutta method of third order
Stars: ✭ 13 (-7.14%)
gproshangeometry processing and shape analysis framework
Stars: ✭ 48 (+242.86%)
capture3C++ research project to learn more about cameras, image processing, color spaces, OpenCV and multi‑threading.
Stars: ✭ 17 (+21.43%)
libROMModel reduction library with an emphasis on large scale parallelism and linear subspace methods
Stars: ✭ 66 (+371.43%)
JampackExperimental parallel compression algorithm
Stars: ✭ 21 (+50%)
xdmodAn open framework for collecting and analyzing HPC metrics.
Stars: ✭ 55 (+292.86%)
octotigerAstrophysics program simulating the evolution of star systems based on the fast multipole method on adaptive Octrees
Stars: ✭ 30 (+114.29%)
pqdmComfortable parallel TQDM using concurrent.futures
Stars: ✭ 118 (+742.86%)
hpdbscanHighly parallel DBSCAN (HPDBSCAN)
Stars: ✭ 19 (+35.71%)
hybridCentralSolversUnited collection of hybrid Central solvers - one-phase, two-phase and multicomponent versions
Stars: ✭ 42 (+200%)
tbslasA parallel, fast solver for the scalar advection-diffusion and the incompressible Navier-Stokes equations based on semi-Lagrangian/Volume-Integral method.
Stars: ✭ 21 (+50%)
cuda-toolkitGitHub Action to install CUDA
Stars: ✭ 34 (+142.86%)
WaferParallelized 3D FDTD Schrödinger Equation Solver
Stars: ✭ 21 (+50%)
ImpalaParallel High-Performance Components for Rhino/Grasshopper
Stars: ✭ 32 (+128.57%)
SoliditySHA3MinerAll-in-one mixed multi-GPU (nVidia, AMD, Intel) & CPU miner solves proof of work to mine supported EIP918 tokens in a single instance (with API).
Stars: ✭ 28 (+100%)
EFDCPluswww.eemodelingsystem.com
Stars: ✭ 9 (-35.71%)
PartitionedArrays.jlVectors and sparse matrices partitioned into pieces for parallel distributed-memory computations.
Stars: ✭ 45 (+221.43%)
CoriumCorium is a modern scripting language which combines simple, safe and efficient programming.
Stars: ✭ 18 (+28.57%)
NewsMTSCTarget-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k sentences and a state-of-the-art classification model.
Stars: ✭ 54 (+285.71%)