QUICK
QUICK: A GPU-enabled ab initio quantum chemistry software package
Stars: ✭ 79 (+58%)
Bohrium
Automatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Stars: ✭ 209 (+318%)
Gipuma
Massively Parallel Multiview Stereopsis by Surface Normal Diffusion
Stars: ✭ 220 (+340%)
job stream
An MPI-based C++ or Python library for easy distributed pipeline processing
Stars: ✭ 32 (-36%)
Dkeras
Distributed Keras Engine: make Keras faster with only one line of code.
Stars: ✭ 181 (+262%)
Raja
RAJA Performance Portability Layer (C++)
Stars: ✭ 231 (+362%)
t8code
Parallel algorithms and data structures for tree-based AMR with arbitrary element shapes.
Stars: ✭ 37 (-26%)
Joblib
Computing with Python functions.
Stars: ✭ 2,620 (+5140%)
corebench
corebench - run your benchmarks against high performance computing servers with many CPU cores
Stars: ✭ 29 (-42%)
Klyng
A message-passing distributed computing framework for node.js
Stars: ✭ 167 (+234%)
Foundations of HPC 2021
This repository collects the materials from the course "Foundations of HPC", 2021, at the Data Science and Scientific Computing Department, University of Trieste
Stars: ✭ 22 (-56%)
Geni
A Clojure dataframe library that runs on Spark
Stars: ✭ 152 (+204%)
Pothoscore
The Pothos data-flow framework
Stars: ✭ 232 (+364%)
Nwchem
NWChem: Open Source High-Performance Computational Chemistry
Stars: ✭ 227 (+354%)
pcluster-manager
Manage AWS ParallelCluster through an easy-to-use web interface
Stars: ✭ 67 (+34%)
vuo
A realtime visual programming language for interactive media.
Stars: ✭ 103 (+106%)
Hyperactive
A hyperparameter optimization and data collection toolbox for convenient and fast prototyping of machine-learning models.
Stars: ✭ 182 (+264%)
gardenia
GARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators
Stars: ✭ 22 (-56%)
Ngsolve
Netgen/NGSolve is a high performance multiphysics finite element software. It is widely used to analyze models from solid mechanics, fluid dynamics and electromagnetics. Due to its flexible Python interface, new physical equations and solution algorithms can be implemented easily.
Stars: ✭ 171 (+242%)
cef-mixer
High Performance off-screen rendering (OSR) demo using CEF
Stars: ✭ 183 (+266%)
Samrai
Structured Adaptive Mesh Refinement Application Infrastructure - a scalable C++ framework for block-structured AMR application development
Stars: ✭ 160 (+220%)
ps pytorch
Implement distributed machine learning with PyTorch + OpenMPI
Stars: ✭ 47 (-6%)
Opencoarrays
A parallel application binary interface for Fortran 2018 compilers.
Stars: ✭ 151 (+202%)
opensbli
A framework for the automated derivation and parallel execution of finite difference solvers on a range of computer architectures.
Stars: ✭ 56 (+12%)
Pyexpool
Python Multi-Process Execution Pool: concurrent asynchronous execution pool with custom resource constraints (memory, timeouts, affinity, CPU cores and caching), load balancing and profiling capabilities of the external apps on NUMA architecture
Stars: ✭ 149 (+198%)
Amadeus
Harmonious distributed data analysis in Rust.
Stars: ✭ 240 (+380%)
pestpp
Tools for scalable and non-intrusive parameter estimation, uncertainty analysis and sensitivity analysis
Stars: ✭ 90 (+80%)
mango
Parallel Hyperparameter Tuning in Python
Stars: ✭ 241 (+382%)
Feelpp
💎 Feel++: Finite Element Embedded Language and Library in C++
Stars: ✭ 229 (+358%)
Crossbow
Crossbow: A Multi-GPU Deep Learning System for Training with Small Batch Sizes
Stars: ✭ 52 (+4%)
Dispy
Distributed and Parallel Computing Framework with / for Python
Stars: ✭ 222 (+344%)
learn-gpgpu
Algorithms implemented in CUDA + resources about GPGPU
Stars: ✭ 37 (-26%)
Opentimer
A High-performance Timing Analysis Tool for VLSI Systems
Stars: ✭ 213 (+326%)
brian2cuda
A brian2 extension to simulate spiking neural networks on GPUs
Stars: ✭ 46 (-8%)
hpc
Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc.)
Stars: ✭ 39 (-22%)
Ctranslate2
Fast inference engine for OpenNMT models
Stars: ✭ 140 (+180%)
Sundials
SUNDIALS is a SUite of Nonlinear and DIfferential/ALgebraic equation Solvers. This is a mirror of current releases, and development will move here eventually. Pull requests are welcome for bug fixes and minor changes.
Stars: ✭ 194 (+288%)
gpuvmem
GPU Framework for Radio Astronomical Image Synthesis
Stars: ✭ 27 (-46%)
Fast
A framework for GPU-based high-performance medical image processing and visualization
Stars: ✭ 179 (+258%)
GoldenSun
A path tracer based on hardware ray tracing
Stars: ✭ 20 (-60%)
Dolfinx
Next generation FEniCS problem solving environment
Stars: ✭ 171 (+242%)
pyabc
pyABC: distributed, likelihood-free inference
Stars: ✭ 13 (-74%)
Awesome Machine Learning In Compilers
Must-read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation
Stars: ✭ 168 (+236%)
Bigmachine
Bigmachine is a library for self-managing serverless computing in Go
Stars: ✭ 167 (+234%)
super-workers
🐴 Distribute load on front-end via parallelism
Stars: ✭ 93 (+86%)
Future.apply
🚀 R package: future.apply - Apply Function to Elements in Parallel using Futures
Stars: ✭ 159 (+218%)
CARE
CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as loop fusion capability and a portable interface for many numerical algorithms. It provides all the basics for anyone wanting to write portable code.
Stars: ✭ 22 (-56%)
Embb
Embedded Multicore Building Blocks (EMB²): Library for parallel programming of embedded systems. Star us on GitHub? +1
Stars: ✭ 153 (+206%)
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+200%)
JUDI.jl
Julia Devito inversion.
Stars: ✭ 71 (+42%)
hp2p
Heavy Peer To Peer: an MPI-based benchmark for network diagnostics
Stars: ✭ 17 (-66%)
Apriori-and-Eclat-Frequent-Itemset-Mining
Python implementation of the Apriori and Eclat algorithms, two of the best-known basic algorithms for mining frequent item sets in a set of transactions.
Stars: ✭ 36 (-28%)
Lazy
Light-weight header-only library for parallel function calls and continuations in C++ based on Eric Niebler's talk at CppCon 2019.
Stars: ✭ 93 (+86%)
ParallelQSlim
Shape Aware Parallel Mesh Simplification Algorithm
Stars: ✭ 84 (+68%)
pytod
TOD: GPU-accelerated Outlier Detection via Tensor Operations
Stars: ✭ 131 (+162%)
raster
A micro server framework with coroutine and parallel-computing support, used for building flatbuffers/thrift/protobuf/http protocol services.
Stars: ✭ 19 (-62%)