BohriumAutomatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Stars: ✭ 209 (+75.63%)
BatchtoolsTools for computation on batch systems
Stars: ✭ 127 (+6.72%)
Tf Quant FinanceHigh-performance TensorFlow library for quantitative finance.
Stars: ✭ 2,925 (+2357.98%)
GeniA Clojure dataframe library that runs on Spark
Stars: ✭ 152 (+27.73%)
BayaderaHigh-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (+187.39%)
OnemkloneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (+2.52%)
OpencoarraysA parallel application binary interface for Fortran 2018 compilers.
Stars: ✭ 151 (+26.89%)
SpeedtorchLibrary for faster pinned CPU <-> GPU transfer in Pytorch
Stars: ✭ 615 (+416.81%)
GraphitGraphIt - A High-Performance Domain Specific Language for Graph Analytics
Stars: ✭ 254 (+113.45%)
SundialsSUNDIALS is a SUite of Nonlinear and DIfferential/ALgebraic equation Solvers. This is a mirror of current releases, and development will move here eventually. Pull requests are welcome for bug fixes and minor changes.
Stars: ✭ 194 (+63.03%)
PyopenclOpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+563.87%)
bifrostA stream processing framework for high-throughput applications.
Stars: ✭ 48 (-59.66%)
TaskflowA General-purpose Parallel and Heterogeneous Task Programming System
Stars: ✭ 6,128 (+5049.58%)
VuhVulkan compute for people
Stars: ✭ 264 (+121.85%)
PynvvlA Python wrapper of NVIDIA Video Loader (NVVL) with CuPy for fast video loading with Python
Stars: ✭ 95 (-20.17%)
course高性能并行编程与优化 - 课件
Stars: ✭ 1,610 (+1252.94%)
CupyNumPy & SciPy for GPU
Stars: ✭ 5,625 (+4626.89%)
KokkosKokkos C++ Performance Portability Programming EcoSystem: The Programming Model - Parallel Execution and Memory Abstraction
Stars: ✭ 744 (+525.21%)
Feelpp💎 Feel++: Finite Element Embedded Language and Library in C++
Stars: ✭ 229 (+92.44%)
opensbliA framework for the automated derivation and parallel execution of finite difference solvers on a range of computer architectures.
Stars: ✭ 56 (-52.94%)
MOTMulti-threaded Optimization Toolbox
Stars: ✭ 28 (-76.47%)
PSycloneDomain-specific compiler for Finite Difference/Volume/Element Earth-system models in Fortran
Stars: ✭ 67 (-43.7%)
ChainerA flexible framework of neural networks for deep learning
Stars: ✭ 5,656 (+4652.94%)
MfemLightweight, general, scalable C++ library for finite element methods
Stars: ✭ 667 (+460.5%)
t8codeParallel algorithms and data structures for tree-based AMR with arbitrary element shapes.
Stars: ✭ 37 (-68.91%)
NeanderthalFast Clojure Matrix Library
Stars: ✭ 927 (+678.99%)
DashDASH, the C++ Template Library for Distributed Data Structures with Support for Hierarchical Locality for HPC and Data-Driven Science
Stars: ✭ 134 (+12.61%)
HipsyclImplementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (+216.81%)
ArraymancerA fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+566.39%)
ParenchymaAn extensible HPC framework for CUDA, OpenCL and native CPU.
Stars: ✭ 71 (-40.34%)
Pytorch gbw lmPyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset
Stars: ✭ 101 (-15.13%)
Adacof PytorchOfficial source code for our paper "AdaCoF: Adaptive Collaboration of Flows for Video Frame Interpolation" (CVPR 2020)
Stars: ✭ 110 (-7.56%)
PygraphistryPyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer
Stars: ✭ 1,365 (+1047.06%)
Neural Style DockerA dockerized version of neural style transfer algorithms
Stars: ✭ 100 (-15.97%)
ThorinThe Higher-Order Intermediate Representation
Stars: ✭ 116 (-2.52%)
Futhark💥💻💥 A data-parallel functional programming language
Stars: ✭ 1,641 (+1278.99%)
DeepnetDeep.Net machine learning framework for F#
Stars: ✭ 99 (-16.81%)
Nvidia Gpu Tensor Core Accelerator Pytorch OpencvA complete machine vision container that includes Jupyter notebooks with built-in code hinting, Anaconda, CUDA-X, TensorRT inference accelerator for Tensor cores, CuPy (GPU drop in replacement for Numpy), PyTorch, TF2, Tensorboard, and OpenCV for accelerated workloads on NVIDIA Tensor cores and GPUs.
Stars: ✭ 110 (-7.56%)
EmuThe write-once-run-anywhere GPGPU library for Rust
Stars: ✭ 1,350 (+1034.45%)
NyuziprocessorGPGPU microprocessor architecture
Stars: ✭ 1,351 (+1035.29%)
DeepwayThis project is an aid to the blind. Till date there has been no technological advancement in the way the blind navigate. So I have used deep learning particularly convolutional neural networks so that they can navigate through the streets.
Stars: ✭ 118 (-0.84%)
Ds bowl 2018Kaggle Data Science Bowl 2018
Stars: ✭ 116 (-2.52%)
ChainercvChainerCV: a Library for Deep Learning in Computer Vision
Stars: ✭ 1,463 (+1129.41%)
Pp4fpgas Cn HlsHLS Project of pp4fpgas - https://github.com/xupsh/pp4fpgas-cn
Stars: ✭ 97 (-18.49%)
PymapdPython client for OmniSci GPU-accelerated SQL engine and analytics platform
Stars: ✭ 109 (-8.4%)
CharmThe Charm++ parallel programming system. Visit https://charmplusplus.org/ for more information.
Stars: ✭ 96 (-19.33%)
BoincOpen-source software for volunteer computing and grid computing.
Stars: ✭ 1,320 (+1009.24%)
MpmCB-Geo High-Performance Material Point Method
Stars: ✭ 115 (-3.36%)
Rife Ncnn VulkanRIFE, Real-Time Intermediate Flow Estimation for Video Frame Interpolation implemented with ncnn library
Stars: ✭ 108 (-9.24%)
NvfancontrolNVidia dynamic fan control for Linux and Windows
Stars: ✭ 93 (-21.85%)
Fastaudio🔊 Audio and fastai v2
Stars: ✭ 93 (-21.85%)
PytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
Stars: ✭ 52,811 (+44278.99%)
NumerNumeric Erlang - vector and matrix operations with CUDA. Heavily inspired by Pteracuda - https://github.com/kevsmith/pteracuda
Stars: ✭ 91 (-23.53%)
ImpalaAn imperative and functional programming language
Stars: ✭ 118 (-0.84%)
IvyThe templated deep learning framework, enabling framework-agnostic functions, layers and libraries.
Stars: ✭ 118 (-0.84%)
JlscaSide-channel toolkit in Julia
Stars: ✭ 114 (-4.2%)
Phenomenon⚡️ A fast 2kB low-level WebGL API.
Stars: ✭ 1,551 (+1203.36%)