AccelerateEmbedded language for high-performance array computations
Stars: ✭ 751 (+460.45%)
ArraymancerA fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+491.79%)
NeanderthalFast Clojure Matrix Library
Stars: ✭ 927 (+591.79%)
OpenPHParallel reduction of boundary matrices for Persistent Homology with CUDA
Stars: ✭ 14 (-89.55%)
LibcudacxxThe C++ Standard Library for your entire system.
Stars: ✭ 1,861 (+1288.81%)
BayaderaHigh-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (+155.22%)
PyMFEMPython wrapper for MFEM
Stars: ✭ 91 (-32.09%)
BohriumAutomatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Stars: ✭ 209 (+55.97%)
opensbliA framework for the automated derivation and parallel execution of finite difference solvers on a range of computer architectures.
Stars: ✭ 56 (-58.21%)
TutorialsSome basic programming tutorials
Stars: ✭ 353 (+163.43%)
TaskflowA General-purpose Parallel and Heterogeneous Task Programming System
Stars: ✭ 6,128 (+4473.13%)
LuxcoreLuxCore source repository
Stars: ✭ 601 (+348.51%)
HeteroflowConcurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-57.46%)
articThe AlteRnaTive Impala Compiler
Stars: ✭ 16 (-88.06%)
rbcudaCUDA bindings for Ruby
Stars: ✭ 57 (-57.46%)
PyopenclOpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+489.55%)
ClojurecudaClojure library for CUDA development
Stars: ✭ 158 (+17.91%)
Ctranslate2Fast inference engine for OpenNMT models
Stars: ✭ 140 (+4.48%)
learn-gpgpuAlgorithms implemented in CUDA + resources about GPGPU
Stars: ✭ 37 (-72.39%)
FastA framework for GPU based high-performance medical image processing and visualization
Stars: ✭ 179 (+33.58%)
MatXAn efficient C++17 GPU numerical computing library with Python-like syntax
Stars: ✭ 418 (+211.94%)
mbsolveAn open-source solver tool for the Maxwell-Bloch equations.
Stars: ✭ 14 (-89.55%)
OnemkloneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (-8.96%)
Cuda Api WrappersThin C++-flavored wrappers for the CUDA Runtime API
Stars: ✭ 362 (+170.15%)
Nvidia libs testTests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (-73.13%)
SixtyfourHow fast can we brute force a 64-bit comparison?
Stars: ✭ 41 (-69.4%)
PycudaCUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (+729.85%)
HeCBenchsoftware.intel.com/content/www/us/en/develop/articles/repo-evaluating-performance-productivity-oneapi.html
Stars: ✭ 85 (-36.57%)
vuoA realtime visual programming language for interactive media.
Stars: ✭ 103 (-23.13%)
cuda memtestFork of CUDA GPU memtest 👓
Stars: ✭ 68 (-49.25%)
gardeniaGARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators
Stars: ✭ 22 (-83.58%)
HipsyclImplementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (+181.34%)
Stdgpustdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+296.27%)
ParenchymaAn extensible HPC framework for CUDA, OpenCL and native CPU.
Stars: ✭ 71 (-47.01%)
GinkgoNumerical linear algebra software package
Stars: ✭ 149 (+11.19%)
GOSHAn ultra-fast, GPU-based large graph embedding algorithm utilizing a novel coarsening algorithm requiring not more than a single GPU.
Stars: ✭ 12 (-91.04%)
NumbaNumPy aware dynamic Python compiler using LLVM
Stars: ✭ 7,090 (+5191.04%)
Autodock GpuAutoDock for GPUs and other accelerators
Stars: ✭ 65 (-51.49%)
DeepnetDeep.Net machine learning framework for F#
Stars: ✭ 99 (-26.12%)
VoltaCompiler for the Volt Programming Language
Stars: ✭ 118 (-11.94%)
Tensorflow Optimized WheelsTensorFlow wheels built for latest CUDA/CuDNN and enabled performance flags: SSE, AVX, FMA; XLA
Stars: ✭ 118 (-11.94%)
SpocStream Processing with OCaml
Stars: ✭ 115 (-14.18%)
FcisFully Convolutional Instance-aware Semantic Segmentation
Stars: ✭ 1,563 (+1066.42%)
MtensorA C++ Cuda Tensor Lazy Computing Library
Stars: ✭ 115 (-14.18%)
CltuneCLTune: An automatic OpenCL & CUDA kernel tuner
Stars: ✭ 114 (-14.93%)
Coreparallel finite element unstructured meshes
Stars: ✭ 124 (-7.46%)
Pytorch spnExtension package for spatial propagation network in pytorch.
Stars: ✭ 114 (-14.93%)
HikariLLVM Obfuscator
Stars: ✭ 1,585 (+1082.84%)
BatchtoolsTools for computation on batch systems
Stars: ✭ 127 (-5.22%)
Tensorflow Object Detection TutorialThe purpose of this tutorial is to learn how to install and prepare TensorFlow framework to train your own convolutional neural network object detection classifier for multiple objects, starting from scratch
Stars: ✭ 113 (-15.67%)
Pytorch Unflow a reimplementation of UnFlow in PyTorch that matches the official TensorFlow version
Stars: ✭ 113 (-15.67%)
Llvm UtilsLLVM/Clang for Visual Studio 2019, 2017, 2015, 2013, 2012 and 2010. clang-cl for Python3 distutils. Utils for Clang Static Analyzer
Stars: ✭ 123 (-8.21%)
Flaxgeneral purpose programming language, in the vein of C++
Stars: ✭ 111 (-17.16%)
BrainAn esoteric programming language compiler on top of LLVM based on Brainfuck
Stars: ✭ 112 (-16.42%)