cuda memtest
Fork of CUDA GPU memtest 👓
Stars: ✭ 68 (-54.36%)
monolish
monolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
Stars: ✭ 166 (+11.41%)
dbcsr
DBCSR: Distributed Block Compressed Sparse Row matrix library
Stars: ✭ 65 (-56.38%)
Arraymancer
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+432.21%)
MatX
An efficient C++17 GPU numerical computing library with Python-like syntax
Stars: ✭ 418 (+180.54%)
Relion
Image-processing software for cryo-electron microscopy
Stars: ✭ 219 (+46.98%)
fml
Fused Matrix Library
Stars: ✭ 24 (-83.89%)
Nsimd
Agenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (-7.38%)
gpubootcamp
This repository consists of GPU bootcamp material for HPC and AI
Stars: ✭ 227 (+52.35%)
rbcuda
CUDA bindings for Ruby
Stars: ✭ 57 (-61.74%)
mbsolve
An open-source solver tool for the Maxwell-Bloch equations.
Stars: ✭ 14 (-90.6%)
Arrayfire
ArrayFire: a general purpose GPU library.
Stars: ✭ 3,693 (+2378.52%)
Autodock Gpu
AutoDock for GPUs and other accelerators
Stars: ✭ 65 (-56.38%)
Nvidia libs test
Tests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (-75.84%)
CARE
CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as loop fusion capability and a portable interface for many numerical algorithms. It provides all the basics for anyone wanting to write portable code.
Stars: ✭ 22 (-85.23%)
Occa
JIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal
Stars: ✭ 230 (+54.36%)
PartitionedArrays.jl
Vectors and sparse matrices partitioned into pieces for parallel distributed-memory computations.
Stars: ✭ 45 (-69.8%)
allgebra
Base container for developing C++ and Fortran HPC applications
Stars: ✭ 14 (-90.6%)
PyMFEM
Python wrapper for MFEM
Stars: ✭ 91 (-38.93%)
float
Single precision (float) matrices for R.
Stars: ✭ 41 (-72.48%)
Onemkl
oneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (-18.12%)
Armadillo Code
Armadillo: fast C++ library for linear algebra & scientific computing - http://arma.sourceforge.net
Stars: ✭ 388 (+160.4%)
Accelerate
Embedded language for high-performance array computations
Stars: ✭ 751 (+404.03%)
Stdgpu
stdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+256.38%)
Neanderthal
Fast Clojure Matrix Library
Stars: ✭ 927 (+522.15%)
Sixtyfour
How fast can we brute force a 64-bit comparison?
Stars: ✭ 41 (-72.48%)
Heteroflow
Concurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-61.74%)
Clojurecuda
Clojure library for CUDA development
Stars: ✭ 158 (+6.04%)
LvArray
Portable HPC Containers (C++)
Stars: ✭ 37 (-75.17%)
pressio
Model reduction for linear and nonlinear dynamical systems: core C++ library
Stars: ✭ 35 (-76.51%)
pbdML
No description or website provided.
Stars: ✭ 13 (-91.28%)
Tutorials
Some basic programming tutorials
Stars: ✭ 353 (+136.91%)
HiSpatialCluster
Clustering spatial points with the Fast Search algorithm, with high-performance computing implementations in CUDA or in parallel on the CPU, runnable from standalone Python or ArcGIS.
Stars: ✭ 31 (-79.19%)
Strumpack
Structured Matrix Package (LBNL)
Stars: ✭ 57 (-61.74%)
GOSH
An ultra-fast, GPU-based large graph embedding algorithm utilizing a novel coarsening algorithm requiring not more than a single GPU.
Stars: ✭ 12 (-91.95%)
HeCBench
software.intel.com/content/www/us/en/develop/articles/repo-evaluating-performance-productivity-oneapi.html
Stars: ✭ 85 (-42.95%)
Pycuda
CUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (+646.31%)
Parenchyma
An extensible HPC framework for CUDA, OpenCL and native CPU.
Stars: ✭ 71 (-52.35%)
Cudamat
Python module for performing basic dense linear algebra computations on the GPU using CUDA.
Stars: ✭ 554 (+271.81%)
Deepnet
Deep.Net machine learning framework for F#
Stars: ✭ 99 (-33.56%)
Hipsycl
Implementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (+153.02%)
Cuda Api Wrappers
Thin C++-flavored wrappers for the CUDA Runtime API
Stars: ✭ 362 (+142.95%)
Arrayfire Python
Python bindings for ArrayFire: A general purpose GPU library.
Stars: ✭ 358 (+140.27%)
Luxcore
LuxCore source repository
Stars: ✭ 601 (+303.36%)
Bayadera
High-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (+129.53%)
Ktt
Kernel Tuning Toolkit
Stars: ✭ 33 (-77.85%)
Hiop
HPC solver for nonlinear optimization problems
Stars: ✭ 75 (-49.66%)
Futhark
💥💻💥 A data-parallel functional programming language
Stars: ✭ 1,641 (+1001.34%)
Eigen Git Mirror
THIS MIRROR IS DEPRECATED -- New url: https://gitlab.com/libeigen/eigen
Stars: ✭ 1,659 (+1013.42%)
Magicl
Matrix Algebra proGrams In Common Lisp.
Stars: ✭ 140 (-6.04%)
Mixbench
A GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL)
Stars: ✭ 130 (-12.75%)
Pysnn
Efficient Spiking Neural Network framework, built on top of PyTorch for GPU acceleration
Stars: ✭ 129 (-13.42%)
Node Sylvester
🐱 Sylvester is a vector, matrix, and geometry library for JavaScript that runs in the browser and on the server.
Stars: ✭ 144 (-3.36%)
Visit
VisIt - Visualization and Data Analysis for Mesh-based Scientific Data
Stars: ✭ 140 (-6.04%)
Agency
Execution primitives for C++
Stars: ✭ 127 (-14.77%)