BabelstreamSTREAM, for lots of devices written in many programming models
Stars: ✭ 121 (-68.97%)
VexclVexCL is a C++ vector expression template library for OpenCL/CUDA/OpenMP
Stars: ✭ 626 (+60.51%)
ArrayfireArrayFire: a general purpose GPU library.
Stars: ✭ 3,693 (+846.92%)
OccaJIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal
Stars: ✭ 230 (-41.03%)
JohnJohn the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs
Stars: ✭ 5,656 (+1350.26%)
ArraymancerA fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+103.33%)
LoopyA code generator for array-based code on CPUs and GPUs
Stars: ✭ 367 (-5.9%)
IlgpuILGPU JIT Compiler for high-performance .Net GPU programs
Stars: ✭ 374 (-4.1%)
Arrayfire PythonPython bindings for ArrayFire: A general purpose GPU library.
Stars: ✭ 358 (-8.21%)
SpocStream Processing with OCaml
Stars: ✭ 115 (-70.51%)
pystellaA code generator for grid-based PDE solving on CPUs and GPUs
Stars: ✭ 18 (-95.38%)
HeCBenchsoftware.intel.com/content/www/us/en/develop/articles/repo-evaluating-performance-productivity-oneapi.html
Stars: ✭ 85 (-78.21%)
PyopenclOpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+102.56%)
NeanderthalFast Clojure Matrix Library
Stars: ✭ 927 (+137.69%)
gpubootcampThis repository consists for gpu bootcamp material for HPC and AI
Stars: ✭ 227 (-41.79%)
BitcrackerBitCracker is the first open source password cracking tool for memory units encrypted with BitLocker
Stars: ✭ 463 (+18.72%)
crowdsource-video-experiments-on-androidCrowdsourcing video experiments (such as collaborative benchmarking and optimization of DNN algorithms) using Collective Knowledge Framework across diverse Android devices provided by volunteers. Results are continuously aggregated in the open repository:
Stars: ✭ 29 (-92.56%)
KernelsThis is a set of simple programs that can be used to explore the features of a parallel platform.
Stars: ✭ 287 (-26.41%)
EFDCPluswww.eemodelingsystem.com
Stars: ✭ 9 (-97.69%)
Futhark💥💻💥 A data-parallel functional programming language
Stars: ✭ 1,641 (+320.77%)
monolishmonolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
Stars: ✭ 166 (-57.44%)
ParenchymaAn extensible HPC framework for CUDA, OpenCL and native CPU.
Stars: ✭ 71 (-81.79%)
Stdgpustdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+36.15%)
HashcatWorld's fastest and most advanced password recovery utility
Stars: ✭ 11,014 (+2724.1%)
EdgeExtreme-scale Discontinuous Galerkin Environment (EDGE)
Stars: ✭ 18 (-95.38%)
HipsyclImplementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (-3.33%)
CUDAfy.NETCUDAfy .NET allows easy development of high performance GPGPU applications completely from the .NET. It's developed in C#.
Stars: ✭ 56 (-85.64%)
hipercHigh Performance Computing Strategies for Boundary Value Problems
Stars: ✭ 36 (-90.77%)
gpuowlGPU Mersenne primality test.
Stars: ✭ 77 (-80.26%)
Cuda Api WrappersThin C++-flavored wrappers for the CUDA Runtime API
Stars: ✭ 362 (-7.18%)
learn-gpgpuAlgorithms implemented in CUDA + resources about GPGPU
Stars: ✭ 37 (-90.51%)
gardeniaGARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators
Stars: ✭ 22 (-94.36%)
3GPU-accelerated micromagnetic simulator
Stars: ✭ 324 (-16.92%)
Foundations of HPC 2021This repository collects the materials from the course "Foundations of HPC", 2021, at the Data Science and Scientific Computing Department, University of Trieste
Stars: ✭ 22 (-94.36%)
hpcLearning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )
Stars: ✭ 39 (-90%)
Amplifier.NETAmplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPU/GPU, NVIDIA, AMD without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.
Stars: ✭ 142 (-63.59%)
bandicoot-codeBandicoot: GPU accelerator add-on for the Armadillo C++ linear algebra library
Stars: ✭ 21 (-94.62%)
libquoDynamic execution environments for coupled, thread-heterogeneous MPI+X applications
Stars: ✭ 21 (-94.62%)
CLUThe OpenCL Utility library
Stars: ✭ 18 (-95.38%)
matrix multiplicationParallel Matrix Multiplication Using OpenMP, Phtreads, and MPI
Stars: ✭ 41 (-89.49%)
sycl-benchSYCL Benchmark Suite
Stars: ✭ 30 (-92.31%)
boxtreeQuad/octree building for FMMs in Python and OpenCL
Stars: ✭ 52 (-86.67%)
john-packagesCommunity packages of John the Ripper (a Docker image, a Flatpak, a Windows PortableApp, and Ubuntu SNAP packages)
Stars: ✭ 31 (-92.05%)
RayTracingRealtime GPU Path tracer based on OpenCL and OpenGL
Stars: ✭ 120 (-69.23%)
yaskYASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-difference methods and similar applications.
Stars: ✭ 81 (-79.23%)
rectdetectRealtime rectangle detector with GPGPU
Stars: ✭ 51 (-86.92%)
nodeGPU-accelerated data science and visualization in node
Stars: ✭ 85 (-78.21%)
Armadillo CodeArmadillo: fast C++ library for linear algebra & scientific computing - http://arma.sourceforge.net
Stars: ✭ 388 (-0.51%)
wxparaverwxParaver is a trace-based visualization and analysis tool designed to study quantitative detailed metrics and obtain qualitative knowledge of the performance of applications, libraries, processors and whole architectures.
Stars: ✭ 23 (-94.1%)
AparapiThe New Official Aparapi: a framework for executing native Java and Scala code on the GPU.
Stars: ✭ 352 (-9.74%)
allgebraBase container for developing C++ and Fortran HPC applications
Stars: ✭ 14 (-96.41%)
ctuning-programsCollective Knowledge extension with unified and customizable benchmarks (with extensible JSON meta information) to be easily integrated with customizable and portable Collective Knowledge workflows. You can easily compile and run these benchmarks using different compilers, environments, hardware and OS (Linux, MacOS, Windows, Android). More info:
Stars: ✭ 41 (-89.49%)
tbslasA parallel, fast solver for the scalar advection-diffusion and the incompressible Navier-Stokes equations based on semi-Lagrangian/Volume-Integral method.
Stars: ✭ 21 (-94.62%)
FGPUNo description or website provided.
Stars: ✭ 30 (-92.31%)
slibsSingle file libraries for C/C++
Stars: ✭ 80 (-79.49%)
SoliditySHA3MinerAll-in-one mixed multi-GPU (nVidia, AMD, Intel) & CPU miner solves proof of work to mine supported EIP918 tokens in a single instance (with API).
Stars: ✭ 28 (-92.82%)
dbcsrDBCSR: Distributed Block Compressed Sparse Row matrix library
Stars: ✭ 65 (-83.33%)