KttKernel Tuning Toolkit
Stars: ✭ 33 (-91.54%)
CompactcnncascadeA binary library for very fast face detection using compact CNNs.
Stars: ✭ 152 (-61.03%)
Foundations of HPC 2021This repository collects the materials from the course "Foundations of HPC", 2021, at the Data Science and Scientific Computing Department, University of Trieste
Stars: ✭ 22 (-94.36%)
gpuowlGPU Mersenne primality test.
Stars: ✭ 77 (-80.26%)
AparapiThe New Official Aparapi: a framework for executing native Java and Scala code on the GPU.
Stars: ✭ 352 (-9.74%)
Pine🌲 Aimbot powered by real-time object detection with neural networks, GPU accelerated with Nvidia. Optimized for use with CS:GO.
Stars: ✭ 202 (-48.21%)
Ck CaffeCollective Knowledge workflow for Caffe to automate installation across diverse platforms and to collaboratively evaluate and optimize Caffe-based workloads across diverse hardware, software and data sets (compilers, libraries, tools, models, inputs):
Stars: ✭ 192 (-50.77%)
Awesome CudaThis is a list of useful libraries and resources for CUDA development.
Stars: ✭ 274 (-29.74%)
Compute.scalaScientific computing with N-dimensional arrays
Stars: ✭ 191 (-51.03%)
ChlorineDead Simple OpenCL
Stars: ✭ 419 (+7.44%)
gardeniaGARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators
Stars: ✭ 22 (-94.36%)
CUDAfy.NETCUDAfy .NET allows easy development of high performance GPGPU applications completely from the .NET. It's developed in C#.
Stars: ✭ 56 (-85.64%)
learn-gpgpuAlgorithms implemented in CUDA + resources about GPGPU
Stars: ✭ 37 (-90.51%)
vercorsThe VerCors verification toolset for verifying parallel and concurrent software
Stars: ✭ 30 (-92.31%)
OpenclpapersA Collection of Articles and other OpenCL Papers
Stars: ✭ 37 (-90.51%)
libquoDynamic execution environments for coupled, thread-heterogeneous MPI+X applications
Stars: ✭ 21 (-94.62%)
matrix multiplicationParallel Matrix Multiplication Using OpenMP, Phtreads, and MPI
Stars: ✭ 41 (-89.49%)
sycl-benchSYCL Benchmark Suite
Stars: ✭ 30 (-92.31%)
Compute RuntimeIntel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver
Stars: ✭ 593 (+52.05%)
bandicoot-codeBandicoot: GPU accelerator add-on for the Armadillo C++ linear algebra library
Stars: ✭ 21 (-94.62%)
RayTracingRealtime GPU Path tracer based on OpenCL and OpenGL
Stars: ✭ 120 (-69.23%)
yaskYASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-difference methods and similar applications.
Stars: ✭ 81 (-79.23%)
Computecpp SdkCollection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation
Stars: ✭ 239 (-38.72%)
CupochRobotics with GPU computing
Stars: ✭ 225 (-42.31%)
NbodyN body gravity attraction problem solver
Stars: ✭ 40 (-89.74%)
LuxcoreLuxCore source repository
Stars: ✭ 601 (+54.1%)
ctuning-programsCollective Knowledge extension with unified and customizable benchmarks (with extensible JSON meta information) to be easily integrated with customizable and portable Collective Knowledge workflows. You can easily compile and run these benchmarks using different compilers, environments, hardware and OS (Linux, MacOS, Windows, Android). More info:
Stars: ✭ 41 (-89.49%)
wxparaverwxParaver is a trace-based visualization and analysis tool designed to study quantitative detailed metrics and obtain qualitative knowledge of the performance of applications, libraries, processors and whole architectures.
Stars: ✭ 23 (-94.1%)
allgebraBase container for developing C++ and Fortran HPC applications
Stars: ✭ 14 (-96.41%)
Abyss🔬 Assemble large genomes using short reads
Stars: ✭ 219 (-43.85%)
OnednnoneAPI Deep Neural Network Library (oneDNN)
Stars: ✭ 2,600 (+566.67%)
hpcLearning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )
Stars: ✭ 39 (-90%)
hipercHigh Performance Computing Strategies for Boundary Value Problems
Stars: ✭ 36 (-90.77%)
Amplifier.NETAmplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPU/GPU, NVIDIA, AMD without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.
Stars: ✭ 142 (-63.59%)
Primecount🚀 Fast prime counting function implementations
Stars: ✭ 193 (-50.51%)
CLUThe OpenCL Utility library
Stars: ✭ 18 (-95.38%)
boxtreeQuad/octree building for FMMs in Python and OpenCL
Stars: ✭ 52 (-86.67%)
john-packagesCommunity packages of John the Ripper (a Docker image, a Flatpak, a Windows PortableApp, and Ubuntu SNAP packages)
Stars: ✭ 31 (-92.05%)
rectdetectRealtime rectangle detector with GPGPU
Stars: ✭ 51 (-86.92%)
mbsolveAn open-source solver tool for the Maxwell-Bloch equations.
Stars: ✭ 14 (-96.41%)
slibsSingle file libraries for C/C++
Stars: ✭ 80 (-79.49%)
hipaccA domain-specific language and compiler for image processing
Stars: ✭ 72 (-81.54%)
Ctranslate2Fast inference engine for OpenNMT models
Stars: ✭ 140 (-64.1%)
dbcsrDBCSR: Distributed Block Compressed Sparse Row matrix library
Stars: ✭ 65 (-83.33%)
FGPUNo description or website provided.
Stars: ✭ 30 (-92.31%)
SoliditySHA3MinerAll-in-one mixed multi-GPU (nVidia, AMD, Intel) & CPU miner solves proof of work to mine supported EIP918 tokens in a single instance (with API).
Stars: ✭ 28 (-92.82%)
hpdbscanHighly parallel DBSCAN (HPDBSCAN)
Stars: ✭ 19 (-95.13%)
tbslasA parallel, fast solver for the scalar advection-diffusion and the incompressible Navier-Stokes equations based on semi-Lagrangian/Volume-Integral method.
Stars: ✭ 21 (-94.62%)
euler2d kokkosSimple 2d finite volume solver for Euler equations using c++ kokkos library
Stars: ✭ 27 (-93.08%)
nodeGPU-accelerated data science and visualization in node
Stars: ✭ 85 (-78.21%)
TimemoryModular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template API is essentially a framework to creating tools: it is designed to provide a unifying interface for recording various performance measurements alongside data logging and interfaces to other tools.
Stars: ✭ 192 (-50.77%)
Xray Oxygen🌀 Oxygen Engine 2.0. [Preview] Discord: https://discord.gg/P3aMf66
Stars: ✭ 481 (+23.33%)
MatXAn efficient C++17 GPU numerical computing library with Python-like syntax
Stars: ✭ 418 (+7.18%)
pyccelPython extension language using accelerators
Stars: ✭ 189 (-51.54%)
BayaderaHigh-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (-12.31%)
ufo-coreGLib-based framework for GPU-based data processing
Stars: ✭ 20 (-94.87%)
Torstenlibrary of C++ functions that support applications of Stan in Pharmacometrics
Stars: ✭ 38 (-90.26%)