CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as loop fusion capability and a portable interface for many numerical algorithms. It provides all the basics for anyone wanting to write portable code.

Stars: ✭ 22 (-86.08%)

Mutual labels: gpu-acceleration, gpu-computing

Pycuda

CUDA integration for Python, plus shiny features

Stars: ✭ 1,112 (+603.8%)

Mutual labels: cuda, gpu-computing

Autodock Gpu

AutoDock for GPUs and other accelerators

Stars: ✭ 65 (-58.86%)

Mutual labels: cuda, gpu-computing

Accelerate Llvm

LLVM backend for Accelerate

Stars: ✭ 134 (-15.19%)

Mutual labels: cuda, gpu-computing

Gpufit

GPU-accelerated Levenberg-Marquardt curve fitting in CUDA

Stars: ✭ 174 (+10.13%)

Mutual labels: gpu-acceleration, gpu-computing

Arraymancer

A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends

Stars: ✭ 793 (+401.9%)

Mutual labels: cuda, gpu-computing

GOSH

An ultra-fast, GPU-based large graph embedding algorithm utilizing a novel coarsening algorithm requiring not more than a single GPU.

Stars: ✭ 12 (-92.41%)

Mutual labels: cuda, gpu-computing

Accelerate

Embedded language for high-performance array computations

Stars: ✭ 751 (+375.32%)

Mutual labels: cuda, gpu-computing

Cekirdekler

Multi-device OpenCL kernel load balancer and pipeliner API for C#. Uses shared-distributed memory model to keep GPUs updated fast while using same kernel on all devices(for simplicity).

Stars: ✭ 76 (-51.9%)

Mutual labels: gpu-acceleration, gpu-computing

Emu

The write-once-run-anywhere GPGPU library for Rust

Stars: ✭ 1,350 (+754.43%)

Mutual labels: gpu-acceleration, gpu-computing

Nsimd

Agenium Scale vectorization library for CPUs and GPUs

Stars: ✭ 138 (-12.66%)

Mutual labels: cuda

Lantern

Stars: ✭ 150 (-5.06%)

Mutual labels: cuda

Fastflow

FastFlow pattern-based parallel programming framework (formerly on sourceforge)

Stars: ✭ 137 (-13.29%)

Mutual labels: gpu-computing

3dunderworld Sls Gpu cpu

A structured light scanner

Stars: ✭ 157 (-0.63%)

Mutual labels: cuda

Spanet

Spatial Attentive Single-Image Deraining with a High Quality Real Rain Dataset (CVPR'19)

Stars: ✭ 136 (-13.92%)

Mutual labels: cuda

Go Sessions

🔐 The sessions manager for the Go Programming Language. Supports both net/http and fasthttp.

Stars: ✭ 134 (-15.19%)

Mutual labels: high-performance

Cuda Cnn

CNN accelerated by cuda. Test on mnist and finilly get 99.76%

Stars: ✭ 148 (-6.33%)

Mutual labels: cuda

Partial Order Pruning

Partial Order Pruning: for Best Speed/Accuracy Trade-off in Neural Architecture Search

Stars: ✭ 135 (-14.56%)

Mutual labels: cuda

Clvk

Experimental implementation of OpenCL on Vulkan

Stars: ✭ 158 (+0%)

Mutual labels: gpu-computing

Rmm

RAPIDS Memory Manager

Stars: ✭ 154 (-2.53%)

Mutual labels: cuda

Stitchem

Vahana VR & VideoStitch Studio: software to create immersive 360° VR video, live and in post-production

Stars: ✭ 147 (-6.96%)

Mutual labels: gpu-acceleration

Lindb

LinDB is a scalable, high performance, high availability distributed time series database.

Stars: ✭ 2,105 (+1232.28%)

Mutual labels: high-performance

Sketchgraphs

A dataset of 15 million CAD sketches with geometric constraint graphs.

Stars: ✭ 148 (-6.33%)

Mutual labels: cuda

Hedgehog Lab

Run, compile and execute JavaScript for Scientific Computing and Data Visualization TOTALLY TOTALLY TOTALLY in your BROWSER! An open source scientific computing environment for JavaScript TOTALLY in your browser, matrix operations with GPU acceleration, TeX support, data visualization and symbolic computation.

Stars: ✭ 1,797 (+1037.34%)

Mutual labels: gpu-acceleration

Nnvm

No description or website provided.

Stars: ✭ 1,639 (+937.34%)

Mutual labels: cuda

Dlaf

Diffusion-limited aggregation, fast.

Stars: ✭ 156 (-1.27%)

Mutual labels: high-performance

Optical Flow Filter

A real time optical flow algorithm implemented on GPU