Deep learning toolkit-enabled VLSI placement
A complete, open-source Excel-like calculation engine written in TypeScript. Includes 380+ built-in functions. Maintained by the Handsontable team⚡
Automatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
A highly efficient and modular implementation of Gaussian Processes in PyTorch
GPU-accelerated Levenberg-Marquardt curve fitting in CUDA
Propagation of distributions by Monte-Carlo sampling: Real number types with uncertainty represented by samples.
R package for automation of machine learning, forecasting, feature engineering, model evaluation, model interpretation, data generation, and recommenders.
Vahana VR & VideoStitch Studio: software to create immersive 360° VR video, live and in post-production
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
Fast Neural Machine Translation in C++ - development repository
Efficient Spiking Neural Network framework, built on top of PyTorch for GPU acceleration
BlazingSQL is a lightweight, GPU-accelerated SQL engine for Python. Built on RAPIDS cuDF.
A tool for manipulating and editing AMD VBIOSes.
Deep.Net machine learning framework for F#
The write-once-run-anywhere GPGPU library for Rust
GloVe as a TensorFlow Embedding Layer
Takes a pretrained GloVe model and uses it as a TensorFlow embedding weight layer **inside the GPU**. As a result, only the word indices need to cross the GPU data transfer bus, which reduces data transfer overhead.
Multi-device OpenCL kernel load balancer and pipeliner API for C#. Uses a shared-distributed memory model to keep GPUs updated quickly while running the same kernel on all devices (for simplicity).
Toy CPU and GPU implementations of the Slug rendering algorithm
Concurrent CPU-GPU Programming using Task Models
stdgpu: Efficient STL-like Data Structures on the GPU
GPU-accelerated Deep Learning running natively on Windows 10
⚡️Optimizing einsum functions in NumPy, TensorFlow, Dask, and more with contraction order optimization.
A TensorFlow-inspired neural network library built from scratch in C# 7.3 for .NET Standard 2.0, with GPU support through cuDNN
High-performance Bayesian Data Analysis on the GPU in Clojure
Vulkan compute for people
GPU Accelerated Motion Engine based on Taichi Lang.
Node-based image editor with GPU-acceleration.
Experimental parallel compression algorithm
VegasFlow: accelerating Monte Carlo simulation across multiple hardware platforms
An easy way to use Anime4K in Python
Galaxy generator for Unity 3D, with custom particle distributors, DirectX 11 particles, and highly customizable, curve-driven generation.
NumPy drop-in replacement for Intel(R) XPUs
The Kria Robotics Stack (KRS) is a ROS 2 superset for industry, an integrated set of robot libraries and utilities to accelerate the development, maintenance and commercialization of industrial-grade robotic solutions while using adaptive computing.
🖼️ ActionScript 3, GPU-accelerated 2D game engine using Stage3D
Audio Fingerprinting and Recognition in Python using NVIDIA's CUDA
Massively Parallel Huffman Decoding on GPUs
The primary source code repository for PHCpack, a software package to solve polynomial systems with homotopy continuation methods.
Python implementation of the Apriori and Eclat algorithms, two of the best-known basic algorithms for mining frequent item sets in a set of transactions.
A path tracer based on hardware ray tracing
CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as loop fusion capability and a portable interface for many numerical algorithms. It provides all the basics for anyone wanting to write portable code.
TOD: GPU-accelerated Outlier Detection via Tensor Operations
QUICK: A GPU-enabled ab initio quantum chemistry software package
Crossbow: A Multi-GPU Deep Learning System for Training with Small Batch Sizes
A brian2 extension to simulate spiking neural networks on GPUs
GPU Framework for Radio Astronomical Image Synthesis
High Performance off-screen rendering (OSR) demo using CEF