LaserThe HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers
Stars: ✭ 191 (-83.14%)
DrlkitA High Level Python Deep Reinforcement Learning library. Great for beginners, prototyping and quickly comparing algorithms
Stars: ✭ 29 (-97.44%)
CupyNumPy & SciPy for GPU
Stars: ✭ 5,625 (+396.47%)
NorseDeep learning with spiking neural networks (SNNs) in PyTorch.
Stars: ✭ 211 (-81.38%)
PytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
Stars: ✭ 52,811 (+4561.17%)
IlgpuILGPU JIT Compiler for high-performance .Net GPU programs
Stars: ✭ 374 (-66.99%)
DiffsharpDiffSharp: Differentiable Functional Programming
Stars: ✭ 365 (-67.78%)
Hyperlearn50% faster, 50% less RAM Machine Learning. Numba rewritten Sklearn. SVD, NNMF, PCA, LinearReg, RidgeReg, Randomized, Truncated SVD/PCA, CSR Matrices all 50+% faster
Stars: ✭ 1,204 (+6.27%)
OccaJIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal
Stars: ✭ 230 (-79.7%)
DeepnetDeep.Net machine learning framework for F#
Stars: ✭ 99 (-91.26%)
Tensorflow Gpu MacosxUnofficial NVIDIA CUDA GPU support version of Google TensorFlow for Mac OS X
Stars: ✭ 103 (-90.91%)
MegengineMegEngine is a fast, scalable, and easy-to-use deep learning framework with support for automatic differentiation
Stars: ✭ 4,081 (+260.19%)
Ocaml TorchOCaml bindings for PyTorch
Stars: ✭ 308 (-72.82%)
Compute.scalaScientific computing with N-dimensional arrays
Stars: ✭ 191 (-83.14%)
LibxsmmLibrary for specialized dense and sparse matrix operations, and deep learning primitives.
Stars: ✭ 518 (-54.28%)
TvmOpen deep learning compiler stack for CPUs, GPUs, and specialized accelerators
Stars: ✭ 7,494 (+561.43%)
HeteroflowConcurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-94.97%)
GpudashboardA simple dashboard for NVIDIA GPU
Stars: ✭ 37 (-96.73%)
Nvidia libs testTests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (-96.82%)
PycudaCUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (-1.85%)
Carlsim3CARLsim is an efficient, easy-to-use, GPU-accelerated software framework for simulating large-scale spiking neural network (SNN) models with a high degree of biological detail.
Stars: ✭ 52 (-95.41%)
TensorlyTensorLy: Tensor Learning in Python.
Stars: ✭ 977 (-13.77%)
RustpythonA Python Interpreter written in Rust
Stars: ✭ 10,261 (+805.65%)
WasmjitSmall Embeddable WebAssembly Runtime
Stars: ✭ 1,063 (-6.18%)
CudaExperiments with CUDA and Rust
Stars: ✭ 31 (-97.26%)
Go TranscodeLive on-demand transcoding in go using ffmpeg. Also with NVIDIA GPU hardware acceleration.
Stars: ✭ 39 (-96.56%)
GpuviewA lightweight web dashboard for monitoring GPU usage
Stars: ✭ 57 (-94.97%)
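AiopenAIOpen is a collection that classifies open-source AI projects by the three elements of artificial intelligence (data, algorithms, computing power). The project aims to track current open-source deep learning (DL) projects in artificial intelligence (AI) and to list as many of them as possible, while also including some code that was studied previously. Through these open-source projects, people new to AI can gain a clearer and more complete understanding of artificial intelligence (deep learning).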
Stars: ✭ 62 (-94.53%)
ComputesharpA .NET 5 library to run C# code in parallel on the GPU through DX12 and dynamically generated HLSL compute shaders, with the goal of making GPU computing easy to use for all .NET developers! 🚀
Stars: ✭ 982 (-13.33%)
SaberWindow-Based Hybrid CPU/GPU Stream Processing Engine
Stars: ✭ 35 (-96.91%)
MttMATLAB Tensor Tools
Stars: ✭ 61 (-94.62%)
AiscmGuile numerical arrays and tensor extension
Stars: ✭ 34 (-97%)
Gpu Spline DeformationBaking spline deformation to a texture then applying it to a mesh via a shader.
Stars: ✭ 52 (-95.41%)
ComputerasterReal-time software rasterizer using compute shaders, including vertex processing stage (IA and vertex shaders), bin rasterization, tile rasterization (coarse rasterization), and pixel rasterization (fine rasterization, which calls the pixel shaders).
Stars: ✭ 32 (-97.18%)
NumericN-dimensional matrix class for Rust
Stars: ✭ 51 (-95.5%)
VkquakeVulkan Quake port based on QuakeSpasm
Stars: ✭ 955 (-15.71%)
VbiosfinderExtract embedded VBIOS from (almost) any BIOS Update
Stars: ✭ 64 (-94.35%)
Cloud VolumeRead and write Neuroglancer datasets programmatically.
Stars: ✭ 63 (-94.44%)
Hr4rExample project - "Hot Reloading 4 RequireJS" front-end web applications & some extra code demonstrating hot-reloading for Node.js Express servers
Stars: ✭ 28 (-97.53%)
Keras object detectionConvert any classification model or architecture trained in keras to an object detection model
Stars: ✭ 28 (-97.53%)
B2dpipe2D Pipeline Compiler.
Stars: ✭ 51 (-95.5%)
CdpvideorecordA video camera with real-time beautification; you can switch the camera position or turn the flash on and off. See the demo for details.
Stars: ✭ 27 (-97.62%)
AlacrittyAlacritty is a modern terminal emulator that comes with sensible defaults, but allows for extensive configuration. By integrating with other applications, rather than reimplementing their functionality, it manages to provide a flexible set of features with high performance. The supported platforms currently consist of BSD, Linux, macOS and Windows.
Stars: ✭ 36,273 (+3101.5%)
RayrayA tiny GPU raytracer, using Zig and WebGPU
Stars: ✭ 59 (-94.79%)
Multiple SmiPython bindings for the pyNVML and psutil libraries over the network
Stars: ✭ 49 (-95.68%)
Autooffload.jlAutomatic GPU, TPU, FPGA, Xeon Phi, Multithreaded, Distributed, etc. offloading for scientific machine learning (SciML) and differential equations
Stars: ✭ 21 (-98.15%)
MetalpetalA GPU accelerated image and video processing framework built on Metal.
Stars: ✭ 907 (-19.95%)
GbrainGPU Javascript Library for Machine Learning
Stars: ✭ 48 (-95.76%)
CubCooperative primitives for CUDA C++.
Stars: ✭ 883 (-22.07%)
GgnnGGNN: State of the Art Graph-based GPU Nearest Neighbor Search
Stars: ✭ 63 (-94.44%)
Clarrays.jlOpenCL-backed GPU Arrays
Stars: ✭ 58 (-94.88%)
Leekscript V2A dynamically typed, compiled just-in-time programming language used in Leek Wars' AIs
Stars: ✭ 46 (-95.94%)