Arraymancer
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+5.59%)
artic
The AlteRnaTive Impala Compiler
Stars: ✭ 16 (-97.87%)
Clojurecuda
Clojure library for CUDA development
Stars: ✭ 158 (-78.96%)
Stdgpu
stdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (-29.29%)
Ctranslate2
Fast inference engine for OpenNMT models
Stars: ✭ 140 (-81.36%)
HeCBench
software.intel.com/content/www/us/en/develop/articles/repo-evaluating-performance-productivity-oneapi.html
Stars: ✭ 85 (-88.68%)
Pycuda
CUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (+48.07%)
Autodock Gpu
AutoDock for GPUs and other accelerators
Stars: ✭ 65 (-91.34%)
Libcudacxx
The C++ Standard Library for your entire system.
Stars: ✭ 1,861 (+147.8%)
OpenPH
Parallel reduction of boundary matrices for Persistent Homology with CUDA
Stars: ✭ 14 (-98.14%)
GOSH
An ultra-fast, GPU-based large graph embedding algorithm utilizing a novel coarsening algorithm requiring not more than a single GPU.
Stars: ✭ 12 (-98.4%)
Taskflow
A General-purpose Parallel and Heterogeneous Task Programming System
Stars: ✭ 6,128 (+715.98%)
cuda memtest
Fork of CUDA GPU memtest 👓
Stars: ✭ 68 (-90.95%)
vuo
A realtime visual programming language for interactive media.
Stars: ✭ 103 (-86.28%)
Onemkl
oneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (-83.75%)
Nvidia libs test
Tests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (-95.21%)
Sixtyfour
How fast can we brute force a 64-bit comparison?
Stars: ✭ 41 (-94.54%)
Ginkgo
Numerical linear algebra software package
Stars: ✭ 149 (-80.16%)
Heteroflow
Concurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-92.41%)
Bohrium
Automatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Stars: ✭ 209 (-72.17%)
Numba
NumPy aware dynamic Python compiler using LLVM
Stars: ✭ 7,090 (+844.07%)
learn-gpgpu
Algorithms implemented in CUDA + resources about GPGPU
Stars: ✭ 37 (-95.07%)
opensbli
A framework for the automated derivation and parallel execution of finite difference solvers on a range of computer architectures.
Stars: ✭ 56 (-92.54%)
MatX
An efficient C++17 GPU numerical computing library with Python-like syntax
Stars: ✭ 418 (-44.34%)
mbsolve
An open-source solver tool for the Maxwell-Bloch equations.
Stars: ✭ 14 (-98.14%)
Tutorials
Some basic programming tutorials
Stars: ✭ 353 (-53%)
Pyopencl
OpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+5.19%)
Deepnet
Deep.Net machine learning framework for F#
Stars: ✭ 99 (-86.82%)
Fast
A framework for GPU based high-performance medical image processing and visualization
Stars: ✭ 179 (-76.17%)
gardenia
GARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators
Stars: ✭ 22 (-97.07%)
Parenchyma
An extensible HPC framework for CUDA, OpenCL and native CPU.
Stars: ✭ 71 (-90.55%)
Bayadera
High-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (-54.46%)
Neanderthal
Fast Clojure Matrix Library
Stars: ✭ 927 (+23.44%)
PyMFEM
Python wrapper for MFEM
Stars: ✭ 91 (-87.88%)
Hipsycl
Implementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (-49.8%)
rbcuda
CUDA bindings for Ruby
Stars: ✭ 57 (-92.41%)
Cuda Api Wrappers
Thin C++-flavored wrappers for the CUDA Runtime API
Stars: ✭ 362 (-51.8%)
Luxcore
LuxCore source repository
Stars: ✭ 601 (-19.97%)
Chainer
A flexible framework of neural networks for deep learning
Stars: ✭ 5,656 (+653.13%)
Thundergbm
ThunderGBM: Fast GBDTs and Random Forests on GPUs
Stars: ✭ 586 (-21.97%)
Trtorch
PyTorch/TorchScript compiler for NVIDIA GPUs using TensorRT
Stars: ✭ 583 (-22.37%)
Gunrock
High-Performance Graph Primitives on GPUs
Stars: ✭ 718 (-4.39%)
Gocaml
🐫 Practical statically typed functional programming language implementation with Go and LLVM
Stars: ✭ 653 (-13.05%)
Xmrig Nvidia
Monero (XMR) NVIDIA miner
Stars: ✭ 560 (-25.43%)
Kratos
Kratos Multiphysics (a.k.a. Kratos) is a framework for building parallel multi-disciplinary simulation software. Modularity, extensibility and HPC are the main objectives. Kratos has a BSD license and is written in C++ with an extensive Python interface.
Stars: ✭ 558 (-25.7%)
Adaptive
📈 Adaptive: parallel active learning of mathematical functions
Stars: ✭ 646 (-13.98%)
Cudasift
A CUDA implementation of SIFT for NVidia GPUs (1.2 ms on a GTX 1060)
Stars: ✭ 555 (-26.1%)
Cudamat
Python module for performing basic dense linear algebra computations on the GPU using CUDA.
Stars: ✭ 554 (-26.23%)
Kubernetes Gpu Guide
This guide should help fellow researchers and hobbyists easily automate and accelerate their deep learning training with their own Kubernetes GPU cluster.
Stars: ✭ 740 (-1.46%)
Llvmswift
A Swift wrapper for the LLVM C API (version 9.0.1)
Stars: ✭ 641 (-14.65%)
Llvmsharp
LLVM bindings for .NET Standard written in C# using ClangSharp
Stars: ✭ 553 (-26.36%)
Tigress protection
Playing with the Tigress binary protection. Break some of its protections and solve some of its challenges. Automatic deobfuscation using symbolic execution, taint analysis and LLVM.
Stars: ✭ 550 (-26.76%)
Mc Cnn
Stereo Matching by Training a Convolutional Neural Network to Compare Image Patches
Stars: ✭ 638 (-15.05%)
Cupy
NumPy & SciPy for GPU
Stars: ✭ 5,625 (+649%)
Lighthouse2
Lighthouse 2 framework for real-time ray tracing
Stars: ✭ 542 (-27.83%)
Cuda Convnet2
Automatically exported from code.google.com/p/cuda-convnet2
Stars: ✭ 690 (-8.12%)
Remill
Library for lifting of x86, amd64, and aarch64 machine code to LLVM bitcode
Stars: ✭ 633 (-15.71%)