CupochRobotics with GPU computing
Stars: ✭ 225 (+54.11%)
Arrayfire PythonPython bindings for ArrayFire: A general purpose GPU library.
Stars: ✭ 358 (+145.21%)
NeanderthalFast Clojure Matrix Library
Stars: ✭ 927 (+534.93%)
Awesome CudaThis is a list of useful libraries and resources for CUDA development.
Stars: ✭ 274 (+87.67%)
MatXAn efficient C++17 GPU numerical computing library with Python-like syntax
Stars: ✭ 418 (+186.3%)
HipsyclImplementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (+158.22%)
PlotoptixData visualisation in Python based on OptiX 7.2 ray tracing framework.
Stars: ✭ 252 (+72.6%)
IlgpuILGPU JIT Compiler for high-performance .Net GPU programs
Stars: ✭ 374 (+156.16%)
tiny-cuda-nnLightning fast & tiny C++/CUDA neural network framework
Stars: ✭ 908 (+521.92%)
OccaJIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal
Stars: ✭ 230 (+57.53%)
Futhark💥💻💥 A data-parallel functional programming language
Stars: ✭ 1,641 (+1023.97%)
Stdgpustdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+263.7%)
BitcrackerBitCracker is the first open source password cracking tool for memory units encrypted with BitLocker
Stars: ✭ 463 (+217.12%)
Cuda Api WrappersThin C++-flavored wrappers for the CUDA Runtime API
Stars: ✭ 362 (+147.95%)
ParenchymaAn extensible HPC framework for CUDA, OpenCL and native CPU.
Stars: ✭ 71 (-51.37%)
briefmatchBriefMatch real-time GPU optical flow
Stars: ✭ 36 (-75.34%)
ArrayfireArrayFire: a general purpose GPU library.
Stars: ✭ 3,693 (+2429.45%)
Lighthouse2Lighthouse 2 framework for real-time ray tracing
Stars: ✭ 542 (+271.23%)
MetalpetalA GPU accelerated image and video processing framework built on Metal.
Stars: ✭ 907 (+521.23%)
AiopenAIOpen是一个按人工智能三要素(数据、算法、算力)进行AI开源项目分类的汇集项目,项目致力于跟踪目前人工智能(AI)的深度学习(DL)开源项目,并尽可能地罗列目前的开源项目,同时加入了一些曾经研究过的代码。通过这些开源项目,使初次接触AI的人们对人工智能(深度学习)有更清晰和更全面的了解。
Stars: ✭ 62 (-57.53%)
ArboretumGradient Boosting powered by GPU(NVIDIA CUDA)
Stars: ✭ 64 (-56.16%)
Gdax Orderbook MlApplication of machine learning to the Coinbase (GDAX) orderbook
Stars: ✭ 60 (-58.9%)
ComputeA C++ GPU Computing Library for OpenCL
Stars: ✭ 1,192 (+716.44%)
Cudart.jlJulia wrapper for CUDA runtime API
Stars: ✭ 75 (-48.63%)
HeteroflowConcurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-60.96%)
Carlsim3CARLsim is an efficient, easy-to-use, GPU-accelerated software framework for simulating large-scale spiking neural network (SNN) models with a high degree of biological detail.
Stars: ✭ 52 (-64.38%)
PycudaCUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (+661.64%)
GbrainGPU Javascript Library for Machine Learning
Stars: ✭ 48 (-67.12%)
GgnnGGNN: State of the Art Graph-based GPU Nearest Neighbor Search
Stars: ✭ 63 (-56.85%)
Tsne CudaGPU Accelerated t-SNE for CUDA with Python bindings
Stars: ✭ 1,120 (+667.12%)
Ab3dmot(IROS 2020, ECCVW 2020) Official Python Implementation for "3D Multi-Object Tracking: A Baseline and New Evaluation Metrics"
Stars: ✭ 1,032 (+606.85%)
Aardvark.renderingThe dependency-aware, high-performance aardvark rendering engine. This repo is part of aardvark - an open-source platform for visual computing, real-time graphics and visualization.
Stars: ✭ 79 (-45.89%)
Cuda Design PatternsSome CUDA design patterns and a bit of template magic for CUDA
Stars: ✭ 78 (-46.58%)
MprReference implementation for "Massively Parallel Rendering of Complex Closed-Form Implicit Surfaces" (SIGGRAPH 2020)
Stars: ✭ 84 (-42.47%)
Hyperlearn50% faster, 50% less RAM Machine Learning. Numba rewritten Sklearn. SVD, NNMF, PCA, LinearReg, RidgeReg, Randomized, Truncated SVD/PCA, CSR Matrices all 50+% faster
Stars: ✭ 1,204 (+724.66%)
Glove As A Tensorflow Embedding LayerTaking a pretrained GloVe model, and using it as a TensorFlow embedding weight layer **inside the GPU**. Therefore, you only need to send the index of the words through the GPU data transfer bus, reducing data transfer overhead.
Stars: ✭ 85 (-41.78%)
Training MaterialA collection of code examples as well as presentations for training purposes
Stars: ✭ 85 (-41.78%)
XpediteA non-sampling profiler purpose built to measure and optimize performance of ultra low latency/real time systems
Stars: ✭ 89 (-39.04%)
ThundersvmThunderSVM: A Fast SVM Library on GPUs and CPUs
Stars: ✭ 1,282 (+778.08%)
Deeppipe2Deep Learning library using GPU(CUDA/cuBLAS)
Stars: ✭ 90 (-38.36%)
PynvvlA Python wrapper of NVIDIA Video Loader (NVVL) with CuPy for fast video loading with Python
Stars: ✭ 95 (-34.93%)
Qualia2.0Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.
Stars: ✭ 41 (-71.92%)
CekirdeklerMulti-device OpenCL kernel load balancer and pipeliner API for C#. Uses shared-distributed memory model to keep GPUs updated fast while using same kernel on all devices(for simplicity).
Stars: ✭ 76 (-47.95%)
PytorchPyTorch tutorials A to Z
Stars: ✭ 87 (-40.41%)
NumerNumeric Erlang - vector and matrix operations with CUDA. Heavily inspired by Pteracuda - https://github.com/kevsmith/pteracuda
Stars: ✭ 91 (-37.67%)
SupraSUPRA: Software Defined Ultrasound Processing for Real-Time Applications - An Open Source 2D and 3D Pipeline from Beamforming to B-Mode
Stars: ✭ 96 (-34.25%)
PygraphistryPyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer
Stars: ✭ 1,365 (+834.93%)
Curved Lane Linesdetect curved lane lines using HSV filtering and sliding window search.
Stars: ✭ 100 (-31.51%)
RemoterySingle C file, Realtime CPU/GPU Profiler with Remote Web Viewer
Stars: ✭ 1,908 (+1206.85%)
DeepnetDeep.Net machine learning framework for F#
Stars: ✭ 99 (-32.19%)
GrlRobotics tools in C++11. Implements soft real time arm drivers for Kuka LBR iiwa plus V-REP, ROS, Constrained Optimization based planning, Hand Eye Calibration and Inverse Kinematics integration.
Stars: ✭ 105 (-28.08%)
HashcatWorld's fastest and most advanced password recovery utility
Stars: ✭ 11,014 (+7443.84%)
SpocStream Processing with OCaml
Stars: ✭ 115 (-21.23%)
Tensorflow Object Detection TutorialThe purpose of this tutorial is to learn how to install and prepare TensorFlow framework to train your own convolutional neural network object detection classifier for multiple objects, starting from scratch
Stars: ✭ 113 (-22.6%)
Ds bowl 2018Kaggle Data Science Bowl 2018
Stars: ✭ 116 (-20.55%)