All Projects → Amgx → Similar Projects or Alternatives

403 Open source projects that are alternatives of or similar to Amgx

Creepminer
Burstcoin C++ CPU and GPU Miner
Stars: ✭ 169 (-18.36%)
Mutual labels:  cuda
Babelstream
STREAM, for lots of devices written in many programming models
Stars: ✭ 121 (-41.55%)
Mutual labels:  cuda
Rmm
RAPIDS Memory Manager
Stars: ✭ 154 (-25.6%)
Mutual labels:  cuda
Tensorflow Optimized Wheels
TensorFlow wheels built for latest CUDA/CuDNN and enabled performance flags: SSE, AVX, FMA; XLA
Stars: ✭ 118 (-43%)
Mutual labels:  cuda
Ck Caffe
Collective Knowledge workflow for Caffe to automate installation across diverse platforms and to collaboratively evaluate and optimize Caffe-based workloads across diverse hardware, software and data sets (compilers, libraries, tools, models, inputs):
Stars: ✭ 192 (-7.25%)
Mutual labels:  cuda
Mtensor
A C++ Cuda Tensor Lazy Computing Library
Stars: ✭ 115 (-44.44%)
Mutual labels:  cuda
Dsmnet
Domain-invariant Stereo Matching Networks
Stars: ✭ 153 (-26.09%)
Mutual labels:  cuda
Pytorch spn
Extension package for spatial propagation network in pytorch.
Stars: ✭ 114 (-44.93%)
Mutual labels:  cuda
Cuda programming
Code from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch
Stars: ✭ 169 (-18.36%)
Mutual labels:  cuda
Pytorch Unflow
a reimplementation of UnFlow in PyTorch that matches the official TensorFlow version
Stars: ✭ 113 (-45.41%)
Mutual labels:  cuda
Jetson
Helmut Hoffer von Ankershoffen experimenting with arm64 based NVIDIA Jetson (Nano and AGX Xavier) edge devices running Kubernetes (K8s) for machine learning (ML) including Jupyter Notebooks, TensorFlow Training and TensorFlow Serving using CUDA for smart IoT.
Stars: ✭ 151 (-27.05%)
Mutual labels:  cuda
Futhark
💥💻💥 A data-parallel functional programming language
Stars: ✭ 1,641 (+692.75%)
Mutual labels:  cuda
Simplegpuhashtable
A simple GPU hash table implemented in CUDA using lock free techniques
Stars: ✭ 198 (-4.35%)
Mutual labels:  cuda
Torch Mesh Isect
Stars: ✭ 107 (-48.31%)
Mutual labels:  cuda
Ginkgo
Numerical linear algebra software package
Stars: ✭ 149 (-28.02%)
Mutual labels:  cuda
Dace
DaCe - Data Centric Parallel Programming
Stars: ✭ 106 (-48.79%)
Mutual labels:  cuda
Deformable Kernels
Deforming kernels to adapt towards object deformation. In ICLR 2020.
Stars: ✭ 166 (-19.81%)
Mutual labels:  cuda
Cuda Winograd
Fast CUDA Kernels for ResNet Inference.
Stars: ✭ 104 (-49.76%)
Mutual labels:  cuda
Sketchgraphs
A dataset of 15 million CAD sketches with geometric constraint graphs.
Stars: ✭ 148 (-28.5%)
Mutual labels:  cuda
Deepnet
Deep.Net machine learning framework for F#
Stars: ✭ 99 (-52.17%)
Mutual labels:  cuda
Macos Egpu Cuda Guide
Set up CUDA for machine learning (and gaming) on macOS using a NVIDIA eGPU
Stars: ✭ 187 (-9.66%)
Mutual labels:  cuda
Extending Jax
Extending JAX with custom C++ and CUDA code
Stars: ✭ 98 (-52.66%)
Mutual labels:  cuda
Volumetric Path Tracer
☁️ Volumetric path tracer using cuda
Stars: ✭ 145 (-29.95%)
Mutual labels:  cuda
Pynvvl
A Python wrapper of NVIDIA Video Loader (NVVL) with CuPy for fast video loading with Python
Stars: ✭ 95 (-54.11%)
Mutual labels:  cuda
Quda
QUDA is a library for performing calculations in lattice QCD on GPUs.
Stars: ✭ 166 (-19.81%)
Mutual labels:  cuda
Fbtt Embedding
This is a Tensor Train based compression library to compress sparse embedding tables used in large-scale machine learning models such as recommendation and natural language processing. We showed this library can reduce the total model size by up to 100x in Facebook’s open sourced DLRM model while achieving same model quality. Our implementation is faster than the state-of-the-art implementations. Existing the state-of-the-art library also decompresses the whole embedding tables on the fly therefore they do not provide memory reduction during runtime of the training. Our library decompresses only the requested rows therefore can provide 10,000 times memory footprint reduction per embedding table. The library also includes a software cache to store a portion of the entries in the table in decompressed format for faster lookup and process.
Stars: ✭ 92 (-55.56%)
Mutual labels:  cuda
Remotery
Single C file, Realtime CPU/GPU Profiler with Remote Web Viewer
Stars: ✭ 1,908 (+821.74%)
Mutual labels:  cuda
Tutorial Ubuntu 18.04 Install Nvidia Driver And Cuda And Cudnn And Build Tensorflow For Gpu
Ubuntu 18.04 How to install Nvidia driver + CUDA + CUDNN + build tensorflow for gpu step by step command line
Stars: ✭ 91 (-56.04%)
Mutual labels:  cuda
Oneflow
OneFlow is a performance-centered and open-source deep learning framework.
Stars: ✭ 2,868 (+1285.51%)
Mutual labels:  cuda
Matconvnet
MatConvNet: CNNs for MATLAB
Stars: ✭ 1,299 (+527.54%)
Mutual labels:  cuda
Libgdf
[ARCHIVED] C GPU DataFrame Library
Stars: ✭ 142 (-31.4%)
Mutual labels:  cuda
Deeppipe2
Deep Learning library using GPU(CUDA/cuBLAS)
Stars: ✭ 90 (-56.52%)
Mutual labels:  cuda
Nccl Tests
NCCL Tests
Stars: ✭ 166 (-19.81%)
Mutual labels:  cuda
Weighted softmax loss
Weighted Softmax Loss Layer for Caffe
Stars: ✭ 89 (-57%)
Mutual labels:  cuda
Ctranslate2
Fast inference engine for OpenNMT models
Stars: ✭ 140 (-32.37%)
Mutual labels:  cuda
Thundersvm
ThunderSVM: A Fast SVM Library on GPUs and CPUs
Stars: ✭ 1,282 (+519.32%)
Mutual labels:  cuda
Hybridizer Basic Samples
Examples of C# code compiled to GPU by hybridizer
Stars: ✭ 186 (-10.14%)
Mutual labels:  cuda
Deep Learning Boot Camp
A community run, 5-day PyTorch Deep Learning Bootcamp
Stars: ✭ 1,270 (+513.53%)
Mutual labels:  cuda
Marian Dev
Fast Neural Machine Translation in C++ - development repository
Stars: ✭ 136 (-34.3%)
Mutual labels:  cuda
Knn cuda
pytorch knn [cuda version]
Stars: ✭ 86 (-58.45%)
Mutual labels:  cuda
Opencuda
Stars: ✭ 164 (-20.77%)
Mutual labels:  cuda
Pytorch Emdloss
PyTorch 1.0 implementation of the approximate Earth Mover's Distance
Stars: ✭ 82 (-60.39%)
Mutual labels:  cuda
Partial Order Pruning
Partial Order Pruning: for Best Speed/Accuracy Trade-off in Neural Architecture Search
Stars: ✭ 135 (-34.78%)
Mutual labels:  cuda
Nnabla Ext Cuda
A CUDA Extension of Neural Network Libraries
Stars: ✭ 79 (-61.84%)
Mutual labels:  cuda
Viseron
Self-hosted NVR with object detection
Stars: ✭ 192 (-7.25%)
Mutual labels:  cuda
Cuda Design Patterns
Some CUDA design patterns and a bit of template magic for CUDA
Stars: ✭ 78 (-62.32%)
Mutual labels:  cuda
Accelerate Llvm
LLVM backend for Accelerate
Stars: ✭ 134 (-35.27%)
Mutual labels:  cuda
Cudart.jl
Julia wrapper for CUDA runtime API
Stars: ✭ 75 (-63.77%)
Mutual labels:  cuda
Primitiv
A Neural Network Toolkit.
Stars: ✭ 164 (-20.77%)
Mutual labels:  cuda
Parenchyma
An extensible HPC framework for CUDA, OpenCL and native CPU.
Stars: ✭ 71 (-65.7%)
Mutual labels:  cuda
Libcudacxx
The C++ Standard Library for your entire system.
Stars: ✭ 1,861 (+799.03%)
Mutual labels:  cuda
Deepjointfilter
The source code of ECCV16 'Deep Joint Image Filtering'.
Stars: ✭ 68 (-67.15%)
Mutual labels:  cuda
Ssd Gpu Dma
Build userspace NVMe drivers and storage applications with CUDA support
Stars: ✭ 172 (-16.91%)
Mutual labels:  cuda
Agency
Execution primitives for C++
Stars: ✭ 127 (-38.65%)
Mutual labels:  cuda
Cunn
Stars: ✭ 205 (-0.97%)
Mutual labels:  cuda
Pine
🌲 Aimbot powered by real-time object detection with neural networks, GPU accelerated with Nvidia. Optimized for use with CS:GO.
Stars: ✭ 202 (-2.42%)
Mutual labels:  cuda
Timemory
Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template API is essentially a framework to creating tools: it is designed to provide a unifying interface for recording various performance measurements alongside data logging and interfaces to other tools.
Stars: ✭ 192 (-7.25%)
Mutual labels:  cuda
Gmonitor
gmonitor is a GPU monitor (Nvidia only at the moment)
Stars: ✭ 169 (-18.36%)
Mutual labels:  cuda
Clojurecuda
Clojure library for CUDA development
Stars: ✭ 158 (-23.67%)
Mutual labels:  cuda
Kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+5286.96%)
Mutual labels:  cuda
61-120 of 403 similar projects