CoffeeBeforeArch / Cuda_programming
Licence: gpl-3.0
Code from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch
Stars: ✭ 169
Labels
Projects that are alternatives of or similar to Cuda programming
Compactcnncascade
A binary library for very fast face detection using compact CNNs.
Stars: ✭ 152 (-10.06%)
Mutual labels: cuda
Cx db8
a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (Bert, Universal Sentence Encoder, Flair)
Stars: ✭ 164 (-2.96%)
Mutual labels: cuda
Cumf als
CUDA Matrix Factorization Library with Alternating Least Square (ALS)
Stars: ✭ 154 (-8.88%)
Mutual labels: cuda
Multi Gpu Programming Models
Examples demonstrating available options to program multiple GPUs in a single node or a cluster
Stars: ✭ 165 (-2.37%)
Mutual labels: cuda
Deformable Kernels
Deforming kernels to adapt towards object deformation. In ICLR 2020.
Stars: ✭ 166 (-1.78%)
Mutual labels: cuda
Khiva
An open-source library of algorithms to analyse time series in GPU and CPU.
Stars: ✭ 161 (-4.73%)
Mutual labels: cuda
Quda
QUDA is a library for performing calculations in lattice QCD on GPUs.
Stars: ✭ 166 (-1.78%)
Mutual labels: cuda
Jetson
Helmut Hoffer von Ankershoffen experimenting with arm64 based NVIDIA Jetson (Nano and AGX Xavier) edge devices running Kubernetes (K8s) for machine learning (ML) including Jupyter Notebooks, TensorFlow Training and TensorFlow Serving using CUDA for smart IoT.
Stars: ✭ 151 (-10.65%)
Mutual labels: cuda
Dragon
Dragon: A Computation Graph Virtual Machine Based Deep Learning Framework.
Stars: ✭ 168 (-0.59%)
Mutual labels: cuda
Floor
A C++ Compute/Graphics Library and Toolchain enabling same-source CUDA/Host/Metal/OpenCL/Vulkan C++ programming and execution.
Stars: ✭ 166 (-1.78%)
Mutual labels: cuda
GPGPU Programming with CUDA
This repository contains all code from the YouTube series "CUDA Crash Course (v3)" by CoffeeBeforeArch.
Contact
Suggestions for specific content can be sent to: [email protected]
Environment
Operating System: Ubuntu 18.04
Text Editor: VIM
GPU: NVIDIA GTX 2060
CUDA version: 11, 10, 9
Concepts covered in each video
Environment Setup
Video | Concepts | Files |
---|---|---|
Visual Studio 2019 | IDE, Linker | |
Linux | VIM, NVCC |
Vector Addition
Video | Concepts | Files |
---|---|---|
Baseline | Allocation, Copying Memory, Indexing, Thread IDs, PTX, SASS | |
Pinned Memory | Pinned Memory | |
Unified Memory | Unified Memory, Prefetching |
Matrix Multiplication (GEMM)
Video | Concepts | Files |
---|---|---|
2D Indexing | ||
Shared Memory |
Sum Reduction
Video | Concepts | Files |
---|---|---|
Inter-kernel synchronization | ||
Cooperative Groups |
Convolution
Video | Concepts | Files |
---|---|---|
Constant Memory |
Histogram
Video | Concepts | Files |
---|---|---|
Global Atomics | ||
Shared Memory Atomics |
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].