CoffeeBeforeArch / Cuda_programming

Licence: gpl-3.0
Code from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch

GPGPU Programming with CUDA

This repository contains all code from the YouTube series "CUDA Crash Course (v3)" by CoffeeBeforeArch.

Contact

Suggestions for specific content can be sent to: [email protected]

Environment

Operating System: Ubuntu 18.04

Text Editor: VIM

GPU: NVIDIA GeForce RTX 2060

CUDA version: 11, 10, 9

Concepts covered in each video

Environment Setup

| Video | Concepts | Files |
|-------|----------|-------|
| Visual Studio 2019 | IDE, Linker | |
| Linux | VIM, NVCC | |

Vector Addition

| Video | Concepts | Files |
|-------|----------|-------|
| Baseline | Allocation, Copying Memory, Indexing, Thread IDs, PTX, SASS | |
| Pinned Memory | Pinned Memory | |
| Unified Memory | Unified Memory, Prefetching | |
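
As a rough illustration of the baseline concepts listed above (allocation, copying memory, indexing, thread IDs), the following is a minimal sketch and not the code from the series. The names `vectorAdd` and `vectorAddOnGpu` are hypothetical, both inputs are assumed to have the same length, and error checking is omitted for brevity.

```cuda
#include <cuda_runtime.h>
#include <vector>

// Each thread adds one pair of elements.
__global__ void vectorAdd(const int *a, const int *b, int *c, int n) {
  // Global thread ID: block offset plus position within the block
  int tid = blockIdx.x * blockDim.x + threadIdx.x;
  // Guard against threads that fall past the end of the arrays
  if (tid < n) c[tid] = a[tid] + b[tid];
}

// Hypothetical host wrapper: allocate, copy in, launch, copy out, free.
// Assumes a.size() == b.size(); CUDA error checking is omitted.
std::vector<int> vectorAddOnGpu(const std::vector<int> &a,
                                const std::vector<int> &b) {
  int n = static_cast<int>(a.size());
  std::vector<int> c(n);
  if (n == 0) return c;
  size_t bytes = static_cast<size_t>(n) * sizeof(int);

  // Allocate device memory
  int *d_a, *d_b, *d_c;
  cudaMalloc(&d_a, bytes);
  cudaMalloc(&d_b, bytes);
  cudaMalloc(&d_c, bytes);

  // Copy inputs from host to device
  cudaMemcpy(d_a, a.data(), bytes, cudaMemcpyHostToDevice);
  cudaMemcpy(d_b, b.data(), bytes, cudaMemcpyHostToDevice);

  // Launch enough blocks to cover all n elements
  int threads = 256;
  int blocks = (n + threads - 1) / threads;
  vectorAdd<<<blocks, threads>>>(d_a, d_b, d_c, n);

  // This blocking copy on the default stream also waits for the kernel
  cudaMemcpy(c.data(), d_c, bytes, cudaMemcpyDeviceToHost);

  cudaFree(d_a);
  cudaFree(d_b);
  cudaFree(d_c);
  return c;
}
```

Calling `vectorAddOnGpu(a, b)` from a `main` function in a `.cu` file compiled with NVCC returns the element-wise sum as a host vector.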

Matrix Multiplication (GEMM)

| Video | Concepts | Files |
|-------|----------|-------|
| | 2D Indexing | |
| | Shared Memory | |
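
To make the two concepts above concrete, here is a hedged sketch of a square matrix multiplication that combines 2D indexing with shared-memory tiling. It is not taken from the series; `TILE`, `tiledMatrixMul`, and `matMulOnGpu` are hypothetical names, matrices are assumed to be square, row-major `n x n` integer arrays, and error checking is omitted.

```cuda
#include <cuda_runtime.h>
#include <vector>

constexpr int TILE = 16;  // Assumed tile width (threads per block dimension)

__global__ void tiledMatrixMul(const int *a, const int *b, int *c, int n) {
  // Per-block scratch space for one tile of A and one tile of B
  __shared__ int tileA[TILE][TILE];
  __shared__ int tileB[TILE][TILE];

  // 2D indexing: each thread owns one output element (row, col)
  int row = blockIdx.y * TILE + threadIdx.y;
  int col = blockIdx.x * TILE + threadIdx.x;

  int sum = 0;
  for (int t = 0; t < (n + TILE - 1) / TILE; t++) {
    int aCol = t * TILE + threadIdx.x;
    int bRow = t * TILE + threadIdx.y;
    // Load one element of each tile, padding with zeros out of range
    tileA[threadIdx.y][threadIdx.x] =
        (row < n && aCol < n) ? a[row * n + aCol] : 0;
    tileB[threadIdx.y][threadIdx.x] =
        (bRow < n && col < n) ? b[bRow * n + col] : 0;
    __syncthreads();  // Wait until both tiles are fully loaded

    for (int k = 0; k < TILE; k++)
      sum += tileA[threadIdx.y][k] * tileB[k][threadIdx.x];
    __syncthreads();  // Wait before the tiles are overwritten
  }
  if (row < n && col < n) c[row * n + col] = sum;
}

// Hypothetical host wrapper; assumes a and b each hold n * n elements.
std::vector<int> matMulOnGpu(const std::vector<int> &a,
                             const std::vector<int> &b, int n) {
  std::vector<int> c(static_cast<size_t>(n) * n);
  if (n == 0) return c;
  size_t bytes = c.size() * sizeof(int);

  int *d_a, *d_b, *d_c;
  cudaMalloc(&d_a, bytes);
  cudaMalloc(&d_b, bytes);
  cudaMalloc(&d_c, bytes);
  cudaMemcpy(d_a, a.data(), bytes, cudaMemcpyHostToDevice);
  cudaMemcpy(d_b, b.data(), bytes, cudaMemcpyHostToDevice);

  // 2D grid of 2D blocks covering the output matrix
  dim3 threads(TILE, TILE);
  dim3 blocks((n + TILE - 1) / TILE, (n + TILE - 1) / TILE);
  tiledMatrixMul<<<blocks, threads>>>(d_a, d_b, d_c, n);

  cudaMemcpy(c.data(), d_c, bytes, cudaMemcpyDeviceToHost);
  cudaFree(d_a);
  cudaFree(d_b);
  cudaFree(d_c);
  return c;
}
```

The zero padding lets the same kernel handle sizes that are not multiples of `TILE`, at the cost of a few extra comparisons per load.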

Sum Reduction

| Video | Concepts | Files |
|-------|----------|-------|
| | Inter-kernel synchronization | |
| | Cooperative Groups | |
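
One way to picture inter-kernel synchronization is a reduction that is launched repeatedly, using each kernel boundary as a grid-wide barrier. The sketch below assumes that approach and does not use Cooperative Groups; it is not the code from the series. `THREADS`, `sumReduction`, and `sumOnGpu` are hypothetical names, the block size is assumed to be a power of two, and error checking is omitted.

```cuda
#include <cuda_runtime.h>
#include <utility>
#include <vector>

constexpr int THREADS = 256;  // Assumed block size; must be a power of two

// Each block reduces up to THREADS inputs to one partial sum.
__global__ void sumReduction(const int *in, int *out, int n) {
  __shared__ int partial[THREADS];
  int tid = blockIdx.x * blockDim.x + threadIdx.x;
  // Load one element per thread, padding with zero past the end
  partial[threadIdx.x] = (tid < n) ? in[tid] : 0;
  __syncthreads();

  // Tree reduction in shared memory: halve the active threads each step
  for (int s = blockDim.x / 2; s > 0; s >>= 1) {
    if (threadIdx.x < s) partial[threadIdx.x] += partial[threadIdx.x + s];
    __syncthreads();
  }
  if (threadIdx.x == 0) out[blockIdx.x] = partial[0];
}

// Hypothetical host wrapper: relaunch until a single value remains.
int sumOnGpu(const std::vector<int> &data) {
  int n = static_cast<int>(data.size());
  if (n == 0) return 0;
  int blocks = (n + THREADS - 1) / THREADS;

  int *d_in, *d_out;
  cudaMalloc(&d_in, static_cast<size_t>(n) * sizeof(int));
  cudaMalloc(&d_out, static_cast<size_t>(blocks) * sizeof(int));
  cudaMemcpy(d_in, data.data(), static_cast<size_t>(n) * sizeof(int),
             cudaMemcpyHostToDevice);

  while (true) {
    // Kernels on the same stream run in order, so each launch sees the
    // complete output of the previous one
    sumReduction<<<blocks, THREADS>>>(d_in, d_out, n);
    if (blocks == 1) break;
    // The partial sums become the input of the next pass
    n = blocks;
    blocks = (n + THREADS - 1) / THREADS;
    std::swap(d_in, d_out);
  }

  int result = 0;
  cudaMemcpy(&result, d_out, sizeof(int), cudaMemcpyDeviceToHost);
  cudaFree(d_in);
  cudaFree(d_out);
  return result;
}
```

Swapping the two buffers between passes avoids extra allocations: each pass writes fewer values than the pass before it, so both buffers are always large enough.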

Convolution

| Video | Concepts | Files |
|-------|----------|-------|
| | Constant Memory | |
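
A minimal sketch of how constant memory might be used for a 1D convolution follows; it is an illustration under stated assumptions, not the code from the series. `MASK_LENGTH`, `mask`, `convolution1d`, and `convolveOnGpu` are hypothetical names, the mask is applied centered on each element without flipping, out-of-range neighbors are treated as zero, and error checking is omitted.

```cuda
#include <cuda_runtime.h>
#include <vector>

constexpr int MASK_LENGTH = 3;  // Assumed (odd) mask size

// The mask lives in constant memory: read-only in kernels and cached,
// which suits values that every thread reads
__constant__ int mask[MASK_LENGTH];

__global__ void convolution1d(const int *in, int *out, int n) {
  int tid = blockIdx.x * blockDim.x + threadIdx.x;
  if (tid >= n) return;

  int start = tid - MASK_LENGTH / 2;  // Leftmost input under the mask
  int sum = 0;
  for (int j = 0; j < MASK_LENGTH; j++) {
    int idx = start + j;
    // Skip neighbors that fall outside the array (zero padding)
    if (idx >= 0 && idx < n) sum += in[idx] * mask[j];
  }
  out[tid] = sum;
}

// Hypothetical host wrapper; assumes hostMask.size() == MASK_LENGTH.
std::vector<int> convolveOnGpu(const std::vector<int> &in,
                               const std::vector<int> &hostMask) {
  int n = static_cast<int>(in.size());
  std::vector<int> out(n);
  if (n == 0) return out;
  size_t bytes = static_cast<size_t>(n) * sizeof(int);

  // Constant memory is filled with cudaMemcpyToSymbol, not cudaMemcpy
  cudaMemcpyToSymbol(mask, hostMask.data(), MASK_LENGTH * sizeof(int));

  int *d_in, *d_out;
  cudaMalloc(&d_in, bytes);
  cudaMalloc(&d_out, bytes);
  cudaMemcpy(d_in, in.data(), bytes, cudaMemcpyHostToDevice);

  int threads = 256;
  int blocks = (n + threads - 1) / threads;
  convolution1d<<<blocks, threads>>>(d_in, d_out, n);

  cudaMemcpy(out.data(), d_out, bytes, cudaMemcpyDeviceToHost);
  cudaFree(d_in);
  cudaFree(d_out);
  return out;
}
```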

Histogram

| Video | Concepts | Files |
|-------|----------|-------|
| | Global Atomics | |
| | Shared Memory Atomics | |
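
The two kinds of atomics listed above can be combined in one kernel: each block counts into a private shared-memory histogram, then merges it into the global result. The sketch below assumes that design and is not the code from the series. `BINS`, `histogram`, and `histogramOnGpu` are hypothetical names, inputs are assumed to be non-negative and are binned by `value % BINS`, and error checking is omitted.

```cuda
#include <cuda_runtime.h>
#include <vector>

constexpr int BINS = 4;  // Assumed bin count; must not exceed the block size

__global__ void histogram(const int *in, int *bins, int n) {
  // Per-block private histogram in shared memory
  __shared__ int local[BINS];
  if (threadIdx.x < BINS) local[threadIdx.x] = 0;
  __syncthreads();

  // Shared memory atomics: contention stays within one block
  int tid = blockIdx.x * blockDim.x + threadIdx.x;
  if (tid < n) atomicAdd(&local[in[tid] % BINS], 1);
  __syncthreads();

  // Global atomics: one update per bin per block
  if (threadIdx.x < BINS) atomicAdd(&bins[threadIdx.x], local[threadIdx.x]);
}

// Hypothetical host wrapper; assumes every input value is non-negative.
std::vector<int> histogramOnGpu(const std::vector<int> &in) {
  std::vector<int> result(BINS, 0);
  int n = static_cast<int>(in.size());
  if (n == 0) return result;
  size_t bytes = static_cast<size_t>(n) * sizeof(int);

  int *d_in, *d_bins;
  cudaMalloc(&d_in, bytes);
  cudaMalloc(&d_bins, BINS * sizeof(int));
  cudaMemcpy(d_in, in.data(), bytes, cudaMemcpyHostToDevice);
  // The global bins must start at zero because the kernel only adds
  cudaMemset(d_bins, 0, BINS * sizeof(int));

  int threads = 256;
  int blocks = (n + threads - 1) / threads;
  histogram<<<blocks, threads>>>(d_in, d_bins, n);

  cudaMemcpy(result.data(), d_bins, BINS * sizeof(int),
             cudaMemcpyDeviceToHost);
  cudaFree(d_in);
  cudaFree(d_bins);
  return result;
}
```

Dropping the shared-memory stage and calling `atomicAdd` directly on `bins` would give a global-atomics-only variant, which is simpler but sends every update to global memory.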