Deep Painterly HarmonizationCode and data for paper "Deep Painterly Harmonization": https://arxiv.org/abs/1804.03189
Stars: ✭ 6,027 (+37568.75%)
CubertFast implementation of BERT inference directly on NVIDIA (CUDA, CUBLAS) and Intel MKL
Stars: ✭ 395 (+2368.75%)
Cudanative.jlJulia support for native CUDA programming
Stars: ✭ 393 (+2356.25%)
PyopenclOpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+4837.5%)
Neuralnetwork.netA TensorFlow-inspired neural network library built from scratch in C# 7.3 for .NET Standard 2.0, with GPU support through cuDNN
Stars: ✭ 392 (+2350%)
CudamatPython module for performing basic dense linear algebra computations on the GPU using CUDA.
Stars: ✭ 554 (+3362.5%)
CudfcuDF - GPU DataFrame Library
Stars: ✭ 4,370 (+27212.5%)
Cuda Convnet2Automatically exported from code.google.com/p/cuda-convnet2
Stars: ✭ 690 (+4212.5%)
HipsyclImplementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (+2256.25%)
Lighthouse2Lighthouse 2 framework for real-time ray tracing
Stars: ✭ 542 (+3287.5%)
Cuda.jlCUDA programming in Julia.
Stars: ✭ 370 (+2212.5%)
Scikit CudaPython interface to GPU-powered libraries
Stars: ✭ 803 (+4918.75%)
VudaVUDA is a header-only library based on Vulkan that provides a CUDA Runtime API interface for writing GPU-accelerated applications.
Stars: ✭ 373 (+2231.25%)
Stdgpustdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+3218.75%)
LibsgmStereo Semi Global Matching by cuda
Stars: ✭ 368 (+2200%)
Nv WavenetReference implementation of real-time autoregressive wavenet inference
Stars: ✭ 681 (+4156.25%)
LoopyA code generator for array-based code on CPUs and GPUs
Stars: ✭ 367 (+2193.75%)
DepthwiseconvolutionA personal depthwise convolution layer implementation on caffe by liuhao.(only GPU)
Stars: ✭ 512 (+3100%)
Arrayfire PythonPython bindings for ArrayFire: A general purpose GPU library.
Stars: ✭ 358 (+2137.5%)
MarianFast Neural Machine Translation in C++
Stars: ✭ 777 (+4756.25%)
TutorialsSome basic programming tutorials
Stars: ✭ 353 (+2106.25%)
RustacudaRusty wrapper for the CUDA Driver API
Stars: ✭ 511 (+3093.75%)
HardnetHardnet descriptor model - "Working hard to know your neighbor's margins: Local descriptor learning loss"
Stars: ✭ 350 (+2087.5%)
ChainerA flexible framework of neural networks for deep learning
Stars: ✭ 5,656 (+35250%)
NimtorchPyTorch - Python + Nim
Stars: ✭ 346 (+2062.5%)
CudahandbookSource code that accompanies The CUDA Handbook.
Stars: ✭ 345 (+2056.25%)
LibcudarangeAn interval arithmetic and affine arithmetic library for NVIDIA CUDA
Stars: ✭ 5 (-68.75%)
CudppCUDA Data Parallel Primitives Library
Stars: ✭ 333 (+1981.25%)
Xray Oxygen🌀 Oxygen Engine 2.0. [Preview] Discord: https://discord.gg/P3aMf66
Stars: ✭ 481 (+2906.25%)
ArtificioDeep Learning Computer Vision Algorithms for Real-World Use
Stars: ✭ 326 (+1937.5%)
SlangMaking it easier to work with shaders
Stars: ✭ 627 (+3818.75%)
CutorchA CUDA backend for Torch7
Stars: ✭ 322 (+1912.5%)
BitcrackerBitCracker is the first open source password cracking tool for memory units encrypted with BitLocker
Stars: ✭ 463 (+2793.75%)
Cuda ProgrammingSample codes for my CUDA programming book
Stars: ✭ 313 (+1856.25%)
AccelerateEmbedded language for high-performance array computations
Stars: ✭ 751 (+4593.75%)
Fast gicpA collection of GICP-based fast point cloud registration algorithms
Stars: ✭ 307 (+1818.75%)
Open3dOpen3D: A Modern Library for 3D Data Processing
Stars: ✭ 5,860 (+36525%)
Unsupervised VideosUnsupervised Learning of Video Representations using LSTMs
Stars: ✭ 309 (+1831.25%)
VexclVexCL is a C++ vector expression template library for OpenCL/CUDA/OpenMP
Stars: ✭ 626 (+3812.5%)
PyretriOpen source deep learning based unsupervised image retrieval toolbox built on PyTorch🔥
Stars: ✭ 795 (+4868.75%)
KomputationKomputation is a neural network framework for the Java Virtual Machine written in Kotlin and CUDA C.
Stars: ✭ 295 (+1743.75%)
Tensorflow CmakeTensorFlow examples in C, C++, Go and Python without bazel but with cmake and FindTensorFlow.cmake
Stars: ✭ 418 (+2512.5%)
Open quadtree mappingThis is a monocular dense mapping system corresponding to IROS 2018 "Quadtree-accelerated Real-time Monocular Dense Mapping"
Stars: ✭ 292 (+1725%)
SpeedtorchLibrary for faster pinned CPU <-> GPU transfer in Pytorch
Stars: ✭ 615 (+3743.75%)
Deep DiamondA fast Clojure Tensor & Deep Learning library
Stars: ✭ 288 (+1700%)
IcpcudaSuper fast implementation of ICP in CUDA for compute capable devices 3.5 or higher
Stars: ✭ 416 (+2500%)
Trace.moe Telegram BotThis Telegram Bot can tell the anime when you send an screenshot to it
Stars: ✭ 284 (+1675%)
LireOpen source library for content based image retrieval / visual information retrieval.
Stars: ✭ 740 (+4525%)
H2o4gpuH2Oai GPU Edition
Stars: ✭ 416 (+2500%)
CudadbclusteringClustering via Graphics Processor, using NVIDIA CUDA sdk to preform database clustering on the massively parallel graphics card processor
Stars: ✭ 6 (-62.5%)
Pytorch Losslabel-smooth, amsoftmax, focal-loss, triplet-loss, lovasz-softmax. Maybe useful
Stars: ✭ 812 (+4975%)
ArraymancerA fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+4856.25%)
KintinuousReal-time large scale dense visual SLAM system
Stars: ✭ 740 (+4525%)
ThundergbmThunderGBM: Fast GBDTs and Random Forests on GPUs
Stars: ✭ 586 (+3562.5%)
Cbir🏞 A content-based image retrieval (CBIR) system
Stars: ✭ 407 (+2443.75%)