Cglm📽 Highly Optimized Graphics Math (glm) for C
Stars: ✭ 887 (+651.69%)
hipaccA domain-specific language and compiler for image processing
Stars: ✭ 72 (-38.98%)
ThundersvmThunderSVM: A Fast SVM Library on GPUs and CPUs
Stars: ✭ 1,282 (+986.44%)
tiny-cuda-nnLightning fast & tiny C++/CUDA neural network framework
Stars: ✭ 908 (+669.49%)
CudadbclusteringClustering via Graphics Processor, using NVIDIA CUDA sdk to preform database clustering on the massively parallel graphics card processor
Stars: ✭ 6 (-94.92%)
CudaSHA256Simple tool to calculate sha256 on GPU using Cuda
Stars: ✭ 38 (-67.8%)
MpmSimulating on GPU using Material Point Method and rendering.
Stars: ✭ 61 (-48.31%)
revisiting-sepconvan implementation of Revisiting Adaptive Convolutions for Video Frame Interpolation using PyTorch
Stars: ✭ 43 (-63.56%)
Pytorch Losslabel-smooth, amsoftmax, focal-loss, triplet-loss, lovasz-softmax. Maybe useful
Stars: ✭ 812 (+588.14%)
lbvhan implementation of parallel linear BVH (LBVH) on GPU
Stars: ✭ 67 (-43.22%)
MtensorA C++ Cuda Tensor Lazy Computing Library
Stars: ✭ 115 (-2.54%)
ThrustRTCCUDA tool set for non-C++ languages that provides similar functionality like Thrust, with NVRTC at its core.
Stars: ✭ 41 (-65.25%)
BlocksparseEfficient GPU kernels for block-sparse matrix multiplication and convolution
Stars: ✭ 797 (+575.42%)
CudaExperiments with CUDA and Rust
Stars: ✭ 31 (-73.73%)
Arrayfire PythonPython bindings for ArrayFire: A general purpose GPU library.
Stars: ✭ 358 (+203.39%)
dynamic-occupancy-grid-mapImplementation of A Random Finite Set Approach for Dynamic Occupancy Grid Maps with Real-Time Application
Stars: ✭ 89 (-24.58%)
PyopenclOpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+569.49%)
warpcontinuous energy monte carlo neutron transport in general geometries on GPUs
Stars: ✭ 27 (-77.12%)
Spring 5 ExamplesThis repository is contains spring-boot 2 / spring framework 5 project examples. Using reactive programming model / paradigm and Kotlin
Stars: ✭ 87 (-26.27%)
bazel.cmakebazel.cmake mimics the behavior of bazel to simplify the usability of CMake
Stars: ✭ 38 (-67.8%)
MarianFast Neural Machine Translation in C++
Stars: ✭ 777 (+558.47%)
JampackExperimental parallel compression algorithm
Stars: ✭ 21 (-82.2%)
block-alignerSIMD-accelerated library for computing global and X-drop affine gap penalty sequence-to-sequence or sequence-to-profile alignments using an adaptive block-based algorithm.
Stars: ✭ 58 (-50.85%)
AccelerateEmbedded language for high-performance array computations
Stars: ✭ 751 (+536.44%)
DeepnetDeep.Net machine learning framework for F#
Stars: ✭ 99 (-16.1%)
Fat-CloudsGPU Fluid Simulation with Volumetric Rendering
Stars: ✭ 81 (-31.36%)
KintinuousReal-time large scale dense visual SLAM system
Stars: ✭ 740 (+527.12%)
FLAMEGPU2FLAME GPU 2 is a GPU accelerated agent based modelling framework for C++ and Python
Stars: ✭ 25 (-78.81%)
positional-popcountFast C functions for the computing the positional popcount (pospopcnt).
Stars: ✭ 47 (-60.17%)
Iodineiodine - HTTP / WebSockets Server for Ruby with Pub/Sub support
Stars: ✭ 720 (+510.17%)
cs开箱即用的基于命令的消息处理框架,让 websocket 和 tcp 开发就像 http 那样简单
Stars: ✭ 19 (-83.9%)
Python Opencv Cudacustom opencv_contrib module which exposes opencv cuda optical flow methods with python bindings
Stars: ✭ 86 (-27.12%)
pyRenderLightweight Cuda Renderer with Python Wrapper.
Stars: ✭ 49 (-58.47%)
Cuda Convnet2Automatically exported from code.google.com/p/cuda-convnet2
Stars: ✭ 690 (+484.75%)
Cuda SamplesSamples for CUDA Developers which demonstrates features in CUDA Toolkit
Stars: ✭ 1,087 (+821.19%)
GPU-PathtracerGPU Raytracer from scratch in C++/CUDA
Stars: ✭ 326 (+176.27%)
Nv WavenetReference implementation of real-time autoregressive wavenet inference
Stars: ✭ 681 (+477.12%)
briefmatchBriefMatch real-time GPU optical flow
Stars: ✭ 36 (-69.49%)
Adacof PytorchOfficial source code for our paper "AdaCoF: Adaptive Collaboration of Flows for Video Frame Interpolation" (CVPR 2020)
Stars: ✭ 110 (-6.78%)
NN-CUDA-ExampleSeveral simple examples for popular neural network toolkits calling custom CUDA operators.
Stars: ✭ 594 (+403.39%)
HighwayhashNative Go version of HighwayHash with optimized assembly implementations on Intel and ARM. Able to process over 10 GB/sec on a single core on Intel CPUs - https://en.wikipedia.org/wiki/HighwayHash
Stars: ✭ 670 (+467.8%)
Hzproctorch data augmentation toolbox (supports affine transform)
Stars: ✭ 56 (-52.54%)
K2FSA/FST algorithms, differentiable, with PyTorch compatibility.
Stars: ✭ 354 (+200%)
ElasticfusionReal-time dense visual SLAM system
Stars: ✭ 1,298 (+1000%)
DeepjointfilterThe source code of ECCV16 'Deep Joint Image Filtering'.
Stars: ✭ 68 (-42.37%)
Cuda word splitThis project is an old code for Chinese words split. It is written by CUDA at 2010, so it could not run well directly under you platform without an GPU card.
Stars: ✭ 31 (-73.73%)
TutorialsSome basic programming tutorials
Stars: ✭ 353 (+199.15%)
Mc CnnStereo Matching by Training a Convolutional Neural Network to Compare Image Patches
Stars: ✭ 638 (+440.68%)
GOSHAn ultra-fast, GPU-based large graph embedding algorithm utilizing a novel coarsening algorithm requiring not more than a single GPU.
Stars: ✭ 12 (-89.83%)
SleefSIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Stars: ✭ 353 (+199.15%)
AtmosphereRealtime Client Server Framework for the JVM, supporting WebSockets with Cross-Browser Fallbacks
Stars: ✭ 3,552 (+2910.17%)
VisionarayA C++-based, cross platform ray tracing library
Stars: ✭ 342 (+189.83%)
Torch samplingEfficient reservoir sampling implementation for PyTorch
Stars: ✭ 68 (-42.37%)
Cuda CnnImplementation of a simple CNN using CUDA
Stars: ✭ 29 (-75.42%)
NimtorchPyTorch - Python + Nim
Stars: ✭ 346 (+193.22%)