HashcatWorld's fastest and most advanced password recovery utility
Stars: ✭ 11,014 (+9233.9%)
NvpipeNVIDIA-accelerated zero latency video compression library for interactive remoting applications
Stars: ✭ 376 (+218.64%)
Md5 SimdAccelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Stars: ✭ 71 (-39.83%)
VudaVUDA is a header-only library based on Vulkan that provides a CUDA Runtime API interface for writing GPU-accelerated applications.
Stars: ✭ 373 (+216.1%)
XsimdC++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, NEON, AVX512)
Stars: ✭ 964 (+716.95%)
LibsgmStereo Semi Global Matching by cuda
Stars: ✭ 368 (+211.86%)
SapphiredbSapphireDb Server, a self-hosted, easy to use realtime database for Asp.Net Core and EF Core
Stars: ✭ 326 (+176.27%)
Des CudaDES cracking using brute force algorithm and CUDA
Stars: ✭ 21 (-82.2%)
CutorchA CUDA backend for Torch7
Stars: ✭ 322 (+172.88%)
Cuda ProgrammingSample codes for my CUDA programming book
Stars: ✭ 313 (+165.25%)
OpensseOpen Sketch Search Engine- 3D object retrieval based on sketch image as input
Stars: ✭ 883 (+648.31%)
DespacerC library to remove white space from strings as fast as possible
Stars: ✭ 90 (-23.73%)
DarkposeDistribution-Aware Coordinate Representation for Human Pose Estimation
Stars: ✭ 369 (+212.71%)
LoopyA code generator for array-based code on CPUs and GPUs
Stars: ✭ 367 (+211.02%)
Knn CudaFast k nearest neighbor search using GPU
Stars: ✭ 310 (+162.71%)
Fastbase64SIMD-accelerated base64 codecs
Stars: ✭ 309 (+161.86%)
Autodock GpuAutoDock for GPUs and other accelerators
Stars: ✭ 65 (-44.92%)
Person Reid ganICCV2017 Unlabeled Samples Generated by GAN Improve the Person Re-identification Baseline in vitro
Stars: ✭ 301 (+155.08%)
KsimThe little simulator that could.
Stars: ✭ 11 (-90.68%)
Ffmpeg Build ScriptThe FFmpeg build script provides an easy way to build a static FFmpeg on OSX and Linux with non-free codecs included.
Stars: ✭ 290 (+145.76%)
Theano Roi AlignAn implementation of the RoiAlign operation for Theano
Stars: ✭ 11 (-90.68%)
Cudadrv.jlA Julia wrapper for the CUDA driver API.
Stars: ✭ 64 (-45.76%)
Cuda Api WrappersThin C++-flavored wrappers for the CUDA Runtime API
Stars: ✭ 362 (+206.78%)
Cuarrays.jlA Curious Cumulation of CUDA Cuisine
Stars: ✭ 283 (+139.83%)
Gpu badmm mtBregman ADMM for mass transportation on GPU
Stars: ✭ 10 (-91.53%)
Deeppipe2Deep Learning library using GPU(CUDA/cuBLAS)
Stars: ✭ 90 (-23.73%)
CudaExperiments with CUDA and Rust
Stars: ✭ 31 (-73.73%)
Arrayfire PythonPython bindings for ArrayFire: A general purpose GPU library.
Stars: ✭ 358 (+203.39%)
FbcudaFacebook's CUDA extensions.
Stars: ✭ 275 (+133.05%)
Stn3d3D Spatial Transformer Network
Stars: ✭ 8 (-93.22%)
HemiSimple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.
Stars: ✭ 275 (+133.05%)
CudadtwGPU-Suite
Stars: ✭ 63 (-46.61%)
Go CyberYour 🔵 Superintelligence
Stars: ✭ 270 (+128.81%)
CupoissonCUDA implementation of the 2D fast Poisson solver
Stars: ✭ 7 (-94.07%)
Dynamicfusion Implementation of Newcombe et al. CVPR 2015 DynamicFusion paper
Stars: ✭ 267 (+126.27%)
Kinectfusionlib Implementation of the KinectFusion approach in modern C++14 and CUDA
Stars: ✭ 261 (+121.19%)
NeanderthalFast Clojure Matrix Library
Stars: ✭ 927 (+685.59%)
PopsiftPopSift is an implementation of the SIFT algorithm in CUDA.
Stars: ✭ 259 (+119.49%)
CutlassCUDA Templates for Linear Algebra Subroutines
Stars: ✭ 1,123 (+851.69%)
ThorAtmospheric fluid dynamics solver optimized for GPUs.
Stars: ✭ 23 (-80.51%)
K2FSA/FST algorithms, differentiable, with PyTorch compatibility.
Stars: ✭ 354 (+200%)
LuisaRenderHigh-Performance Multiple-Backend Renderer Based on LuisaCompute
Stars: ✭ 47 (-60.17%)
Sepconv Slomoan implementation of Video Frame Interpolation via Adaptive Separable Convolution using PyTorch
Stars: ✭ 918 (+677.97%)
WICWIUWICWIU(What I can Create is What I Understand)
Stars: ✭ 103 (-12.71%)
Tsne CudaGPU Accelerated t-SNE for CUDA with Python bindings
Stars: ✭ 1,120 (+849.15%)
ElasticfusionReal-time dense visual SLAM system
Stars: ✭ 1,298 (+1000%)
DeepjointfilterThe source code of ECCV16 'Deep Joint Image Filtering'.
Stars: ✭ 68 (-42.37%)
Cuda word splitThis project is an old code for Chinese words split. It is written by CUDA at 2010, so it could not run well directly under you platform without an GPU card.
Stars: ✭ 31 (-73.73%)
TutorialsSome basic programming tutorials
Stars: ✭ 353 (+199.15%)
SleefSIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Stars: ✭ 353 (+199.15%)
AtmosphereRealtime Client Server Framework for the JVM, supporting WebSockets with Cross-Browser Fallbacks
Stars: ✭ 3,552 (+2910.17%)
VisionarayA C++-based, cross platform ray tracing library
Stars: ✭ 342 (+189.83%)
Torch samplingEfficient reservoir sampling implementation for PyTorch
Stars: ✭ 68 (-42.37%)