Amplifier.NETAmplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPU/GPU, NVIDIA, AMD without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.
Stars: ✭ 142 (-84.68%)
rectdetectRealtime rectangle detector with GPGPU
Stars: ✭ 51 (-94.5%)
CLUThe OpenCL Utility library
Stars: ✭ 18 (-98.06%)
dlprimitivesDeep Learning Primitives and Mini-Framework for OpenCL
Stars: ✭ 65 (-92.99%)
john-packagesCommunity packages of John the Ripper (a Docker image, a Flatpak, a Windows PortableApp, and Ubuntu SNAP packages)
Stars: ✭ 31 (-96.66%)
RayTracingRealtime GPU Path tracer based on OpenCL and OpenGL
Stars: ✭ 120 (-87.06%)
sycl-benchSYCL Benchmark Suite
Stars: ✭ 30 (-96.76%)
komputeGeneral purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing usecases. Backed by the Linux Foundation.
Stars: ✭ 872 (-5.93%)
nodeGPU-accelerated data science and visualization in node
Stars: ✭ 85 (-90.83%)
gardeniaGARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators
Stars: ✭ 22 (-97.63%)
EtalerA flexable HTM (Hierarchical Temporal Memory) framework with full GPU support.
Stars: ✭ 79 (-91.48%)
HeCBenchsoftware.intel.com/content/www/us/en/develop/articles/repo-evaluating-performance-productivity-oneapi.html
Stars: ✭ 85 (-90.83%)
ctuning-programsCollective Knowledge extension with unified and customizable benchmarks (with extensible JSON meta information) to be easily integrated with customizable and portable Collective Knowledge workflows. You can easily compile and run these benchmarks using different compilers, environments, hardware and OS (Linux, MacOS, Windows, Android). More info:
Stars: ✭ 41 (-95.58%)
peakperfAchieve peak performance on x86 CPUs and NVIDIA GPUs
Stars: ✭ 33 (-96.44%)
gpubootcampThis repository consists for gpu bootcamp material for HPC and AI
Stars: ✭ 227 (-75.51%)
FGPUNo description or website provided.
Stars: ✭ 30 (-96.76%)
slibsSingle file libraries for C/C++
Stars: ✭ 80 (-91.37%)
briefmatchBriefMatch real-time GPU optical flow
Stars: ✭ 36 (-96.12%)
FLAMEGPU2FLAME GPU 2 is a GPU accelerated agent based modelling framework for C++ and Python
Stars: ✭ 25 (-97.3%)
hipercHigh Performance Computing Strategies for Boundary Value Problems
Stars: ✭ 36 (-96.12%)
allgebraBase container for developing C++ and Fortran HPC applications
Stars: ✭ 14 (-98.49%)
Scikit CudaPython interface to GPU-powered libraries
Stars: ✭ 803 (-13.38%)
Fat-CloudsGPU Fluid Simulation with Volumetric Rendering
Stars: ✭ 81 (-91.26%)
SoliditySHA3MinerAll-in-one mixed multi-GPU (nVidia, AMD, Intel) & CPU miner solves proof of work to mine supported EIP918 tokens in a single instance (with API).
Stars: ✭ 28 (-96.98%)
ufo-coreGLib-based framework for GPU-based data processing
Stars: ✭ 20 (-97.84%)
warpcontinuous energy monte carlo neutron transport in general geometries on GPUs
Stars: ✭ 27 (-97.09%)
TvmOpen deep learning compiler stack for cpu, gpu and specialized accelerators
Stars: ✭ 7,494 (+708.41%)
PbfVsImplementation of Macklin, Miles, and Matthias Müller. "Position based fluids.". Visual Studio 2015 + CUDA 8.0
Stars: ✭ 100 (-89.21%)
lbvhan implementation of parallel linear BVH (LBVH) on GPU
Stars: ✭ 67 (-92.77%)
tiny-cuda-nnLightning fast & tiny C++/CUDA neural network framework
Stars: ✭ 908 (-2.05%)
opencv-cuda-dockerDockerfiles for OpenCV compiled with CUDA, opencv_contrib modules and Python 3 bindings
Stars: ✭ 55 (-94.07%)
XLearning-GPUqihoo360 xlearning with GPU support; AI on Hadoop
Stars: ✭ 22 (-97.63%)
QPT[内测中]前向式Python环境快捷封装工具,快速将Python打包为EXE并添加CUDA、NoAVX等支持。
Stars: ✭ 308 (-66.77%)
crowdsource-video-experiments-on-androidCrowdsourcing video experiments (such as collaborative benchmarking and optimization of DNN algorithms) using Collective Knowledge Framework across diverse Android devices provided by volunteers. Results are continuously aggregated in the open repository:
Stars: ✭ 29 (-96.87%)
MOTMulti-threaded Optimization Toolbox
Stars: ✭ 28 (-96.98%)
PopsiftPopSift is an implementation of the SIFT algorithm in CUDA.
Stars: ✭ 259 (-72.06%)
gpu-monitorScript to remotely check GPU servers for free GPUs
Stars: ✭ 85 (-90.83%)
GprmaxgprMax is open source software that simulates electromagnetic wave propagation using the Finite-Difference Time-Domain (FDTD) method for numerical modelling of Ground Penetrating Radar (GPR)
Stars: ✭ 268 (-71.09%)
WheelsPerformance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (-3.88%)
LibrecLibRec: A Leading Java Library for Recommender Systems, see
Stars: ✭ 3,045 (+228.48%)
BlendluxcoreBlender Integration for LuxCore
Stars: ✭ 287 (-69.04%)
articThe AlteRnaTive Impala Compiler
Stars: ✭ 16 (-98.27%)
LuisaRenderHigh-Performance Multiple-Backend Renderer Based on LuisaCompute
Stars: ✭ 47 (-94.93%)
HemiSimple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.
Stars: ✭ 275 (-70.33%)
Deep DiamondA fast Clojure Tensor & Deep Learning library
Stars: ✭ 288 (-68.93%)
KomputationKomputation is a neural network framework for the Java Virtual Machine written in Kotlin and CUDA C.
Stars: ✭ 295 (-68.18%)
Vulkan KomputeGeneral purpose GPU compute framework for cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing usecases.
Stars: ✭ 350 (-62.24%)
ThrustThe C++ parallel algorithms library.
Stars: ✭ 3,595 (+287.81%)
SleefSIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Stars: ✭ 353 (-61.92%)
Trisycl Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group
Stars: ✭ 354 (-61.81%)
MarianFast Neural Machine Translation in C++
Stars: ✭ 777 (-16.18%)
Fast gicpA collection of GICP-based fast point cloud registration algorithms
Stars: ✭ 307 (-66.88%)
TutorialsSome basic programming tutorials
Stars: ✭ 353 (-61.92%)
LoopyA code generator for array-based code on CPUs and GPUs
Stars: ✭ 367 (-60.41%)
Cuda.jlCUDA programming in Julia.
Stars: ✭ 370 (-60.09%)
Armadillo CodeArmadillo: fast C++ library for linear algebra & scientific computing - http://arma.sourceforge.net
Stars: ✭ 388 (-58.14%)
H2o4gpuH2Oai GPU Edition
Stars: ✭ 416 (-55.12%)
Tf CorianderOpenCL 1.2 implementation for Tensorflow
Stars: ✭ 775 (-16.4%)
CudfcuDF - GPU DataFrame Library
Stars: ✭ 4,370 (+371.41%)