Qualia2.0
Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.
Stars: ✭ 41 (-81.94%)
Neanderthal
Fast Clojure Matrix Library
Stars: ✭ 927 (+308.37%)
Pycuda
CUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (+389.87%)
Arboretum
Gradient Boosting powered by GPU (NVIDIA CUDA)
Stars: ✭ 64 (-71.81%)
Wheels
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+292.51%)
HeCBench
software.intel.com/content/www/us/en/develop/articles/repo-evaluating-performance-productivity-oneapi.html
Stars: ✭ 85 (-62.56%)
Mpr
Reference implementation for "Massively Parallel Rendering of Complex Closed-Form Implicit Surfaces" (SIGGRAPH 2020)
Stars: ✭ 84 (-63%)
Optix Pathtracer
Simple physically based path tracer based on Nvidia's Optix Ray Tracing Engine
Stars: ✭ 231 (+1.76%)
Plotoptix
Data visualisation in Python based on OptiX 7.2 ray tracing framework.
Stars: ✭ 252 (+11.01%)
Numer
Numeric Erlang - vector and matrix operations with CUDA. Heavily inspired by Pteracuda - https://github.com/kevsmith/pteracuda
Stars: ✭ 91 (-59.91%)
Deeppipe2
Deep Learning library using GPU (CUDA/cuBLAS)
Stars: ✭ 90 (-60.35%)
EFDCPlus
www.eemodelingsystem.com
Stars: ✭ 9 (-96.04%)
Scikit Cuda
Python interface to GPU-powered libraries
Stars: ✭ 803 (+253.74%)
pbdML
No description or website provided.
Stars: ✭ 13 (-94.27%)
Mixbench
A GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL)
Stars: ✭ 130 (-42.73%)
Tensorflow Object Detection Tutorial
The purpose of this tutorial is to learn how to install and prepare the TensorFlow framework to train your own convolutional neural network object detection classifier for multiple objects, starting from scratch
Stars: ✭ 113 (-50.22%)
Optical Flow Filter
A real-time optical flow algorithm implemented on GPU
Stars: ✭ 146 (-35.68%)
Remotery
Single C file, Realtime CPU/GPU Profiler with Remote Web Viewer
Stars: ✭ 1,908 (+740.53%)
Xmrminer
🐜 A CUDA based miner for Monero
Stars: ✭ 158 (-30.4%)
Blazingsql
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
Stars: ✭ 1,652 (+627.75%)
Mpich
Official MPICH Repository
Stars: ✭ 275 (+21.15%)
Jcuda
JCuda - Java bindings for CUDA
Stars: ✭ 165 (-27.31%)
Ssd Gpu Dma
Build userspace NVMe drivers and storage applications with CUDA support
Stars: ✭ 172 (-24.23%)
Primitiv
A Neural Network Toolkit.
Stars: ✭ 164 (-27.75%)
Nvidia Docker
Build and run Docker containers leveraging NVIDIA GPUs
Stars: ✭ 13,961 (+6050.22%)
Simplegpuhashtable
A simple GPU hash table implemented in CUDA using lock-free techniques
Stars: ✭ 198 (-12.78%)
Pyopencl
OpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+248.02%)
Cupoch
Robotics with GPU computing
Stars: ✭ 225 (-0.88%)
bolt
Official BOLT Repository
Stars: ✭ 19 (-91.63%)
HiSpatialCluster
Clustering spatial points with the Fast Search algorithm, with high-performance computing implementations in CUDA or in parallel on CPU, runnable as standalone Python or in ArcGIS.
Stars: ✭ 31 (-86.34%)
Ohpc
OpenHPC Integration, Packaging, and Test Repo
Stars: ✭ 544 (+139.65%)
Ktt
Kernel Tuning Toolkit
Stars: ✭ 33 (-85.46%)
Compute
A C++ GPU Computing Library for OpenCL
Stars: ✭ 1,192 (+425.11%)
Nvidia Modded Inf
Modified NVIDIA .inf files to run drivers on all video cards, research & telemetry-free drivers
Stars: ✭ 227 (+0%)
Armadillo Code
Armadillo: fast C++ library for linear algebra & scientific computing - http://arma.sourceforge.net
Stars: ✭ 388 (+70.93%)
Ompi
Open MPI main development repository
Stars: ✭ 1,221 (+437.89%)
Pymapd
Python client for OmniSci GPU-accelerated SQL engine and analytics platform
Stars: ✭ 109 (-51.98%)
Easylambda
Distributed dataflows with functional list operations for data processing with C++14
Stars: ✭ 475 (+109.25%)
Ucx
Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
Stars: ✭ 471 (+107.49%)
pystella
A code generator for grid-based PDE solving on CPUs and GPUs
Stars: ✭ 18 (-92.07%)
Umpire
An application-focused API for memory management on NUMA & GPU architectures
Stars: ✭ 154 (-32.16%)
Relion
Image-processing software for cryo-electron microscopy
Stars: ✭ 219 (-3.52%)
hp2p
Heavy Peer To Peer: an MPI-based benchmark for network diagnostics
Stars: ✭ 17 (-92.51%)
Ginkgo
Numerical linear algebra software package
Stars: ✭ 149 (-34.36%)
Nsimd
Agenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (-39.21%)
Hpcinfo
Information about many aspects of high-performance computing. Wiki content moved to ~/docs.
Stars: ✭ 171 (-24.67%)
Dash
DASH, the C++ Template Library for Distributed Data Structures with Support for Hierarchical Locality for HPC and Data-Driven Science
Stars: ✭ 134 (-40.97%)
azurehpc
This repository provides easy automation scripts for building an HPC environment in Azure. It also includes examples for building an end-to-end environment and running some of the key HPC benchmarks and applications.
Stars: ✭ 102 (-55.07%)
Core
Parallel finite element unstructured meshes
Stars: ✭ 124 (-45.37%)
gslib
Sparse communication library
Stars: ✭ 22 (-90.31%)
euler2d kokkos
Simple 2D finite volume solver for Euler equations using the C++ Kokkos library
Stars: ✭ 27 (-88.11%)
t8code
Parallel algorithms and data structures for tree-based AMR with arbitrary element shapes.
Stars: ✭ 37 (-83.7%)
az-hop
The Azure HPC On-Demand Platform provides an HPC Cluster Ready solution
Stars: ✭ 33 (-85.46%)
Gunrock
High-Performance Graph Primitives on GPUs
Stars: ✭ 718 (+216.3%)
Marian
Fast Neural Machine Translation in C++
Stars: ✭ 777 (+242.29%)
Genomeworks
SDK for GPU accelerated genome assembly and analysis
Stars: ✭ 215 (-5.29%)
hpc
Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc.)
Stars: ✭ 39 (-82.82%)