pystella: A code generator for grid-based PDE solving on CPUs and GPUs
Stars: ✭ 18 (-58.14%)
hpc: Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc.)
Stars: ✭ 39 (-9.3%)
Amgcl: C++ library for solving large sparse linear systems with algebraic multigrid method
Stars: ✭ 390 (+806.98%)
John: John the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs
Stars: ✭ 5,656 (+13053.49%)
Kernels: This is a set of simple programs that can be used to explore the features of a parallel platform.
Stars: ✭ 287 (+567.44%)
Pyopencl: OpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+1737.21%)
Easylambda: Distributed dataflows with functional list operations for data processing with C++14
Stars: ✭ 475 (+1004.65%)
Aidlearning Framework: 🔥🔥AidLearning is a powerful mobile development platform. AidLearning builds a Linux environment supporting GUI, deep learning and a visual IDE on Android... Now Aid supports OpenCL (GPU+NPU) for high-performance acceleration... Linux on Android or HarmonyOS
Stars: ✭ 4,537 (+10451.16%)
Gpusorting: Implementation of a few sorting algorithms in OpenCL
Stars: ✭ 9 (-79.07%)
Coriander: Build NVIDIA® CUDA™ code for OpenCL™ 1.2 devices
Stars: ✭ 665 (+1446.51%)
Bytecoder: Rich Domain Model for JVM Bytecode and Framework to interpret and transpile it.
Stars: ✭ 401 (+832.56%)
Ctcdecoder: Connectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.
Stars: ✭ 529 (+1130.23%)
Tvm: Open deep learning compiler stack for CPUs, GPUs and specialized accelerators
Stars: ✭ 7,494 (+17327.91%)
Xray Oxygen: 🌀 Oxygen Engine 2.0. [Preview] Discord: https://discord.gg/P3aMf66
Stars: ✭ 481 (+1018.6%)
Prpl: The parallel Raster Processing Library (pRPL) is an MPI-enabled C++ programming library that provides easy-to-use interfaces to parallelize raster/image processing algorithms
Stars: ✭ 15 (-65.12%)
Mace: MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.
Stars: ✭ 4,536 (+10448.84%)
Tf Coriander: OpenCL 1.2 implementation for TensorFlow
Stars: ✭ 775 (+1702.33%)
Faasm: High-performance stateful serverless runtime based on WebAssembly
Stars: ✭ 403 (+837.21%)
Vc4cl: OpenCL implementation running on the VideoCore IV GPU of the Raspberry Pi models
Stars: ✭ 611 (+1320.93%)
Clspv: Clspv is a prototype compiler for a subset of OpenCL C to Vulkan compute shaders
Stars: ✭ 381 (+786.05%)
Hipsycl: Implementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (+776.74%)
Esmpy Tutorial: Basic tutorial for ESMPy Python package
Stars: ✭ 22 (-48.84%)
Compute Runtime: Intel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver
Stars: ✭ 593 (+1279.07%)
Loopy: A code generator for array-based code on CPUs and GPUs
Stars: ✭ 367 (+753.49%)
Pocl: pocl - Portable Computing Language
Stars: ✭ 537 (+1148.84%)
Mpimemu: MPI Memory Consumption Utilities
Stars: ✭ 17 (-60.47%)
Elmerfem: Official git repository of Elmer FEM software
Stars: ✭ 523 (+1116.28%)
Dflo: Discontinuous Galerkin solver for compressible flows
Stars: ✭ 31 (-27.91%)
Arraymancer: A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, CUDA and OpenCL backends
Stars: ✭ 793 (+1744.19%)
Tornadovm: TornadoVM is a practical and efficient heterogeneous programming framework for managed languages
Stars: ✭ 479 (+1013.95%)
Openclpapers: A Collection of Articles and other OpenCL Papers
Stars: ✭ 37 (-13.95%)
Ucx: Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
Stars: ✭ 471 (+995.35%)
Pipecnn: An OpenCL-based FPGA Accelerator for Convolutional Neural Networks
Stars: ✭ 775 (+1702.33%)
Bitcracker: BitCracker is the first open source password cracking tool for memory units encrypted with BitLocker
Stars: ✭ 463 (+976.74%)
Pp Mm A03: Parallel Processing - Matrix Multiplication (Cannon, DNS, LUdecomp)
Stars: ✭ 12 (-72.09%)
Libtommath: LibTomMath is a free open source portable number theoretic multiple-precision integer library written entirely in C.
Stars: ✭ 438 (+918.6%)
Juice: The Hacker's Machine Learning Engine
Stars: ✭ 743 (+1627.91%)
Chlorine: Dead Simple OpenCL
Stars: ✭ 419 (+874.42%)
Soul Engine: Physically based renderer and simulation engine for real-time applications.
Stars: ✭ 37 (-13.95%)
Vexcl: VexCL is a C++ vector expression template library for OpenCL/CUDA/OpenMP
Stars: ✭ 626 (+1355.81%)
Trisycl: Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group
Stars: ✭ 354 (+723.26%)
Mpi4py: Python bindings for MPI
Stars: ✭ 388 (+802.33%)
Neanderthal: Fast Clojure Matrix Library
Stars: ✭ 927 (+2055.81%)
Qcgpu: High Performance Tools for Quantum Computing
Stars: ✭ 380 (+783.72%)
Luxcore: LuxCore source repository
Stars: ✭ 601 (+1297.67%)
Ilgpu: ILGPU JIT Compiler for high-performance .NET GPU programs
Stars: ✭ 374 (+769.77%)
Qball: Qball (also known as qb@ll) is a first-principles molecular dynamics code that is used to compute the electronic structure of atoms, molecules, solids, and liquids within the Density Functional Theory (DFT) formalism. It is a fork of the Qbox code by Francois Gygi.
Stars: ✭ 33 (-23.26%)
Arrayfire Python: Python bindings for ArrayFire: A general purpose GPU library.
Stars: ✭ 358 (+732.56%)
Clblast: Tuned OpenCL BLAS
Stars: ✭ 559 (+1200%)
Simpleopenclsamples: Simple OpenCL Samples that Build with Khronos Headers and Libs
Stars: ✭ 22 (-48.84%)
Aparapi: The New Official Aparapi: a framework for executing native Java and Scala code on the GPU.
Stars: ✭ 352 (+718.6%)
Kratos: Kratos Multiphysics (A.K.A Kratos) is a framework for building parallel multi-disciplinary simulation software. Modularity, extensibility and HPC are the main objectives. Kratos has a BSD license and is written in C++ with an extensive Python interface.
Stars: ✭ 558 (+1197.67%)
Bayadera: High-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (+695.35%)
Ohpc: OpenHPC Integration, Packaging, and Test Repo
Stars: ✭ 544 (+1165.12%)
Opencl 101: Learn OpenCL step by step.
Stars: ✭ 43 (+0%)
Gpwfc: OpenCL-accelerated Python implementation of the Wave Function Collapse procgen algorithm
Stars: ✭ 37 (-13.95%)