babel-plugin-file-loaderLike webpack's file-loader, but on server side. Allows for production-grade require('./file.png')
Stars: ✭ 36 (-67.27%)
sliceslice-rsA fast implementation of single-pattern substring search using SIMD acceleration.
Stars: ✭ 66 (-40%)
BytecoderRich Domain Model for JVM Bytecode and Framework to interpret and transpile it.
Stars: ✭ 401 (+264.55%)
FastA framework for GPU based high-performance medical image processing and visualization
Stars: ✭ 179 (+62.73%)
ClspvClspv is a prototype compiler for a subset of OpenCL C to Vulkan compute shaders
Stars: ✭ 381 (+246.36%)
Opencl 101Learn OpenCL step by step.
Stars: ✭ 43 (-60.91%)
Sha256 SimdAccelerate SHA256 computations in pure Go using Accelerate SHA256 computations in pure Go using AVX512, SHA Extensions for x86 and ARM64 for ARM. On AVX512 it provides an up to 8x improvement (over 3 GB/s per core). SHA Extensions give a performance boost of close to 4x over native.
Stars: ✭ 657 (+497.27%)
KaoYan 807北京邮电大学软件学院考研专业课笔记(2019年)
Stars: ✭ 31 (-71.82%)
SSE-GithubThis repository contains the demo app for the blog
Stars: ✭ 17 (-84.55%)
HipsyclImplementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (+242.73%)
Jsturbo.js - perform massive parallel computations in your browser with GPGPU.
Stars: ✭ 2,591 (+2255.45%)
42 cheatsheetAlso referred to as "The C Man"
Stars: ✭ 204 (+85.45%)
LoopyA code generator for array-based code on CPUs and GPUs
Stars: ✭ 367 (+233.64%)
Fastnoise2Modular node based noise generation library using SIMD, C++17 and templates
Stars: ✭ 196 (+78.18%)
OpenCLAdaAn Ada binding for the OpenCL host API
Stars: ✭ 15 (-86.36%)
Soul EnginePhysically based renderer and simulation engine for real-time applications.
Stars: ✭ 37 (-66.36%)
LaserThe HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers
Stars: ✭ 191 (+73.64%)
Trisycl Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group
Stars: ✭ 354 (+221.82%)
DecomposedCATransform3D manipulation made easy.
Stars: ✭ 184 (+67.27%)
PrimitivA Neural Network Toolkit.
Stars: ✭ 164 (+49.09%)
Ncnnncnn is a high-performance neural network inference framework optimized for the mobile platform
Stars: ✭ 13,376 (+12060%)
BayaderaHigh-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (+210.91%)
heyoka.pyPython library for ODE integration via Taylor's method and LLVM
Stars: ✭ 45 (-59.09%)
Jpeg QuantsmoothJPEG artifacts removal based on quantization coefficients.
Stars: ✭ 134 (+21.82%)
ArrayfireArrayFire: a general purpose GPU library.
Stars: ✭ 3,693 (+3257.27%)
ImpalaAn imperative and functional programming language
Stars: ✭ 118 (+7.27%)
KhivaAn open-source library of algorithms to analyse time series in GPU and CPU.
Stars: ✭ 161 (+46.36%)
ThorinThe Higher-Order Intermediate Representation
Stars: ✭ 116 (+5.45%)
Xmrig AmdMonero AMD (OpenCL) miner
Stars: ✭ 322 (+192.73%)
SketchC++ Implementations of sketch data structures with SIMD Parallelism, including Python bindings
Stars: ✭ 96 (-12.73%)
redis-subscribe-sseStream Redis "SUBSCRIBE" or "PSUBSCRIBE" events to browsers using HTML5 Server-Sent Events (SSE)
Stars: ✭ 45 (-59.09%)
FasterSIMD for humans
Stars: ✭ 1,304 (+1085.45%)
BlendluxcoreBlender Integration for LuxCore
Stars: ✭ 287 (+160.91%)
WideA crate to help you go wide. By which I mean use SIMD stuff.
Stars: ✭ 72 (-34.55%)
Tvm MaliOptimizing Mobile Deep Learning on ARM GPU with TVM
Stars: ✭ 156 (+41.82%)
Go CvComputer Vision package in pure Go taking advantage of SIMD acceleration
Stars: ✭ 66 (-40%)
ClOpenCL binding for Erlang
Stars: ✭ 282 (+156.36%)
learn-gpgpuAlgorithms implemented in CUDA + resources about GPGPU
Stars: ✭ 37 (-66.36%)
GpwfcopenCL-accelerated python implementation of the Wave Function Collapse procgen algorithm
Stars: ✭ 37 (-66.36%)
Parallel XxhashCompute xxHash hash codes for 8 keys in parallel
Stars: ✭ 36 (-67.27%)
ClojureclClojureCL is a Clojure library for parallel computations with OpenCL.
Stars: ✭ 266 (+141.82%)
Simdsetoperationstestbed for different SIMD implementations for set intersection and set union
Stars: ✭ 24 (-78.18%)
Amdovx CoreAMD OpenVX Core -- a sub-module of amdovx-modules:
Stars: ✭ 139 (+26.36%)
XnnpackHigh-efficiency floating-point neural network inference operators for mobile, server, and Web
Stars: ✭ 808 (+634.55%)
MOTMulti-threaded Optimization Toolbox
Stars: ✭ 28 (-74.55%)
LinqfasterLinq-like extension functions for Arrays, Span<T>, and List<T> that are faster and allocate less.
Stars: ✭ 615 (+459.09%)
memchrOptimized string search routines for Rust.
Stars: ✭ 474 (+330.91%)
caffe-android-opencl-fp16Optimised Caffe with OpenCL supporting for less powerful devices such as mobile phones
Stars: ✭ 17 (-84.55%)
goetheThreading and Caching Utilities for golang
Stars: ✭ 30 (-72.73%)
thread-poolBS::thread_pool: a fast, lightweight, and easy-to-use C++17 thread pool library
Stars: ✭ 1,043 (+848.18%)
vexed-generationPolymorphic helper functions & geometry ops for Houdini VEX / OpenCL
Stars: ✭ 32 (-70.91%)
restioHTTP Client for Dart inspired by OkHttp
Stars: ✭ 46 (-58.18%)
ThreadPool2Lightweight, Generic, Pure C++11 ThreadPool
Stars: ✭ 28 (-74.55%)
OpenclpapersA Collection of Articles and other OpenCL Papers
Stars: ✭ 37 (-66.36%)
CoriumCorium is a modern scripting language which combines simple, safe and efficient programming.
Stars: ✭ 18 (-83.64%)