qHilbertqHilbert is a vectorized speedup of Hilbert curve generation using SIMD intrinsics
Stars: ✭ 22 (-90.76%)
runtimeAnyDSL Runtime Library
Stars: ✭ 17 (-92.86%)
Chromium ClangChromium browser compiled with the Clang/LLVM compiler.
Stars: ✭ 77 (-67.65%)
FFmpegPlayerSimple FFmpeg video player
Stars: ✭ 72 (-69.75%)
simdjson-rsRust version of lemire's SimdJson
Stars: ✭ 18 (-92.44%)
UgmUbpa Graphics Mathematics
Stars: ✭ 178 (-25.21%)
CoriumCorium is a modern scripting language which combines simple, safe and efficient programming.
Stars: ✭ 18 (-92.44%)
block-alignerSIMD-accelerated library for computing global and X-drop affine gap penalty sequence-to-sequence or sequence-to-profile alignments using an adaptive block-based algorithm.
Stars: ✭ 58 (-75.63%)
awesome-simdA curated list of awesome SIMD frameworks, libraries and software
Stars: ✭ 39 (-83.61%)
BitmagicBitMagic Library
Stars: ✭ 263 (+10.5%)
Fastbase64SIMD-accelerated base64 codecs
Stars: ✭ 309 (+29.83%)
Asm DudeVisual Studio extension for assembly syntax highlighting and code completion in assembly files and the disassembly window
Stars: ✭ 3,898 (+1537.82%)
Cranium🤖 A portable, header-only, artificial neural network library written in C99
Stars: ✭ 501 (+110.5%)
hlmlvectorized high-level math library
Stars: ✭ 42 (-82.35%)
SimdjsonsharpC# bindings for lemire/simdjson (and full C# port)
Stars: ✭ 506 (+112.61%)
Sha256 SimdAccelerate SHA256 computations in pure Go using Accelerate SHA256 computations in pure Go using AVX512, SHA Extensions for x86 and ARM64 for ARM. On AVX512 it provides an up to 8x improvement (over 3 GB/s per core). SHA Extensions give a performance boost of close to 4x over native.
Stars: ✭ 657 (+176.05%)
HighwayhashNative Go version of HighwayHash with optimized assembly implementations on Intel and ARM. Able to process over 10 GB/sec on a single core on Intel CPUs - https://en.wikipedia.org/wiki/HighwayHash
Stars: ✭ 670 (+181.51%)
FastapproxApproximate and vectorized versions of common mathematical functions
Stars: ✭ 128 (-46.22%)
Ozz AnimationOpen source c++ skeletal animation library and toolset
Stars: ✭ 1,250 (+425.21%)
PackettracerThe SIMD-accelereted ray tracing in C# powered by Intel hardware intrinsic of .NET Core.
Stars: ✭ 109 (-54.2%)
ThorinThe Higher-Order Intermediate Representation
Stars: ✭ 116 (-51.26%)
WheelsPerformance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+274.37%)
Tensorflow Optimized WheelsTensorFlow wheels built for latest CUDA/CuDNN and enabled performance flags: SSE, AVX, FMA; XLA
Stars: ✭ 118 (-50.42%)
ImpalaAn imperative and functional programming language
Stars: ✭ 118 (-50.42%)
KlyngA message-passing distributed computing framework for node.js
Stars: ✭ 167 (-29.83%)
SundialsSUNDIALS is a SUite of Nonlinear and DIfferential/ALgebraic equation Solvers. This is a mirror of current releases, and development will move here eventually. Pull requests are welcome for bug fixes and minor changes.
Stars: ✭ 194 (-18.49%)
Facenet mtcnn to mobileconvert facenet and mtcnn models from tensorflow to tensorflow lite and coreml (使用 TFLite 将 FaceNet 和 MTCNN 移植到移动端)
Stars: ✭ 166 (-30.25%)
BigmachineBigmachine is a library for self-managing serverless computing in Go
Stars: ✭ 167 (-29.83%)
LibonnxA lightweight, portable pure C99 onnx inference engine for embedded devices with hardware acceleration support.
Stars: ✭ 217 (-8.82%)
Messenger For DesktopThis is not an official Facebook product, and is not affiliated with, or sponsored or endorsed by, Facebook.
Stars: ✭ 2,180 (+815.97%)
McsemaFramework for lifting x86, amd64, aarch64, sparc32, and sparc64 program binaries to LLVM bitcode
Stars: ✭ 2,198 (+823.53%)
AutocompletePersistent, simple, powerful and portable autocomplete library
Stars: ✭ 166 (-30.25%)
LaserThe HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers
Stars: ✭ 191 (-19.75%)
UnifrostMaking it easier to push pubsub events directly to the browser.
Stars: ✭ 166 (-30.25%)
Run On Arch ActionA Github Action that executes jobs/commands on non-x86 cpu architectures (ARMv6, ARMv7, aarch64, s390x, ppc64le)
Stars: ✭ 165 (-30.67%)
Feelpp💎 Feel++: Finite Element Embedded Language and Library in C++
Stars: ✭ 229 (-3.78%)
SftpgoFully featured and highly configurable SFTP server with optional HTTP, FTP/S and WebDAV support - S3, Google Cloud Storage, Azure Blob
Stars: ✭ 3,534 (+1384.87%)
Wagon免安裝可攜的 Laravel 開發環境
Stars: ✭ 189 (-20.59%)
Objfw[Official Mirror] A portable framework for the Objective-C language.
Stars: ✭ 161 (-32.35%)
Iboot64helperIDAPython loader to help with AArch64 iBoot, iBEC, and SecureROM reverse engineering
Stars: ✭ 189 (-20.59%)
SamraiStructured Adaptive Mesh Refinement Application Infrastructure - a scalable C++ framework for block-structured AMR application development
Stars: ✭ 160 (-32.77%)
OpentimerA High-performance Timing Analysis Tool for VLSI Systems
Stars: ✭ 213 (-10.5%)
Future.apply🚀 R package: future.apply - Apply Function to Elements in Parallel using Futures
Stars: ✭ 159 (-33.19%)
Neon Color SchemeA colorful bright-on-black color scheme for Sublime Text and TextMate. Its aim is to make as many languages as possible look as good as possible. Includes extended support for Python, Ruby, Clojure, JavaScript/JSON, C/C++, diff, HTML/XML, Markdown, PHP, CSS/SCSS/SASS, GitGutter, Find In Files, PackageDev, Regex, SublimeLinter, and much more.
Stars: ✭ 159 (-33.19%)
Atom PortablePortable version of the Atom text editor
Stars: ✭ 187 (-21.43%)
ShotgunFor the times you need more than just a gun.
Stars: ✭ 158 (-33.61%)
Jsturbo.js - perform massive parallel computations in your browser with GPGPU.
Stars: ✭ 2,591 (+988.66%)
NwchemNWChem: Open Source High-Performance Computational Chemistry
Stars: ✭ 227 (-4.62%)
GeniA Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-36.13%)
Cross“Zero setup” cross compilation and “cross testing” of Rust crates
Stars: ✭ 2,461 (+934.03%)
EmbbEmbedded Multicore Building Blocks (EMB²): Library for parallel programming of embedded systems. Star us on GitHub? +1
Stars: ✭ 153 (-35.71%)
JoblibComputing with Python functions.
Stars: ✭ 2,620 (+1000.84%)
HyperactiveA hyperparameter optimization and data collection toolbox for convenient and fast prototyping of machine-learning models.
Stars: ✭ 182 (-23.53%)
Swift On BalenaDocker images for Swift on Raspberry Pi and other ARM devices from balena's base images.
Stars: ✭ 153 (-35.71%)
CompactcnncascadeA binary library for very fast face detection using compact CNNs.
Stars: ✭ 152 (-36.13%)
DecomposedCATransform3D manipulation made easy.
Stars: ✭ 184 (-22.69%)
OpencoarraysA parallel application binary interface for Fortran 2018 compilers.
Stars: ✭ 151 (-36.55%)