All Projects → ARM-software → Computelibrary

ARM-software / Computelibrary

Licence: mit
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.

Programming Languages

C++
36643 projects - #6 most used programming language
c
50402 projects - #5 most used programming language

Projects that are alternatives of or similar to Computelibrary

Sse2neon
A translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation
Stars: ✭ 316 (-85.12%)
Mutual labels:  arm, simd, aarch64, armv8, neon
alpine-qbittorrent-openvpn
qBittorrent docker container with OpenVPN client running as unprivileged user on alpine linux
Stars: ✭ 230 (-89.17%)
Mutual labels:  arm, armv7, aarch64, armv8
Simdjson
Parsing gigabytes of JSON per second
Stars: ✭ 15,115 (+611.96%)
Mutual labels:  arm, simd, aarch64, neon
Sleef
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Stars: ✭ 353 (-83.37%)
Mutual labels:  arm, simd, aarch64, neon
Unisimd Assembler
SIMD macro assembler unified for ARM, MIPS, PPC and x86
Stars: ✭ 63 (-97.03%)
Mutual labels:  simd, aarch64, armv7, neon
tensorflow-serving-arm
TensorFlow Serving ARM - A project for cross-compiling TensorFlow Serving targeting popular ARM cores
Stars: ✭ 75 (-96.47%)
Mutual labels:  arm, armv7, aarch64, armv8
Compute Engine
Highly optimized inference engine for Binarized Neural Networks
Stars: ✭ 138 (-93.5%)
Mutual labels:  simd, aarch64, armv7, armv8
TensorFlow Lite SSD RPi 64-bits
TensorFlow Lite SSD on bare Raspberry Pi 4 with 64-bit OS at 24 FPS
Stars: ✭ 25 (-98.82%)
Mutual labels:  armv7, aarch64, armv8
utf8
Fast UTF-8 validation with range algorithm (NEON+SSE4+AVX2)
Stars: ✭ 60 (-97.17%)
Mutual labels:  arm, neon, simd
focalboard-docker
Cross platform Docker images for Focalboard.
Stars: ✭ 12 (-99.43%)
Mutual labels:  arm, armv7, armv8
Raspberrypipkg
DEPRECATED - DO NOT USE | Go here instead ->
Stars: ✭ 758 (-64.3%)
Mutual labels:  arm, aarch64, armv8
Rappel
A linux-based assembly REPL for x86, amd64, armv7, and armv8
Stars: ✭ 818 (-61.47%)
Mutual labels:  aarch64, armv7, armv8
Arm Assembly Cheat
MOVED TO: https://github.com/cirosantilli/linux-kernel-module-cheat#userland-assembly SEE README. ARMv7 and ARMv8 assembly userland minimal examples tutorial. Runnable asserts on x86 hosts with QEMU user mode or natively on ARM targets. Nice GDB step debug setup. Tested on Ubuntu 18.04 host and Raspberry Pi 2 and 3 targets.
Stars: ✭ 159 (-92.51%)
Mutual labels:  arm, armv7, armv8
simdutf8
SIMD-accelerated UTF-8 validation for Rust.
Stars: ✭ 426 (-79.93%)
Mutual labels:  neon, simd, aarch64
simonpi
A quick & dirty script to emulate Raspberry PI family devices on your laptop.
Stars: ✭ 61 (-97.13%)
Mutual labels:  armv7, aarch64, armv8
dnnl aarch64
No description or website provided.
Stars: ✭ 44 (-97.93%)
Mutual labels:  aarch64, armv8, sve
tensorflow-aarch64
Compiled tensorflow for aarch64 architecture
Stars: ✭ 20 (-99.06%)
Mutual labels:  arm, aarch64, armv8
Synestiaos
The Synestia Operating System
Stars: ✭ 159 (-92.51%)
Mutual labels:  arm, armv7, armv8
Simd
C++ image processing and machine learning library with using of SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX(Altivec) and VSX(Power7), NEON for ARM.
Stars: ✭ 1,263 (-40.51%)
Mutual labels:  arm, simd, neon
Simde
Implementations of SIMD instruction sets for systems which don't natively support them.
Stars: ✭ 1,012 (-52.33%)
Mutual labels:  arm, simd, neon



Compute Library

The Compute Library is a collection of low-level machine learning functions optimized for Arm® Cortex®-A and Arm® Mali™ GPUs architectures.

The library provides superior performance to other open source alternatives and immediate support for new Arm® technologies e.g. SVE2.

Key Features:

  • Open source software available under a permissive MIT license
  • Over 100 machine learning functions for CPU and GPU
  • Multiple convolution algorithms (GeMM, Winograd, FFT, Direct and indirect-GeMM)
  • Support for multiple data types: FP32, FP16, INT8, UINT8, BFLOAT16
  • Micro-architecture optimization for key ML primitives
  • Highly configurable build options enabling lightweight binaries
  • Advanced optimization techniques such as kernel fusion, Fast math enablement and texture utilization
  • Device and workload specific tuning using OpenCL tuner and GeMM optimized heuristics

Repository Link
Release https://github.com/arm-software/ComputeLibrary
Development https://review.mlplatform.org/#/admin/projects/ml/ComputeLibrary

Documentation

Documentation

Note: The documentation includes the reference API, changelogs, build guide, contribution guide, errata, etc.


Pre-built binaries

All the binaries can be downloaded from here or from the tables below.


Platform Operating System Release archive (Download)
Raspberry Pi 4 Linux 32bit
Raspberry Pi 4 Linux 64bit
Odroid N2 Linux 64bit
HiKey960 Linux 64bit

Architecture Operating System Release archive (Download)
armv7 Android
armv7 Linux
arm64-v8a Android
arm64-v8a Linux
arm64-v8.2-a Android
arm64-v8.2-a Linux

Supported Architectures/Technologies

  • Arm® CPUs:

    • Arm® Cortex®-A processor family using Arm® Neon™ technology
    • Arm® Cortex®-R processor family with Armv8-R AArch64 architecture using Arm® Neon™ technology
    • Arm® Cortex®-X1 processor using Arm® Neon™ technology
  • Arm® Mali™ GPUs:

    • Arm® Mali™-G processor family
    • Arm® Mali™-T processor family
  • x86


Supported Systems

  • Android™
  • Bare Metal
  • Linux®
  • macOS®
  • Tizen™

Resources


How to contribute

Contributions to the Compute Library are more than welcome. If you are interested on contributing, please have a look at our how to contribute guidelines.

Developer Certificate of Origin (DCO)

Before the Compute Library accepts your contribution, you need to certify its origin and give us your permission. To manage this process we use the Developer Certificate of Origin (DCO) V1.1 (https://developercertificate.org/)

To indicate that you agree to the the terms of the DCO, you "sign off" your contribution by adding a line with your name and e-mail address to every git commit message:

Signed-off-by: John Doe <[email protected]>

You must use your real name, no pseudonyms or anonymous contributions are accepted.

Public mailing list

For technical discussion, the ComputeLibrary project has a public mailing list: [email protected] The list is open to anyone inside or outside of Arm to self subscribe. In order to subscribe, please visit the following website: https://lists.linaro.org/mailman/listinfo/acl-dev


License and Contributions

The software is provided under MIT license. Contributions to this project are accepted under the same license.


Trademarks and Copyrights

Android is a trademark of Google LLC.

Arm, Cortex, Mali and Neon are registered trademarks or trademarks of Arm Limited (or its subsidiaries) in the US and/or elsewhere.

Linux® is the registered trademark of Linus Torvalds in the U.S. and other countries.

Mac and macOS are trademarks of Apple Inc., registered in the U.S. and other countries.

Tizen is a registered trademark of The Linux Foundation.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].