1274 Open source projects that are alternatives to or similar to Ctranslate2

Arraymancer
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+466.43%)
Mutual labels:  parallel-computing, openmp, cudnn, cuda
Onednn
oneAPI Deep Neural Network Library (oneDNN)
Stars: ✭ 2,600 (+1757.14%)
Mutual labels:  openmp, deep-neural-networks, avx2
Guided Missile Simulation
Guided Missile, Radar and Infrared EOS Simulation Framework written in Fortran.
Stars: ✭ 33 (-76.43%)
Mutual labels:  openmp, avx, avx2
Corrfunc
⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Stars: ✭ 114 (-18.57%)
Mutual labels:  openmp, avx2, avx
Hybridizer Basic Samples
Examples of C# code compiled to GPU by hybridizer
Stars: ✭ 186 (+32.86%)
Mutual labels:  cuda, avx2, avx
Tensorflow Optimized Wheels
TensorFlow wheels built for latest CUDA/CuDNN and enabled performance flags: SSE, AVX, FMA; XLA
Stars: ✭ 118 (-15.71%)
Mutual labels:  cudnn, cuda, avx2
Wheels
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+536.43%)
Mutual labels:  cuda, avx2, avx
Vc
SIMD Vector Classes for C++
Stars: ✭ 985 (+603.57%)
Mutual labels:  parallel-computing, avx2, avx
mbsolve
An open-source solver tool for the Maxwell-Bloch equations.
Stars: ✭ 14 (-90%)
Mutual labels:  openmp, parallel-computing, cuda
Boost.simd
Boost SIMD
Stars: ✭ 238 (+70%)
Mutual labels:  parallel-computing, avx2, avx
Nsimd
Agenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (-1.43%)
Mutual labels:  cuda, avx2, avx
crowdsource-video-experiments-on-android
Crowdsourcing video experiments (such as collaborative benchmarking and optimization of DNN algorithms) using Collective Knowledge Framework across diverse Android devices provided by volunteers. Results are continuously aggregated in the open repository:
Stars: ✭ 29 (-79.29%)
Mutual labels:  openmp, cuda
Corium
Corium is a modern scripting language which combines simple, safe and efficient programming.
Stars: ✭ 18 (-87.14%)
Mutual labels:  parallel-computing, avx
CPURasterizer
CPU Based Rasterizer Engine
Stars: ✭ 99 (-29.29%)
Mutual labels:  parallel-computing, avx2
Distiller
Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller
Stars: ✭ 3,760 (+2585.71%)
gpu-monitor
Script to remotely check GPU servers for free GPUs
Stars: ✭ 85 (-39.29%)
Mutual labels:  cuda, cudnn
Sleef
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Stars: ✭ 353 (+152.14%)
Mutual labels:  cuda, avx
Amgcl
C++ library for solving large sparse linear systems with algebraic multigrid method
Stars: ✭ 390 (+178.57%)
Mutual labels:  openmp, cuda
Graffitist
Graph Transforms to Quantize and Retrain Deep Neural Nets in TensorFlow
Stars: ✭ 135 (-3.57%)
Awesome Emdl
Embedded and mobile deep learning research resources
Stars: ✭ 554 (+295.71%)
Taskflow
A General-purpose Parallel and Heterogeneous Task Programming System
Stars: ✭ 6,128 (+4277.14%)
Mutual labels:  parallel-computing, cuda
Pyopencl
OpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+464.29%)
Mutual labels:  parallel-computing, cuda
peakperf
Achieve peak performance on x86 CPUs and NVIDIA GPUs
Stars: ✭ 33 (-76.43%)
Mutual labels:  cuda, avx
Aimet
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
Stars: ✭ 453 (+223.57%)
Chainer
A flexible framework of neural networks for deep learning
Stars: ✭ 5,656 (+3940%)
Mutual labels:  cudnn, cuda
Directxmath
DirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps
Stars: ✭ 859 (+513.57%)
Mutual labels:  avx2, avx
Arch-Data-Science
Archlinux PKGBUILDs for Data Science, Machine Learning, Deep Learning, NLP and Computer Vision
Stars: ✭ 92 (-34.29%)
Mutual labels:  cuda, cudnn
monolish
monolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
Stars: ✭ 166 (+18.57%)
Mutual labels:  openmp, cuda
hero-sdk
⛔ DEPRECATED ⛔ HERO Software Development Kit
Stars: ✭ 21 (-85%)
Mutual labels:  openmp, parallel-computing
Sockeye
Sequence-to-sequence framework with a focus on Neural Machine Translation based on Apache MXNet
Stars: ✭ 990 (+607.14%)
Kernels
This is a set of simple programs that can be used to explore the features of a parallel platform.
Stars: ✭ 287 (+105%)
Mutual labels:  parallel-computing, openmp
Deep Diamond
A fast Clojure Tensor & Deep Learning library
Stars: ✭ 288 (+105.71%)
Mutual labels:  deep-neural-networks, cuda
Mini Caffe
Minimal runtime core of Caffe, Forward only, GPU support and Memory efficiency.
Stars: ✭ 373 (+166.43%)
Mutual labels:  cudnn, cuda
gpubootcamp
This repository consists of GPU bootcamp material for HPC and AI
Stars: ✭ 227 (+62.14%)
Mutual labels:  openmp, cuda
Cupy
NumPy & SciPy for GPU
Stars: ✭ 5,625 (+3917.86%)
Mutual labels:  cudnn, cuda
Stdgpu
stdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+279.29%)
Mutual labels:  openmp, cuda
Kratos
Kratos Multiphysics (A.K.A Kratos) is a framework for building parallel multi-disciplinary simulation software. Modularity, extensibility and HPC are the main objectives. Kratos has BSD license and is written in C++ with extensive Python interface.
Stars: ✭ 558 (+298.57%)
Mutual labels:  parallel-computing, openmp
Libxsmm
Library for specialized dense and sparse matrix operations, and deep learning primitives.
Stars: ✭ 518 (+270%)
Mutual labels:  avx2, avx
Marian
Fast Neural Machine Translation in C++
Stars: ✭ 777 (+455%)
Mutual labels:  neural-machine-translation, cuda
Accelerate
Embedded language for high-performance array computations
Stars: ✭ 751 (+436.43%)
Mutual labels:  parallel-computing, cuda
Nbody
N body gravity attraction problem solver
Stars: ✭ 40 (-71.43%)
Mutual labels:  openmp, cuda
Simde
Implementations of SIMD instruction sets for systems which don't natively support them.
Stars: ✭ 1,012 (+622.86%)
Mutual labels:  avx2, avx
Accelerate Llvm
LLVM backend for Accelerate
Stars: ✭ 134 (-4.29%)
Mutual labels:  parallel-computing, cuda
Nvidia libs test
Tests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (-74.29%)
Mutual labels:  cudnn, cuda
Sixtyfour
How fast can we brute force a 64-bit comparison?
Stars: ✭ 41 (-70.71%)
Mutual labels:  cuda, avx2
Simple Sh Datascience
A collection of Bash scripts and Dockerfiles to install data science tools, libraries and applications
Stars: ✭ 32 (-77.14%)
Mutual labels:  cudnn, cuda
Unisimd Assembler
SIMD macro assembler unified for ARM, MIPS, PPC and x86
Stars: ✭ 63 (-55%)
Mutual labels:  avx2, avx
Gdax Orderbook Ml
Application of machine learning to the Coinbase (GDAX) orderbook
Stars: ✭ 60 (-57.14%)
Mutual labels:  deep-neural-networks, cuda
Quadray Engine
Realtime raytracer using SIMD on ARM, MIPS, PPC and x86
Stars: ✭ 13 (-90.71%)
Mutual labels:  avx2, avx
Umesimd
UME::SIMD A library for explicit simd vectorization.
Stars: ✭ 66 (-52.86%)
Mutual labels:  avx2, avx
Singularity Tutorial
Tutorial for using Singularity containers
Stars: ✭ 46 (-67.14%)
Mutual labels:  cudnn, cuda
Openmp Examples
openmp examples
Stars: ✭ 64 (-54.29%)
Mutual labels:  parallel-computing, openmp
Marian Dev
Fast Neural Machine Translation in C++ - development repository
Stars: ✭ 136 (-2.86%)
Mutual labels:  neural-machine-translation, cuda
Tutorial Ubuntu 18.04 Install Nvidia Driver And Cuda And Cudnn And Build Tensorflow For Gpu
Ubuntu 18.04: how to install the Nvidia driver, CUDA and cuDNN and build TensorFlow for GPU, step by step on the command line
Stars: ✭ 91 (-35%)
Mutual labels:  cudnn, cuda
Aurora
A minimal deep learning library written in Python/Cython/C++ and Numpy/CUDA/cuDNN.
Stars: ✭ 90 (-35.71%)
Mutual labels:  cudnn, cuda
Pytorchnlpbook
Code and data accompanying Natural Language Processing with PyTorch published by O'Reilly Media https://nlproc.info
Stars: ✭ 1,390 (+892.86%)
Simd
C++ image processing and machine learning library using SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX (Altivec) and VSX (Power7), NEON for ARM.
Stars: ✭ 1,263 (+802.14%)
Mutual labels:  avx2, avx
Tensorflow Object Detection Tutorial
The purpose of this tutorial is to learn how to install and prepare TensorFlow framework to train your own convolutional neural network object detection classifier for multiple objects, starting from scratch
Stars: ✭ 113 (-19.29%)
Mutual labels:  cudnn, cuda
allgebra
Base container for developing C++ and Fortran HPC applications
Stars: ✭ 14 (-90%)
Mutual labels:  openmp, cuda
FGPU
No description or website provided.
Stars: ✭ 30 (-78.57%)
Mutual labels:  openmp, cuda