All Projects → Audio Pretrained Model → Similar Projects or Alternatives

1764 Open source projects that are alternatives of or similar to Audio Pretrained Model

Dkeras
Distributed Keras Engine, Make Keras faster with only one line of code.
Stars: ✭ 181 (+196.72%)
Pytorch2keras
PyTorch to Keras model convertor
Stars: ✭ 676 (+1008.2%)
pytorch2keras
PyTorch to Keras model convertor
Stars: ✭ 788 (+1191.8%)
deepspeech.mxnet
A MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (+34.43%)
Sincnet
SincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+1152.46%)
Speech recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+9734.43%)
web-voice-processor
A library for real-time voice processing in web browsers
Stars: ✭ 69 (+13.11%)
Automatic Speech Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
Stars: ✭ 192 (+214.75%)
TF-Model-Deploy-Tutorial
A tutorial exploring multiple approaches to deploy a trained TensorFlow (or Keras) model or multiple models for prediction.
Stars: ✭ 51 (-16.39%)
Keras Sincnet
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-22.95%)
Tensorflow Open nsfw
Tensorflow Implementation of Yahoo's Open NSFW Model
Stars: ✭ 338 (+454.1%)
Mutual labels:  caffe, tensorflow-models
Tensorflow end2end speech recognition
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
Stars: ✭ 305 (+400%)
Surfboard
Novoic's audio feature extraction library
Stars: ✭ 318 (+421.31%)
Mutual labels:  audio, audio-processing
Predictive Maintenance Using Lstm
Example of Multiple Multivariate Time Series Prediction with LSTM Recurrent Neural Networks in Python with Keras.
Stars: ✭ 352 (+477.05%)
Mutual labels:  keras-tensorflow, keras-models
Awesome Kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (+544.26%)
Free Spoken Digit Dataset
A free audio dataset of spoken digits. Think MNIST for audio.
Stars: ✭ 396 (+549.18%)
Mutual labels:  speech-recognition, audio
Tensorflowasr
⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Stars: ✭ 400 (+555.74%)
Speech To Text Benchmark
speech to text benchmark framework
Stars: ✭ 481 (+688.52%)
Q
C++ Library for Audio Digital Signal Processing
Stars: ✭ 481 (+688.52%)
Mutual labels:  audio, audio-processing
Syn Speech
Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-6.56%)
Soundfingerprinting
Open source audio fingerprinting in .NET. An efficient algorithm for acoustic fingerprinting written purely in C#.
Stars: ✭ 554 (+808.2%)
Mutual labels:  audio, audio-processing
Rhino
On-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (+565.57%)
Silero Models
Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+755.74%)
Awesome Coreml Models
Largest list of models for Core ML (for iOS 11+)
Stars: ✭ 5,192 (+8411.48%)
Mutual labels:  caffe, tensorflow-models
Bidaf Keras
Bidirectional Attention Flow for Machine Comprehension implemented in Keras 2
Stars: ✭ 60 (-1.64%)
Mutual labels:  keras-tensorflow, keras-models
Pinto model zoo
A repository that shares tuning results of trained models generated by TensorFlow / Keras. Post-training quantization (Weight Quantization, Integer Quantization, Full Integer Quantization, Float16 Quantization), Quantization-aware training. TensorFlow Lite. OpenVINO. CoreML. TensorFlow.js. TF-TRT. MediaPipe. ONNX. [.tflite,.h5,.pb,saved_model,tfjs,tftrt,mlmodel,.xml/.bin, .onnx]
Stars: ✭ 634 (+939.34%)
Mutual labels:  caffe, tensorflow-models
Dali
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Stars: ✭ 3,624 (+5840.98%)
Mutual labels:  mxnet, audio-processing
Segmentation models
Segmentation models with pretrained backbones. Keras and TensorFlow Keras.
Stars: ✭ 3,575 (+5760.66%)
Mutual labels:  keras-tensorflow, keras-models
Deepspeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Stars: ✭ 18,680 (+30522.95%)
Phonetisaurus
Phonetisaurus G2P
Stars: ✭ 277 (+354.1%)
Musig
A shazam like tool to store songs fingerprints and retrieve them
Stars: ✭ 388 (+536.07%)
Mutual labels:  audio, audio-processing
Cheetah
On-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (+527.87%)
Auto Editor
Auto-Editor: Effort free video editing!
Stars: ✭ 382 (+526.23%)
Mutual labels:  audio, audio-processing
Deep Learning Model Convertor
The convertor/conversion of deep learning models for different deep learning frameworks/softwares.
Stars: ✭ 3,044 (+4890.16%)
Mutual labels:  caffe, mxnet
Speech Demo
语音api示例
Stars: ✭ 454 (+644.26%)
Voice Overlay Ios
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (+621.31%)
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+703.28%)
Asrt speechrecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Stars: ✭ 4,943 (+8003.28%)
Chromaprint
C library for generating audio fingerprints used by AcoustID
Stars: ✭ 553 (+806.56%)
Mutual labels:  audio, audio-processing
Sonus
💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+772.13%)
Audio Visualizer Android
🎵 [Android Library] A light-weight and easy-to-use Audio Visualizer for Android.
Stars: ✭ 581 (+852.46%)
Mutual labels:  audio, audio-processing
Resnetcam Keras
Keras implementation of a ResNet-CAM model
Stars: ✭ 269 (+340.98%)
Mutual labels:  keras-tensorflow, keras-models
Adapt
Adapt Intent Parser
Stars: ✭ 690 (+1031.15%)
Ffmediaelement
FFME: The Advanced WPF MediaElement (based on FFmpeg)
Stars: ✭ 733 (+1101.64%)
Mutual labels:  audio, audio-processing
Deepo
Setup and customize deep learning environment in seconds.
Stars: ✭ 6,145 (+9973.77%)
Mutual labels:  caffe, mxnet
Mmdnn
MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Convert models between Caffe, Keras, MXNet, Tensorflow, CNTK, PyTorch Onnx and CoreML.
Stars: ✭ 5,472 (+8870.49%)
Mutual labels:  caffe, mxnet
Eesen
The official repository of the Eesen project
Stars: ✭ 738 (+1109.84%)
Machine Learning Curriculum
💻 Make machines learn so that you don't have to struggle to program them; The ultimate list
Stars: ✭ 761 (+1147.54%)
Mutual labels:  caffe, mxnet
Beethoven
🎸 A maestro of pitch detection.
Stars: ✭ 601 (+885.25%)
Mutual labels:  audio, audio-processing
Face Mask Detection
Face Mask Detection system based on computer vision and deep learning using OpenCV and Tensorflow/Keras
Stars: ✭ 774 (+1168.85%)
Mutual labels:  caffe, keras-tensorflow
Annyang
💬 Speech recognition for your site
Stars: ✭ 6,216 (+10090.16%)
Stephanie Va
Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.
Stars: ✭ 772 (+1165.57%)
Kur
Descriptive Deep Learning
Stars: ✭ 811 (+1229.51%)
Steppy Toolkit
Curated set of transformers that make your work with steppy faster and more effective 🔭
Stars: ✭ 21 (-65.57%)
Mutual labels:  tensorflow-models, keras-models
Mxnet2caffe
convert model from mxnet to caffe without lossing precision
Stars: ✭ 20 (-67.21%)
Mutual labels:  caffe, mxnet
Guitard
Node based multi effects audio processor
Stars: ✭ 31 (-49.18%)
Mutual labels:  audio, audio-processing
Soloud
Free, easy, portable audio engine for games
Stars: ✭ 1,048 (+1618.03%)
Mutual labels:  speech-to-text, audio
Giada
Your Hardcore Loop Machine.
Stars: ✭ 903 (+1380.33%)
Mutual labels:  audio, audio-processing
Kfr
Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Stars: ✭ 985 (+1514.75%)
Mutual labels:  audio, audio-processing
Discordspeechbot
A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Stars: ✭ 35 (-42.62%)
1-60 of 1764 similar projects