DkerasDistributed Keras Engine, Make Keras faster with only one line of code.
Stars: ✭ 181 (+196.72%)
Pytorch2kerasPyTorch to Keras model convertor
Stars: ✭ 676 (+1008.2%)
pytorch2kerasPyTorch to Keras model convertor
Stars: ✭ 788 (+1191.8%)
deepspeech.mxnetA MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (+34.43%)
SincnetSincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+1152.46%)
Speech recognitionSpeech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+9734.43%)
web-voice-processorA library for real-time voice processing in web browsers
Stars: ✭ 69 (+13.11%)
TF-Model-Deploy-TutorialA tutorial exploring multiple approaches to deploy a trained TensorFlow (or Keras) model or multiple models for prediction.
Stars: ✭ 51 (-16.39%)
Keras SincnetKeras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-22.95%)
SurfboardNovoic's audio feature extraction library
Stars: ✭ 318 (+421.31%)
Predictive Maintenance Using LstmExample of Multiple Multivariate Time Series Prediction with LSTM Recurrent Neural Networks in Python with Keras.
Stars: ✭ 352 (+477.05%)
Awesome KaldiThis is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (+544.26%)
Tensorflowasr⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Stars: ✭ 400 (+555.74%)
QC++ Library for Audio Digital Signal Processing
Stars: ✭ 481 (+688.52%)
Syn SpeechSyn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-6.56%)
SoundfingerprintingOpen source audio fingerprinting in .NET. An efficient algorithm for acoustic fingerprinting written purely in C#.
Stars: ✭ 554 (+808.2%)
RhinoOn-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (+565.57%)
Silero ModelsSilero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+755.74%)
Bidaf KerasBidirectional Attention Flow for Machine Comprehension implemented in Keras 2
Stars: ✭ 60 (-1.64%)
Pinto model zooA repository that shares tuning results of trained models generated by TensorFlow / Keras. Post-training quantization (Weight Quantization, Integer Quantization, Full Integer Quantization, Float16 Quantization), Quantization-aware training. TensorFlow Lite. OpenVINO. CoreML. TensorFlow.js. TF-TRT. MediaPipe. ONNX. [.tflite,.h5,.pb,saved_model,tfjs,tftrt,mlmodel,.xml/.bin, .onnx]
Stars: ✭ 634 (+939.34%)
DaliA GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Stars: ✭ 3,624 (+5840.98%)
Segmentation modelsSegmentation models with pretrained backbones. Keras and TensorFlow Keras.
Stars: ✭ 3,575 (+5760.66%)
DeepspeechDeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Stars: ✭ 18,680 (+30522.95%)
MusigA shazam like tool to store songs fingerprints and retrieve them
Stars: ✭ 388 (+536.07%)
CheetahOn-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (+527.87%)
Auto EditorAuto-Editor: Effort free video editing!
Stars: ✭ 382 (+526.23%)
Deep Learning Model ConvertorThe convertor/conversion of deep learning models for different deep learning frameworks/softwares.
Stars: ✭ 3,044 (+4890.16%)
Voice Overlay Ios🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (+621.31%)
Java Speech ApiThe J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+703.28%)
Asrt speechrecognitionA Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Stars: ✭ 4,943 (+8003.28%)
ChromaprintC library for generating audio fingerprints used by AcoustID
Stars: ✭ 553 (+806.56%)
Sonus💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+772.13%)
Audio Visualizer Android🎵 [Android Library] A light-weight and easy-to-use Audio Visualizer for Android.
Stars: ✭ 581 (+852.46%)
Resnetcam KerasKeras implementation of a ResNet-CAM model
Stars: ✭ 269 (+340.98%)
AdaptAdapt Intent Parser
Stars: ✭ 690 (+1031.15%)
FfmediaelementFFME: The Advanced WPF MediaElement (based on FFmpeg)
Stars: ✭ 733 (+1101.64%)
DeepoSetup and customize deep learning environment in seconds.
Stars: ✭ 6,145 (+9973.77%)
MmdnnMMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Convert models between Caffe, Keras, MXNet, Tensorflow, CNTK, PyTorch Onnx and CoreML.
Stars: ✭ 5,472 (+8870.49%)
EesenThe official repository of the Eesen project
Stars: ✭ 738 (+1109.84%)
Machine Learning Curriculum💻 Make machines learn so that you don't have to struggle to program them; The ultimate list
Stars: ✭ 761 (+1147.54%)
Beethoven🎸 A maestro of pitch detection.
Stars: ✭ 601 (+885.25%)
Face Mask DetectionFace Mask Detection system based on computer vision and deep learning using OpenCV and Tensorflow/Keras
Stars: ✭ 774 (+1168.85%)
Annyang💬 Speech recognition for your site
Stars: ✭ 6,216 (+10090.16%)
Stephanie VaStephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.
Stars: ✭ 772 (+1165.57%)
KurDescriptive Deep Learning
Stars: ✭ 811 (+1229.51%)
Steppy ToolkitCurated set of transformers that make your work with steppy faster and more effective 🔭
Stars: ✭ 21 (-65.57%)
Mxnet2caffeconvert model from mxnet to caffe without lossing precision
Stars: ✭ 20 (-67.21%)
GuitardNode based multi effects audio processor
Stars: ✭ 31 (-49.18%)
SoloudFree, easy, portable audio engine for games
Stars: ✭ 1,048 (+1618.03%)
GiadaYour Hardcore Loop Machine.
Stars: ✭ 903 (+1380.33%)
KfrFast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Stars: ✭ 985 (+1514.75%)
DiscordspeechbotA speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Stars: ✭ 35 (-42.62%)