Casr Demo基于Flask Web的中文自动语音识别演示系统,包含语音识别、语音合成、声纹识别之说话人识别。
Stars: ✭ 76 (-68.98%)
Mit Deep Learning Book PdfMIT Deep Learning Book in PDF format (complete and parts) by Ian Goodfellow, Yoshua Bengio and Aaron Courville
Stars: ✭ 9,859 (+3924.08%)
Spokestack PythonSpokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (-57.96%)
anycontrolVoice control for your websites and applications
Stars: ✭ 53 (-78.37%)
Multimodal-Gesture-Recognition-with-LSTMs-and-CTCAn end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from continuous stream.
Stars: ✭ 25 (-89.8%)
simple-obs-sttSpeech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (-63.67%)
PCPMPresenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (-91.43%)
KeenASR-Android-PoCA proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-91.43%)
NetronVisualizer for neural network, deep learning, and machine learning models
Stars: ✭ 17,193 (+6917.55%)
simple diarizerSimplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
Stars: ✭ 26 (-89.39%)
opensnipsOpen source projects related to Snips https://snips.ai/.
Stars: ✭ 50 (-79.59%)
speech to texthow to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-85.71%)
demo vietasrVietnamese Speech Recognition
Stars: ✭ 22 (-91.02%)
PaddlexPaddlePaddle End-to-End Development Toolkit(『飞桨』深度学习全流程开发工具)
Stars: ✭ 3,399 (+1287.35%)
Speech To Text RussianПроект для распознавания речи на русском языке на основе pykaldi.
Stars: ✭ 151 (-38.37%)
SpeechtAn opensource speech-to-text software written in tensorflow
Stars: ✭ 152 (-37.96%)
Tensorflowasr⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Stars: ✭ 400 (+63.27%)
Awesome KaldiThis is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (+60.41%)
Java Speech ApiThe J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+100%)
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (+121.22%)
DeepfacelabDeepFaceLab is the leading software for creating deepfakes.
Stars: ✭ 30,308 (+12270.61%)
SincnetSincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+211.84%)
PykaldiA Python wrapper for Kaldi
Stars: ✭ 756 (+208.57%)
KurDescriptive Deep Learning
Stars: ✭ 811 (+231.02%)
Dc ttsA TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Stars: ✭ 1,017 (+315.1%)
DiscordspeechbotA speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Stars: ✭ 35 (-85.71%)
Annyang💬 Speech recognition for your site
Stars: ✭ 6,216 (+2437.14%)
FixyAmacımız Türkçe NLP literatüründeki birçok farklı sorunu bir arada çözebilen, eşsiz yaklaşımlar öne süren ve literatürdeki çalışmaların eksiklerini gideren open source bir yazım destekleyicisi/denetleyicisi oluşturmak. Kullanıcıların yazdıkları metinlerdeki yazım yanlışlarını derin öğrenme yaklaşımıyla çözüp aynı zamanda metinlerde anlamsal analizi de gerçekleştirerek bu bağlamda ortaya çıkan yanlışları da fark edip düzeltebilmek.
Stars: ✭ 165 (-32.65%)
LibraErgonomic machine learning for everyone.
Stars: ✭ 1,925 (+685.71%)
Tacotron asrSpeech Recognition Using Tacotron
Stars: ✭ 165 (-32.65%)
Deeplearning4jAll DeepLearning4j projects go here.
Stars: ✭ 68 (-72.24%)
WatbotAn Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Stars: ✭ 64 (-73.88%)
DeepspeechA PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (+397.55%)
Vosk ApiOffline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+453.88%)
Open sttOpen STT
Stars: ✭ 584 (+138.37%)
DeltaDELTA is a deep learning based natural language and speech processing platform.
Stars: ✭ 1,479 (+503.67%)
Ensemble PytorchA unified ensemble framework for Pytorch to improve the performance and robustness of your deep learning model
Stars: ✭ 153 (-37.55%)
Pytorch Kaldipytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+755.92%)
NemoNeMo: a toolkit for conversational AI
Stars: ✭ 3,685 (+1404.08%)
Tts CubeEnd-2-end speech synthesis with recurrent neural networks
Stars: ✭ 213 (-13.06%)
MahjongaiJapanese Riichi Mahjong AI agent. (Feel free to extend this agent or develop your own agent)
Stars: ✭ 210 (-14.29%)
InfernoA utility library around PyTorch
Stars: ✭ 237 (-3.27%)
GamA PyTorch implementation of "Graph Classification Using Structural Attention" (KDD 2018).
Stars: ✭ 227 (-7.35%)
TrixiManage your machine learning experiments with trixi - modular, reproducible, high fashion. An experiment infrastructure optimized for PyTorch, but flexible enough to work for your framework and your tastes.
Stars: ✭ 211 (-13.88%)
AdatuneGradient based Hyperparameter Tuning library in PyTorch
Stars: ✭ 226 (-7.76%)
Stereo From Mono[ECCV 2020] Learning stereo from single images using monocular depth estimation networks
Stars: ✭ 210 (-14.29%)
Handtrack.jsA library for prototyping realtime hand detection (bounding box), directly in the browser.
Stars: ✭ 2,531 (+933.06%)