wenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 2,384 (+4972.34%)
rustfstRust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Stars: ✭ 104 (+121.28%)
Speech-BackbonesThis is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (+336.17%)
DiViMeACLEW Diarization Virtual Machine
Stars: ✭ 28 (-40.43%)
vosk-asteriskSpeech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (+10.64%)
UniSpeechUniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (+376.6%)
AmplitudaAmlituda - an android library that calculates amplitudes from audio and provides data in different formats. Based on this data, you can draw waveform. Android audio amplitude library.
Stars: ✭ 75 (+59.57%)
spokestack-androidExtensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (+10.64%)
scim[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.
Stars: ✭ 17 (-63.83%)
awesome-keyword-spottingThis repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
Stars: ✭ 150 (+219.15%)
SoundfingerprintingOpen source audio fingerprinting in .NET. An efficient algorithm for acoustic fingerprinting written purely in C#.
Stars: ✭ 554 (+1078.72%)
Computervision RecipesBest Practices, code samples, and documentation for Computer Vision.
Stars: ✭ 8,214 (+17376.6%)
Nara wpeDifferent implementations of "Weighted Prediction Error" for speech dereverberation
Stars: ✭ 265 (+463.83%)
Caffe HrtHeterogeneous Run Time version of Caffe. Added heterogeneous capabilities to the Caffe, uses heterogeneous computing infrastructure framework to speed up Deep Learning on Arm-based heterogeneous embedded platform. It also retains all the features of the original Caffe architecture which users deploy their applications seamlessly.
Stars: ✭ 271 (+476.6%)
hifigan-denoiserHiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Stars: ✭ 88 (+87.23%)
Pytorch SrganA modern PyTorch implementation of SRGAN
Stars: ✭ 289 (+514.89%)
Vosk ServerWebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Stars: ✭ 277 (+489.36%)
UnityASRAutomatic Speech Recognition in Unity.
Stars: ✭ 14 (-70.21%)
Personality DetectionImplementation of a hierarchical CNN based model to detect Big Five personality traits
Stars: ✭ 338 (+619.15%)
Ios 10 SamplerCode examples for new APIs of iOS 10.
Stars: ✭ 3,341 (+7008.51%)
RmdlRMDL: Random Multimodel Deep Learning for Classification
Stars: ✭ 375 (+697.87%)
ArtificioDeep Learning Computer Vision Algorithms for Real-World Use
Stars: ✭ 326 (+593.62%)
Zamia SpeechOpen tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (+695.74%)
opensource-voice-toolsA repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-55.32%)
QC++ Library for Audio Digital Signal Processing
Stars: ✭ 481 (+923.4%)
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (+1053.19%)
Wavesurfer.jsNavigable waveform built on Web Audio and Canvas
Stars: ✭ 5,905 (+12463.83%)
Tf Pose EstimationDeep Pose Estimation implemented using Tensorflow with Custom Architectures for fast inference.
Stars: ✭ 3,856 (+8104.26%)
Auto EditorAuto-Editor: Effort free video editing!
Stars: ✭ 382 (+712.77%)
DeepfaceDeep Learning Models for Face Detection/Recognition/Alignments, implemented in Tensorflow
Stars: ✭ 409 (+770.21%)
Fast SrganA Fast Deep Learning Model to Upsample Low Resolution Videos to High Resolution at 30fps
Stars: ✭ 417 (+787.23%)
Asrt speechrecognitionA Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Stars: ✭ 4,943 (+10417.02%)
Food Recipe Cnnfood image to recipe with deep convolutional neural networks.
Stars: ✭ 448 (+853.19%)
Neuralnetwork.netA TensorFlow-inspired neural network library built from scratch in C# 7.3 for .NET Standard 2.0, with GPU support through cuDNN
Stars: ✭ 392 (+734.04%)
DeepmodelsTensorFlow Implementation of state-of-the-art models since 2012
Stars: ✭ 33 (-29.79%)
Silero ModelsSilero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+1010.64%)
Svhn CnnGoogle Street View House Number(SVHN) Dataset, and classifying them through CNN
Stars: ✭ 44 (-6.38%)
Regl CnnDigit recognition with Convolutional Neural Networks in WebGL
Stars: ✭ 490 (+942.55%)
ChromaprintC library for generating audio fingerprints used by AcoustID
Stars: ✭ 553 (+1076.6%)
Trending Deep LearningTop 100 trending deep learning repositories sorted by the number of stars gained on a specific day.
Stars: ✭ 543 (+1055.32%)
Audio Visualizer Android🎵 [Android Library] A light-weight and easy-to-use Audio Visualizer for Android.
Stars: ✭ 581 (+1136.17%)
Speech recognitionSpeech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+12663.83%)
Libreasr💬 An On-Premises, Streaming Speech Recognition System
Stars: ✭ 633 (+1246.81%)
FdwaveformviewReads an audio file and displays the waveform
Stars: ✭ 997 (+2021.28%)
TorchioMedical image preprocessing and augmentation toolkit for deep learning
Stars: ✭ 708 (+1406.38%)
Deeplearning.aideeplearning.ai , By Andrew Ng, All video link
Stars: ✭ 625 (+1229.79%)
Awesome DiarizationA curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Stars: ✭ 673 (+1331.91%)
FfmediaelementFFME: The Advanced WPF MediaElement (based on FFmpeg)
Stars: ✭ 733 (+1459.57%)
AudinoOpen source audio annotation tool for humans™
Stars: ✭ 740 (+1474.47%)
PykaldiA Python wrapper for Kaldi
Stars: ✭ 756 (+1508.51%)
WenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 617 (+1212.77%)
EesenThe official repository of the Eesen project
Stars: ✭ 738 (+1470.21%)
Tf cnnvisCNN visualization tool in TensorFlow
Stars: ✭ 769 (+1536.17%)