App🤖 A GitHub App to automate acknowledging contributors to your open source projects
Stars: ✭ 358 (-71.54%)
facetFacet is a live coding system for algorithmic music
Stars: ✭ 72 (-94.28%)
torch-asgAuto Segmentation Criterion (ASG) implemented in PyTorch
Stars: ✭ 42 (-96.66%)
gtranscribeSoftware for interview transcription
Stars: ✭ 12 (-99.05%)
InaspeechsegmenterCNN-based audio segmentation toolkit. Detects speech, music, and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.
Stars: ✭ 352 (-72.02%)
linear16Converts an audio file to LINEAR16 Google-speech compatible file.
Stars: ✭ 14 (-98.89%)
ArcanArcan - [Display Server, Multimedia Framework, Game Engine] -> "Desktop Engine"
Stars: ✭ 885 (-29.65%)
speech-transformerTransformer implementation specialized in speech recognition tasks using PyTorch.
Stars: ✭ 40 (-96.82%)
All Contributors CliTool to help automate adding contributor acknowledgements according to the all-contributors specification ✨
Stars: ✭ 345 (-72.58%)
Ccpd[ECCV 2018] CCPD: a diverse and well-annotated dataset for license plate detection and recognition
Stars: ✭ 1,252 (-0.48%)
DplugAudio plugin framework. VST2/VST3/AU/AAX/LV2 for Linux/macOS/Windows.
Stars: ✭ 341 (-72.89%)
Speechpy💬 SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/
Stars: ✭ 833 (-33.78%)
JD-NMFJoint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)
Stars: ✭ 20 (-98.41%)
Php Opencv ExamplesTutorial for computer vision and machine learning in PHP 7/8 with OpenCV (installation + examples + documentation)
Stars: ✭ 333 (-73.53%)
Ios 10 SamplerCode examples for new APIs of iOS 10.
Stars: ✭ 3,341 (+165.58%)
deep avsrA PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Stars: ✭ 104 (-91.73%)
EspressoEspresso: A Fast End-to-End Neural Speech Recognition Toolkit
Stars: ✭ 808 (-35.77%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-97.85%)
DeepspeechDeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Stars: ✭ 18,680 (+1384.9%)
Awesome Web AudioA list of resources and projects to help learn about audio
Stars: ✭ 73 (-94.2%)
wikipronMassively multilingual pronunciation mining
Stars: ✭ 167 (-86.72%)
Wave U NetImplementation of the Wave-U-Net for audio source separation
Stars: ✭ 506 (-59.78%)
sepia-stt-serverSEPIA server to support open-source speech recognition via WebSocket connection.
Stars: ✭ 45 (-96.42%)
kaldi helpers🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
Stars: ✭ 13 (-98.97%)
Alan Sdk IosAlan AI iOS SDK adds a voice assistant or chatbot to your app. Supports Swift, Objective-C.
Stars: ✭ 318 (-74.72%)
react-clientA React client library for the Speechly API
Stars: ✭ 71 (-94.36%)
Stephanie VaStephanie is an open-source platform built specifically for voice-controlled applications and for automating daily tasks, imitating much of a virtual assistant's work.
Stars: ✭ 772 (-38.63%)
VectorhubVector Hub - Library for easy discovery and consumption of state-of-the-art models to turn data into vectors. (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc.)
Stars: ✭ 317 (-74.8%)
etiketaiEtiketai is an online tool designed to label images, useful for training AI models
Stars: ✭ 63 (-94.99%)
Speech-BackbonesThis is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (-83.7%)
DaliA GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Stars: ✭ 3,624 (+188.08%)
syn-speech-samplesAn application that demonstrates the usage of the Syn.Speech library for speech recognition
Stars: ✭ 24 (-98.09%)
Deepspeech Websocket ServerServer & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments
Stars: ✭ 79 (-93.72%)
tenacityTenacity is an easy-to-use, privacy-friendly, FLOSS, cross-platform multi-track audio editor/recorder for Windows, macOS, Linux and other operating systems. Project currently on an indefinite hiatus.
Stars: ✭ 7,231 (+474.8%)
Speech recognitionA Flutter plugin to use speech recognition on iOS & Android (Swift/Java)
Stars: ✭ 302 (-75.99%)
EesenThe official repository of the Eesen project
Stars: ✭ 738 (-41.34%)
TacotronAudio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.
Stars: ✭ 493 (-60.81%)
Fre-GAN-pytorchFre-GAN: Adversarial Frequency-consistent Audio Synthesis
Stars: ✭ 73 (-94.2%)
AdaSpeechAdaSpeech: Adaptive Text to Speech for Custom Voice
Stars: ✭ 108 (-91.41%)
PysptkA python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: ✭ 297 (-76.39%)
Iflytek awaken asrUses iFLYTEK's technology to implement wake-up and command recognition
Stars: ✭ 53 (-95.79%)
PhormaticsUsing A.I. and computer vision to build a virtual personal fitness trainer. (Most Startup-Viable Hack - HackNYU2018)
Stars: ✭ 79 (-93.72%)
Aca CodeMatlab scripts accompanying the book "An Introduction to Audio Content Analysis" (www.AudioContentAnalysis.org)
Stars: ✭ 67 (-94.67%)
PnccAn implementation of Power Normalized Cepstral Coefficients: PNCC
Stars: ✭ 40 (-96.82%)
speech-to-textmixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
Stars: ✭ 61 (-95.15%)
Mycroft PreciseA lightweight, simple-to-use, RNN wake word listener
Stars: ✭ 481 (-61.76%)
scim[wip] Speech recognition toolbox written in Nim. Based on Arraymancer.
Stars: ✭ 17 (-98.65%)
tsunamiA simple but powerful audio editor
Stars: ✭ 41 (-96.74%)
Tika PythonTika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Stars: ✭ 997 (-20.75%)