461 open-source projects that are alternatives to, or similar to, ml-with-audio

hf-experiments
Experiments with Hugging Face 🔬 🤗
Stars: ✭ 37 (-64.42%)
Mutual labels:  speech-recognition, huggingface
Awesome Ai Services
An overview of the AI-as-a-service landscape
Stars: ✭ 133 (+27.88%)
Openseq2seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: ✭ 1,378 (+1225%)
spokestack-ios
Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-74.04%)
Athena
An open-source implementation of a sequence-to-sequence based speech processing engine
Stars: ✭ 542 (+421.15%)
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+371.15%)
react-native-spokestack
Spokestack: give your React Native app a voice interface!
Stars: ✭ 53 (-49.04%)
Cross vc
Cross-lingual Voice Conversion
Stars: ✭ 91 (-12.5%)
voicekit-examples
Examples on how to use Tinkoff Voicekit
Stars: ✭ 35 (-66.35%)
Espnet
End-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+4258.65%)
Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (+97.12%)
Naomi
The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Stars: ✭ 171 (+64.42%)
TinyCog
Small Robot, Toy Robot platform
Stars: ✭ 29 (-72.12%)
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+708.65%)
speechrec
a simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-82.69%)
Khronos
The open source intelligent personal assistant
Stars: ✭ 25 (-75.96%)
leon
🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+8130.77%)
web-speech-cognitive-services
Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-66.35%)
Nemo
NeMo: a toolkit for conversational AI
Stars: ✭ 3,685 (+3443.27%)
porfir
Porfirevich voice assistant
Stars: ✭ 23 (-77.88%)
spokestack-android
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-50%)
Libfaceid
libfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
Stars: ✭ 354 (+240.38%)
Awesome Speech Recognition Speech Synthesis Papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
Stars: ✭ 2,085 (+1904.81%)
Spokestack Python
Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (-0.96%)
Kalliope
Kalliope is a framework that will help you to create your own personal assistant.
Stars: ✭ 1,509 (+1350.96%)
Speech ai
Simple speech linguistic AI with Python
Stars: ✭ 66 (-36.54%)
idear
🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (-19.23%)
Artyom.js
A voice control, voice commands, speech recognition and speech synthesis JavaScript library. Create your own Siri, Google Now or Cortana with Google Chrome within your website.
Stars: ✭ 1,011 (+872.12%)
Lingvo
Lingvo
Stars: ✭ 2,361 (+2170.19%)
AmazonSpeechTranslator
End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-51.92%)
simple-obs-stt
Speech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (-14.42%)
Mutual labels:  speech-recognition
TensorVox
Desktop application for neural speech synthesis written in C++
Stars: ✭ 140 (+34.62%)
Mutual labels:  speech-synthesis
KodiSharp
Use Kodi python APIs in C#, and write rich addons using the .NET framework/Mono
Stars: ✭ 22 (-78.85%)
Mutual labels:  speech-recognition
awesome-keyword-spotting
This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
Stars: ✭ 150 (+44.23%)
Mutual labels:  speech-recognition
salutejs
SmartApp Framework for building skills for the "Salute" family of virtual assistants in JavaScript
Stars: ✭ 35 (-66.35%)
Mutual labels:  speech-recognition
awesome-huggingface
🤗 A list of wonderful open-source projects & applications integrated with Hugging Face libraries.
Stars: ✭ 436 (+319.23%)
Mutual labels:  huggingface
Catch-A-Waveform
Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)
Stars: ✭ 117 (+12.5%)
Mutual labels:  speech-synthesis
TFGAN
TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
Stars: ✭ 65 (-37.5%)
Mutual labels:  speech-synthesis
StyleSpeech
Official implementation of Meta-StyleSpeech and StyleSpeech
Stars: ✭ 161 (+54.81%)
Mutual labels:  speech-synthesis
praise
Do stuff with your voice in the browser.
Stars: ✭ 13 (-87.5%)
Mutual labels:  speech-recognition
api
Speechly public API definitions and generated code
Stars: ✭ 15 (-85.58%)
Mutual labels:  speech-recognition
KeenASR-Android-PoC
A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-79.81%)
Mutual labels:  speech-recognition
React.ai
Recognizes your speech, and a trained AI bot responds (e.g. customer service, personal assistant) using Machine Learning APIs (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux
Stars: ✭ 38 (-63.46%)
Mutual labels:  speech-recognition
clip-italian
CLIP (Contrastive Language–Image Pre-training) for Italian
Stars: ✭ 113 (+8.65%)
Mutual labels:  huggingface
octopus
On-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (-71.15%)
Mutual labels:  speech-recognition
awesome-end2end-speech-recognition
💬 A list of End-to-End speech recognition, including papers, codes and other materials
Stars: ✭ 49 (-52.88%)
Mutual labels:  speech-recognition
MediumVC
Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features
Stars: ✭ 46 (-55.77%)
Mutual labels:  speech-synthesis
SingleVC
Any-to-one voice conversion using a data augmentation strategy: pitch is shifted while duration is preserved.
Stars: ✭ 25 (-75.96%)
Mutual labels:  speech-synthesis
iOSProjects
A project that contains different applications developed with Swift 5.7 👨‍💻👩🏼‍💻🧑🏿‍💻
Stars: ✭ 122 (+17.31%)
Mutual labels:  speech-recognition
TabFormer
Code & Data for "Tabular Transformers for Modeling Multivariate Time Series" (ICASSP, 2021)
Stars: ✭ 209 (+100.96%)
Mutual labels:  huggingface
picovoice
The end-to-end platform for building voice products at scale
Stars: ✭ 316 (+203.85%)
Mutual labels:  speech-recognition
opensource-voice-tools
A repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-79.81%)
Mutual labels:  speech-recognition
End-to-End-Mandarin-ASR
End-to-end speech recognition on AISHELL dataset.
Stars: ✭ 20 (-80.77%)
Mutual labels:  speech-recognition
2018-dlsl
UPC Deep Learning for Speech and Language 2018
Stars: ✭ 18 (-82.69%)
Mutual labels:  speech-recognition
Cross-Speaker-Emotion-Transfer
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
Stars: ✭ 107 (+2.88%)
Mutual labels:  speech-synthesis
Expressive-FastSpeech2
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
Stars: ✭ 139 (+33.65%)
Mutual labels:  speech-synthesis
web-voice-processor
A library for real-time voice processing in web browsers
Stars: ✭ 69 (-33.65%)
Mutual labels:  speech-recognition
AnimeGANv3
Use AnimeGANv3 to make your own animation works, including turning photos or videos into anime.
Stars: ✭ 878 (+744.23%)
Mutual labels:  huggingface
ctc-asr
End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
Stars: ✭ 112 (+7.69%)
Mutual labels:  speech-recognition
Neural-Scam-Artist
Web Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset.
Stars: ✭ 18 (-82.69%)
Mutual labels:  huggingface