Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Stars: ✭ 2,085 (-54%)

Mutual labels: speech-synthesis, speech-recognition

Deep Learning Drizzle

Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!

Stars: ✭ 9,717 (+114.36%)

Mutual labels: speech-recognition, machine-translation

Speech Transformer Tf2.0

transformer for ASR-system (via tensorflow2.0)

Stars: ✭ 90 (-98.01%)

Mutual labels: speech-recognition, end-to-end

E2e Asr

PyTorch Implementations for End-to-End Automatic Speech Recognition

Stars: ✭ 106 (-97.66%)

Mutual labels: speech-recognition, end-to-end

speechrec

a simple speech recognition app using the Web Speech API Interfaces

Stars: ✭ 18 (-99.6%)

Mutual labels: speech-synthesis, speech-recognition

ppg-vc

PPG-Based Voice Conversion

Stars: ✭ 154 (-96.6%)

Mutual labels: speech-synthesis, voice-conversion

DSTC6-End-to-End-Conversation-Modeling

DSTC6: End-to-End Conversation Modeling Track

Stars: ✭ 56 (-98.76%)

Mutual labels: chainer, end-to-end

Rus-SpeechRecognition-LSTM-CTC-VoxForge

Russian speech recognition using Tensorflow, trained on the Voxforge dataset

Stars: ✭ 50 (-98.9%)

Mutual labels: end-to-end, speech-recognition

speech separation

Constrained Permutation Invariant Training, Speech Separation

Stars: ✭ 27 (-99.4%)

Mutual labels: speech-separation

Wire Ios

📱 Wire for iOS (iPhone and iPad)

Stars: ✭ 3,079 (-32.08%)

Mutual labels: end-to-end

waifu2x-chainer

Chainer implementation of waifu2x

Stars: ✭ 137 (-96.98%)

Mutual labels: chainer

kim-voice-assistant

Kim, your personal voice assistant.

Stars: ✭ 70 (-98.46%)

Mutual labels: speech-recognition

Bytenet Tensorflow

ByteNet for character-level language modelling

Stars: ✭ 319 (-92.96%)

Mutual labels: machine-translation

React Transcript Editor

A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress

Stars: ✭ 285 (-93.71%)

Mutual labels: kaldi

sova-asr

SOVA ASR (Automatic Speech Recognition)

Stars: ✭ 123 (-97.29%)

Mutual labels: speech-recognition

Voice-Denoising-AN

A Conditional Generative Adversarial Network (cGAN) was adapted for the task of source de-noising of noisy voice auditory images. The base architecture is adapted from Pix2Pix.

Stars: ✭ 42 (-99.07%)

Mutual labels: speech-enhancement

Alan Sdk Android

Alan AI Android SDK adds a voice assistant or chatbot to your app. Supports Java, Kotlin.

Stars: ✭ 278 (-93.87%)

Mutual labels: speech-recognition

Ajax-Chat

Ajax Chat is a complete web chat in javascript, ajax, php and mysql compatible with Phonegap

Stars: ✭ 19 (-99.58%)

Mutual labels: end-to-end

Espeak

eSpeak NG is an open source speech synthesizer that supports 101 languages and accents.

Stars: ✭ 339 (-92.52%)

Mutual labels: speech-synthesis

Gp Gan

Official Chainer implementation of GP-GAN: Towards Realistic High-Resolution Image Blending (ACMMM 2019, oral)

Stars: ✭ 317 (-93.01%)

Mutual labels: chainer

nepali-translator

Neural Machine Translation on the Nepali-English language pair

Stars: ✭ 29 (-99.36%)

Mutual labels: machine-translation

Recording-Bot

A bot built to record and transcribe audio fragments from Discord.

Stars: ✭ 22 (-99.51%)

Mutual labels: speech-recognition

StageMate

StageMate is the smart assistant for your presentation. It will cover all aspects of your pitch from skipping slides to reminding you if you miss some major point.

Stars: ✭ 60 (-98.68%)

Mutual labels: speech-recognition

Parakeet

PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)

Stars: ✭ 279 (-93.85%)

Mutual labels: speech-synthesis

Tacotron pytorch

PyTorch implementation of Tacotron

Stars: ✭ 12 (-99.74%)

Mutual labels: speech-synthesis

Alan Sdk Flutter

Alan AI Flutter SDK adds a voice assistant or chatbot to your app.

Stars: ✭ 309 (-93.18%)

Mutual labels: speech-recognition

Neuraldialog Cvae

Tensorflow Implementation of Knowledge-Guided CVAE for dialog generation ACL 2017. It is released by Tiancheng Zhao (Tony) from Dialog Research Center, LTI, CMU

Stars: ✭ 279 (-93.85%)

Mutual labels: end-to-end

dropclass speaker

DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020

Stars: ✭ 20 (-99.56%)

Mutual labels: kaldi

editts

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Stars: ✭ 74 (-98.37%)

Mutual labels: speech-synthesis

Phonetisaurus

Phonetisaurus G2P

Stars: ✭ 277 (-93.89%)

Mutual labels: speech-recognition

Multi-Hotword Spotting

Wouldn't it be cool to build a speech assistant like Alexa or Siri yourself, without a voice API or network connection?

Stars: ✭ 31 (-99.32%)

Mutual labels: speech-recognition

download audioset

📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).

Stars: ✭ 53 (-98.83%)

Mutual labels: speech-recognition

Multilingual text to speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Stars: ✭ 324 (-92.85%)

Mutual labels: speech-synthesis

Cognitive Speech Tts

Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

Stars: ✭ 312 (-93.12%)

Mutual labels: speech-synthesis

Transformer

A Pytorch Implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation"

Stars: ✭ 271 (-94.02%)

Mutual labels: machine-translation

musicologist

Music advice from a conversational interface powered by Algolia

Stars: ✭ 19 (-99.58%)

Mutual labels: speech-recognition

ocaml-otr

Off-the-record (OTR) messaging protocol, purely in OCaml

Stars: ✭ 39 (-99.14%)

Mutual labels: end-to-end

Attention-Visualization

Visualization for simple attention and Google's multi-head attention.

Stars: ✭ 54 (-98.81%)

Mutual labels: machine-translation

Zhihu

This repo contains the source code from my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented using Python 3.6. It includes Natural Language Processing and Computer Vision projects, such as text generation, machine translation, deep convolution GAN, and other hands-on code.

Stars: ✭ 3,307 (-27.05%)

Mutual labels: machine-translation

Alan Sdk Cordova

Alan AI Cordova SDK adds a voice assistant or chatbot to your app.

Stars: ✭ 269 (-94.07%)

Mutual labels: speech-recognition

61-120 of 717 similar projects
