speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription

Stars: ✭ 259 (-74.53%)

Mutual labels: speech

Amazing Python Scripts

🚀 Curated collection of Amazing Python scripts from Basics to Advance with automation task scripts.

Stars: ✭ 229 (-77.48%)

Mutual labels: speech

Speech Demo

语音api示例

Stars: ✭ 454 (-55.36%)

Mutual labels: speech-to-text

Comprehensive-Tacotron2

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

Stars: ✭ 22 (-97.84%)

Mutual labels: tts

Noise2Noise-audio denoising without clean training data

Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoisi…

Stars: ✭ 49 (-95.18%)

Mutual labels: speech

Adapt

Adapt Intent Parser

Stars: ✭ 690 (-32.15%)

Mutual labels: speech-to-text

Melgan

MelGAN vocoder (compatible with NVIDIA/tacotron2)

Stars: ✭ 444 (-56.34%)

Mutual labels: tts

hifigan-denoiser

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Stars: ✭ 88 (-91.35%)

Mutual labels: speech

esp32-flite

Speech synthesis running on ESP32 based on Flite engine.

Stars: ✭ 28 (-97.25%)

Mutual labels: tts

Voice Overlay Ios

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

Stars: ✭ 440 (-56.74%)

Mutual labels: speech-to-text

minutes

🔭 Speaker diarization via transfer learning

Stars: ✭ 25 (-97.54%)

Mutual labels: speech

Speechtotext Websockets Java

SDK & Sample to do speech recognition using websockets in Java

Stars: ✭ 11 (-98.92%)

Mutual labels: speech-to-text

Parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

Stars: ✭ 682 (-32.94%)

Mutual labels: tts

demo vietasr

Vietnamese Speech Recognition

Stars: ✭ 22 (-97.84%)

Mutual labels: speech-to-text

RequestifyTF2

Client side commands for mic spamming and more!

Stars: ✭ 13 (-98.72%)

Mutual labels: tts

flite-go

Go bindings for Flite (festival-lite)

Stars: ✭ 14 (-98.62%)

Mutual labels: speech

Transformer Tts

A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"

Stars: ✭ 418 (-58.9%)

Mutual labels: tts

kim-voice-assistant

Kim，你的私人语音助理。

Stars: ✭ 70 (-93.12%)

Mutual labels: speech-to-text

Praat

Praat: Doing Phonetics By Computer

Stars: ✭ 675 (-33.63%)

Mutual labels: speech

Asrt speechrecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Stars: ✭ 4,943 (+386.04%)

Mutual labels: speech-to-text

google-translate-tts

Node library for Google Translate TTS (Text-to-Speech) API

Stars: ✭ 23 (-97.74%)

Mutual labels: tts

leopard-chat-ui-teneo

Leopard Chat UI - A Teneo Chat Client based on Vue and Vuetify

Stars: ✭ 65 (-93.61%)

Mutual labels: tts

Specaugment

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

Stars: ✭ 408 (-59.88%)

Mutual labels: speech

talkbot

Text-to-speech and translation bot for Discord

Stars: ✭ 27 (-97.35%)

Mutual labels: tts

Dialectid e2e

End to End Dialect Identification using Convolutional Neural Network

Stars: ✭ 40 (-96.07%)

Mutual labels: speech

apple airplayer

Make your AirPlay devices as TTS speakers

Stars: ✭ 84 (-91.74%)

Mutual labels: tts

Kur

Descriptive Deep Learning

Stars: ✭ 811 (-20.26%)

Mutual labels: speech-to-text

Segan

Speech Enhancement Generative Adversarial Network in TensorFlow

Stars: ✭ 661 (-35%)

Mutual labels: speech

Rhino

On-device speech-to-intent engine powered by deep learning

Stars: ✭ 406 (-60.08%)

Mutual labels: speech-to-text

tt-vae-gan

Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source to that of another.

Stars: ✭ 37 (-96.36%)

Mutual labels: speech

BangalASR

Transformer based Bangla Speech Recognition

Stars: ✭ 20 (-98.03%)

Mutual labels: speech-to-text

Neural sp

End-to-end ASR/LM implementation with PyTorch

Stars: ✭ 408 (-59.88%)

Mutual labels: speech

home-assistant-custom-components-linkplay

LinkPlay based media devices integration for Home Assistant. Fully compatible with Mini Media Player card including speaker group management. Supports snapshot and restore functionality for TTS.

Stars: ✭ 62 (-93.9%)

Mutual labels: tts

Speech recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Stars: ✭ 5,999 (+489.87%)

Mutual labels: speech-to-text

Tensorflowasr

⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords

Stars: ✭ 400 (-60.67%)

Mutual labels: speech-to-text

Speech256

An FPGA implementation of a classic 80ies speech synthesizer. Done for the Retro Challenge 2017/10.

Stars: ✭ 51 (-94.99%)

Mutual labels: speech

speak.awf

An Alfred 3 workflow that uses macOS's TTS (text-to-speech) feature to speak text aloud.

Stars: ✭ 29 (-97.15%)

Mutual labels: tts

61-120 of 438 similar projects

‹

›

next*5