OPUS-MT-trainTraining open neural machine translation models
Stars: ✭ 166 (-96.34%)
edittsOfficial implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech
Stars: ✭ 74 (-98.37%)
speech-enhancement-WGANspeech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN
Stars: ✭ 35 (-99.23%)
web-voice-processorA library for real-time voice processing in web browsers
Stars: ✭ 69 (-98.48%)
SpleeterRTReal time monaural source separation base on fully convolutional neural network operates on Time-frequency domain.
Stars: ✭ 111 (-97.55%)
speech-to-text-code-patternReact app using the Watson Speech to Text service to transform voice audio into written text.
Stars: ✭ 37 (-99.18%)
kaldi-python-ioA python IO interface for data accessing in kaldi
Stars: ✭ 39 (-99.14%)
download audioset📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
Stars: ✭ 53 (-98.83%)
gravityUser-space deniable data encryption client.
Stars: ✭ 89 (-98.04%)
ilmultiTooling to play around with multilingual machine translation for Indian Languages.
Stars: ✭ 19 (-99.58%)
sepia-docsDocumentation and Wiki for SEPIA. Please post your questions and bug-reports here in the issues section! Thank you :-)
Stars: ✭ 160 (-96.47%)
sova-tts-tpsNLP-preprocessor for the SOVA-TTS project
Stars: ✭ 44 (-99.03%)
ocaml-otrOff-the-record (OTR) messaging protocol, purely in OCaml
Stars: ✭ 39 (-99.14%)
bergamot-translatorCross platform C++ library focusing on optimized machine translation on the consumer-grade device.
Stars: ✭ 181 (-96.01%)
Tensorflow-Keyword-SpottingKeyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.
Stars: ✭ 27 (-99.4%)
superresolution ganChainer implementation of Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network
Stars: ✭ 50 (-98.9%)
ChainerPrunerChainerPruner: Channel Pruning framework for Chainer
Stars: ✭ 21 (-99.54%)
speech-transformerTransformer implementation speciaized in speech recognition tasks using Pytorch.
Stars: ✭ 40 (-99.12%)
NanoFlowPyTorch implementation of the paper "NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity." (NeurIPS 2020)
Stars: ✭ 63 (-98.61%)
captioning chainerA fast implementation of Neural Image Caption by Chainer
Stars: ✭ 17 (-99.62%)
QPPWGQuasi-Periodic Parallel WaveGAN Pytorch implementation
Stars: ✭ 41 (-99.1%)
ASR-Audio-Data-LinksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (-96.05%)
ZhihuThis repo contains the source code in my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented using Python 3.6. Including Natural Language Processing and Computer Vision projects, such as text generation, machine translation, deep convolution GAN and other actual combat code.
Stars: ✭ 3,307 (-27.05%)
asr2424-hour Automatic Speech Recognition
Stars: ✭ 27 (-99.4%)
wav2vec2-liveA live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (-95.48%)
htkHTK Toolkit with Linux 64 bit and Docker support
Stars: ✭ 14 (-99.69%)
wiki2ssmlWiki2SSML provides the WikiVoice markup language used for fine-tuning synthesised voice.
Stars: ✭ 31 (-99.32%)
good-speech-web-clientPractice your speech level in any language using speech recognition
Stars: ✭ 26 (-99.43%)
obviA Polymer 3+ webcomponent / button for doing speech recognition
Stars: ✭ 54 (-98.81%)
JD-NMFJoint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)
Stars: ✭ 20 (-99.56%)
sova-tts-engineTacotron2 based engine for the SOVA-TTS project
Stars: ✭ 63 (-98.61%)
sepia-stt-serverSEPIA server to support open-source speech recognition via WebSocket connection.
Stars: ✭ 45 (-99.01%)
quickstart-examplesIntegration examples of Tanker's client-side encryption SDKs
Stars: ✭ 17 (-99.62%)
Hifi GanHiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Stars: ✭ 325 (-92.83%)
GlottDNNGlottDNN vocoder and tools for training DNN excitation models
Stars: ✭ 30 (-99.34%)
IMS-ToucanText-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Stars: ✭ 295 (-93.49%)
Fre-GAN-pytorchFre-GAN: Adversarial Frequency-consistent Audio Synthesis
Stars: ✭ 73 (-98.39%)
megsA merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-99.54%)
Portrait mattingImplementation of "Automatic Portrait Segmentation" and "Deep Automatic Portrait Matting" with Chainer.
Stars: ✭ 267 (-94.11%)
fdndlpA speech dereverberation algorithm, also called wpe
Stars: ✭ 115 (-97.46%)
LVCNetLVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
Stars: ✭ 67 (-98.52%)
QuantumSpeech-QCNNIEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition
Stars: ✭ 71 (-98.43%)
SacremosesPython port of Moses tokenizer, truecaser and normalizer
Stars: ✭ 293 (-93.54%)
inv rlInverse Reinforcement Learning Argorithms
Stars: ✭ 34 (-99.25%)
apertium-html-toolsWeb application providing a fully localised interface for text/website/document translation, analysis and generation powered by Apertium.
Stars: ✭ 36 (-99.21%)
PhomemeSimple sentence mixing tool (work in progress)
Stars: ✭ 18 (-99.6%)
VoiceBridgeVoiceBridge - an AI-TOOLKIT Open Source C++ Speech Recognition Toolkit
Stars: ✭ 17 (-99.62%)