A series of three programs that automatically receive scripts from Reddit, let the user edit them, and then send them to a video generator, which uploads the resulting videos to YouTube automatically.

Stars: ✭ 152 (+28.81%)

Mutual labels: tts

Bert Keras

Keras implementation of BERT with pre-trained weights

Stars: ✭ 820 (+594.92%)

Mutual labels: pretrained-models

Nonparaseq2seqvc code

Implementation code of non-parallel sequence-to-sequence VC

Stars: ✭ 154 (+30.51%)

Mutual labels: text-to-speech

Awesome Speech Recognition Speech Synthesis Papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Stars: ✭ 2,085 (+1666.95%)

Mutual labels: tts

Pytorchinsight

A PyTorch library with state-of-the-art architectures, pretrained models, and results updated in real time

Stars: ✭ 713 (+504.24%)

Mutual labels: pretrained-models

node-red-contrib-yandex-station-management

The node-red-contrib-yandex-station-management module for controlling Yandex smart speakers

Stars: ✭ 20 (-83.05%)

Mutual labels: tts

Conv Emotion

This repo contains implementations of different architectures for emotion recognition in conversations.

Stars: ✭ 646 (+447.46%)

Mutual labels: pretrained-models

cookietts

TTS from Cookie. Messy and experimental!

Stars: ✭ 29 (-75.42%)

Mutual labels: tacotron2

Tacotron

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

Stars: ✭ 1,756 (+1388.14%)

Mutual labels: tts

Dla

Deep learning for audio processing

Stars: ✭ 142 (+20.34%)

Mutual labels: tts

Pinto model zoo

A repository that shares tuning results of trained models generated by TensorFlow / Keras. Post-training quantization (Weight Quantization, Integer Quantization, Full Integer Quantization, Float16 Quantization), Quantization-aware training. TensorFlow Lite. OpenVINO. CoreML. TensorFlow.js. TF-TRT. MediaPipe. ONNX. [.tflite,.h5,.pb,saved_model,tfjs,tftrt,mlmodel,.xml/.bin, .onnx]

Stars: ✭ 634 (+437.29%)

Mutual labels: pretrained-models

Breast cancer classifier

Deep Neural Networks Improve Radiologists' Performance in Breast Cancer Screening

Stars: ✭ 614 (+420.34%)

Mutual labels: pretrained-models

Self Driving Car In Video Games

A deep neural network that learns to drive in video games

Stars: ✭ 559 (+373.73%)

Mutual labels: pretrained-models

masr

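A Chinese speech recognition series. Readers can use it to quickly train their own Chinese speech recognition models, or use the pretrained models directly to test the results.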

Stars: ✭ 179 (+51.69%)

Mutual labels: pretrained-models

Silero Models

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple

Stars: ✭ 522 (+342.37%)

Mutual labels: pretrained-models

Ha Tts Bluetooth Speaker

TTS Bluetooth Speaker for Home Assistant

Stars: ✭ 140 (+18.64%)

Mutual labels: tts

Deepvoice3 pytorch

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Stars: ✭ 1,654 (+1301.69%)

Mutual labels: tts

Bert Multitask Learning

BERT for Multitask Learning

Stars: ✭ 380 (+222.03%)

Mutual labels: pretrained-models

Google Speech V2

💬 Reverse Engineering Google's Speech To Text API (v2)

Stars: ✭ 435 (+268.64%)

Mutual labels: text-to-speech

web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.

Stars: ✭ 35 (-70.34%)

Mutual labels: text-to-speech

Midi2voice

Singing synthesis from MIDI file

Stars: ✭ 102 (-13.56%)

Mutual labels: tts

My Appdaemon

My apps, my helpfiles, all about AppDaemon for Home Assistant

Stars: ✭ 94 (-20.34%)

Mutual labels: tts

Pytorch Human Pose Estimation

Implementation of various human pose estimation models in pytorch on multiple datasets (MPII & COCO) along with pretrained models

Stars: ✭ 346 (+193.22%)

Mutual labels: pretrained-models

Alan Sdk Web

Alan AI Web SDK adds a voice assistant or chatbot to your app. Supports React, Angular, Vue, Ember, JavaScript, Electron.

Stars: ✭ 368 (+211.86%)

Mutual labels: text-to-speech

Pitchtron

TTS for pitch-accented language. Korean dialect DB.

Stars: ✭ 91 (-22.88%)

Mutual labels: tts

Tts

Tools to convert text to speech 📚💬

Stars: ✭ 84 (-28.81%)

Mutual labels: tts

concurrent-video-analytic-pipeline-optimization-sample-l

Create a concurrent video analysis pipeline featuring multistream face and human pose detection, vehicle attribute detection, and the ability to encode multiple videos to local storage in a single stream.

Stars: ✭ 39 (-66.95%)

Mutual labels: pretrained-models

One-Shot-Voice-Cloning

☺️ One Shot Voice Cloning based on Unet-TTS

Stars: ✭ 118 (+0%)

Mutual labels: tts

WARP

Code for ACL'2021 paper WARP 🌀 Word-level Adversarial ReProgramming. Outperforming `GPT-3` on SuperGLUE Few-Shot text classification. https://aclanthology.org/2021.acl-long.381/

Stars: ✭ 66 (-44.07%)

Mutual labels: pretrained-models

open clip

An open source implementation of CLIP.

Stars: ✭ 1,534 (+1200%)

Mutual labels: pretrained-models

bangla-tts

Bangla text to speech: a multilingual (Bangla, English) speech synthesis library that runs in [almost] real time on a GPU

Stars: ✭ 61 (-48.31%)

Mutual labels: text-to-speech

java-google-speech-api

🙊 Speech Recognition, Text To Speech, Google Translate