MerlinThis is now the official location of the Merlin project.
Stars: ✭ 1,168 (+889.83%)
Skip Thoughts.torchPorting of Skip-Thoughts pretrained models from Theano to PyTorch & Torch7
Stars: ✭ 146 (+23.73%)
mlp-singerOfficial implementation of MLP Singer: Towards Rapid Parallel Korean Singing Voice Synthesis (IEEE MLSP 2021)
Stars: ✭ 103 (-12.71%)
Tod BertPre-Trained Models for ToD-BERT
Stars: ✭ 143 (+21.19%)
Dragonfirethe open-source virtual assistant for Ubuntu based Linux distributions
Stars: ✭ 1,120 (+849.15%)
NnsplitSemantic text segmentation. For sentence boundary detection, compound splitting and more.
Stars: ✭ 141 (+19.49%)
Electra中文 预训练 ELECTRA 模型: 基于对抗学习 pretrain Chinese Model
Stars: ✭ 132 (+11.86%)
ganimation replicateAn Out-of-the-Box Replication of GANimation using PyTorch, pretrained weights are available!
Stars: ✭ 165 (+39.83%)
arguing-robots🤖 Watch and hear macOS robots argue live in your terminal 🤖
Stars: ✭ 53 (-55.08%)
VocganVocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Stars: ✭ 158 (+33.9%)
Mrcp Plugin With Freeswitch使用FreeSWITCH接受用户手机呼叫,通过UniMRCP Server集成讯飞开放平台(xfyun)插件将用户语音进行语音识别(ASR),并根据自定义业务逻辑调用语音合成(TTS),构建简单的端到端语音呼叫中心。
Stars: ✭ 168 (+42.37%)
Covid Twitter BertPretrained BERT model for analysing COVID-19 Twitter data
Stars: ✭ 101 (-14.41%)
Friend.lyA social media platform with a friend recommendation engine based on personality trait extraction
Stars: ✭ 41 (-65.25%)
gap-text2sqlGAP-text2SQL: Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training
Stars: ✭ 83 (-29.66%)
MelnetImplementation of "MelNet: A Generative Model for Audio in the Frequency Domain"
Stars: ✭ 161 (+36.44%)
Tts Papers🐸 collection of TTS papers
Stars: ✭ 160 (+35.59%)
Dialogue UnderstandingThis repository contains PyTorch implementation for the baseline models from the paper Utterance-level Dialogue Understanding: An Empirical Study
Stars: ✭ 77 (-34.75%)
AsrgenAttacking Speaker Recognition with Deep Generative Models
Stars: ✭ 31 (-73.73%)
PCPMPresenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (-82.2%)
Gpt2 MlGPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型
Stars: ✭ 1,066 (+803.39%)
Pyannote AudioNeural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Stars: ✭ 978 (+728.81%)
baseclsA codebase & model zoo for pretrained backbone based on MegEngine.
Stars: ✭ 29 (-75.42%)
Vonage Php Sdk CoreVonage REST API client for PHP. API support for SMS, Voice, Text-to-Speech, Numbers, Verify (2FA) and more.
Stars: ✭ 849 (+619.49%)
melganMelGAN implementation with Multi-Band and Full Band supports...
Stars: ✭ 54 (-54.24%)
Bert KerasKeras implementation of BERT with pre-trained weights
Stars: ✭ 820 (+594.92%)
Pytorchinsighta pytorch lib with state-of-the-art architectures, pretrained models and real-time updated results
Stars: ✭ 713 (+504.24%)
Conv EmotionThis repo contains implementation of different architectures for emotion recognition in conversations.
Stars: ✭ 646 (+447.46%)
cookiettsTTS from Cookie. Messy and experimental!
Stars: ✭ 29 (-75.42%)
TacotronA TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
Stars: ✭ 1,756 (+1388.14%)
DlaDeep learning for audio processing
Stars: ✭ 142 (+20.34%)
Pinto model zooA repository that shares tuning results of trained models generated by TensorFlow / Keras. Post-training quantization (Weight Quantization, Integer Quantization, Full Integer Quantization, Float16 Quantization), Quantization-aware training. TensorFlow Lite. OpenVINO. CoreML. TensorFlow.js. TF-TRT. MediaPipe. ONNX. [.tflite,.h5,.pb,saved_model,tfjs,tftrt,mlmodel,.xml/.bin, .onnx]
Stars: ✭ 634 (+437.29%)
Breast cancer classifierDeep Neural Networks Improve Radiologists' Performance in Breast Cancer Screening
Stars: ✭ 614 (+420.34%)
masr中文语音识别系列,读者可以借助它快速训练属于自己的中文语音识别模型,或直接使用预训练模型测试效果。
Stars: ✭ 179 (+51.69%)
Silero ModelsSilero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+342.37%)
Deepvoice3 pytorchPyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Stars: ✭ 1,654 (+1301.69%)
Google Speech V2💬 Reverse Engineering Google's Speech To Text API (v2)
Stars: ✭ 435 (+268.64%)
web-speech-cognitive-servicesPolyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-70.34%)
Midi2voiceSinging synthesis from MIDI file
Stars: ✭ 102 (-13.56%)
My AppdaemonMy apps, my helpfiles, all about AppDaemon for Home Assistant
Stars: ✭ 94 (-20.34%)
Pytorch Human Pose EstimationImplementation of various human pose estimation models in pytorch on multiple datasets (MPII & COCO) along with pretrained models
Stars: ✭ 346 (+193.22%)
Alan Sdk WebAlan AI Web SDK adds a voice assistant or chatbot to your app. Supports React, Angular, Vue, Ember, JavaScript, Electron.
Stars: ✭ 368 (+211.86%)
PitchtronTTS for pitch-accented language. Korean dialect DB.
Stars: ✭ 91 (-22.88%)
TtsTools to convert text to speech 📚💬
Stars: ✭ 84 (-28.81%)
concurrent-video-analytic-pipeline-optimization-sample-lCreate a concurrent video analysis pipeline featuring multistream face and human pose detection, vehicle attribute detection, and the ability to encode multiple videos to local storage in a single stream.
Stars: ✭ 39 (-66.95%)
WARPCode for ACL'2021 paper WARP 🌀 Word-level Adversarial ReProgramming. Outperforming `GPT-3` on SuperGLUE Few-Shot text classification. https://aclanthology.org/2021.acl-long.381/
Stars: ✭ 66 (-44.07%)
open clipAn open source implementation of CLIP.
Stars: ✭ 1,534 (+1200%)
bangla-ttsBangla text to speech, Multilingual (Bangla, English) real-time ([almost] in a GPU) speech synthesis library
Stars: ✭ 61 (-48.31%)
Hassio AddonsThe repository for my Home Assistant Supervisor Add-ons.
Stars: ✭ 71 (-39.83%)
Cnn vocoderA fast cnn-based vocoder
Stars: ✭ 74 (-37.29%)