Speech And TextSpeech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech (pyttsx3) | 语音转文字(PocketSphinx、百度 API、科大讯飞 API)和文字转语音(pyttsx3)
Stars: ✭ 102 (-26.62%)
atari-demoCode for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"
Stars: ✭ 21 (-84.89%)
IafCode for reproducing key results in the paper "Improving Variational Inference with Inverse Autoregressive Flow"
Stars: ✭ 468 (+236.69%)
FastSpeech2PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech
Stars: ✭ 163 (+17.27%)
VoicenetSpeech synthesis platform based on tensorflow and sonnet
Stars: ✭ 60 (-56.83%)
datasets🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Stars: ✭ 13,870 (+9878.42%)
MmfA modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Stars: ✭ 4,713 (+3290.65%)
Deepvoice3 pytorchPyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Stars: ✭ 1,654 (+1089.93%)
safety-gear-detector-pythonObserve workers as they pass in front of a camera to determine if they have adequate safety protection.
Stars: ✭ 54 (-61.15%)
Read AloudAn awesome browser extension that reads aloud webpage content with one click
Stars: ✭ 444 (+219.42%)
deepspeech.mxnetA MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (-41.01%)
DipsNAACL 2019: Submodular optimization-based diverse paraphrasing and its effectiveness in data augmentation
Stars: ✭ 59 (-57.55%)
speech to texthow to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-74.82%)
GateA high performant & paralleled Minecraft proxy server with scalability, flexibility & excellent server version support - ready for the cloud!
Stars: ✭ 102 (-26.62%)
Sparsely Grouped GanCode for paper "Sparsely Grouped Multi-task Generative Adversarial Networks for Facial Attribute Manipulation"
Stars: ✭ 68 (-51.08%)
Bert paper chinese translationBERT: Pre-training of Deep Bidirectional Transformers for Language Understanding 论文的中文翻译 Chinese Translation!
Stars: ✭ 564 (+305.76%)
Tensorflow-YOLACTImplementation of the paper "YOLACT Real-time Instance Segmentation" in Tensorflow 2
Stars: ✭ 97 (-30.22%)
Pytorch Classification UncertaintyThis repo contains a PyTorch implementation of the paper: "Evidential Deep Learning to Quantify Classification Uncertainty"
Stars: ✭ 59 (-57.55%)
SprocketVoice Conversion Tool Kit
Stars: ✭ 425 (+205.76%)
Vonage Python SdkVonage Server SDK for Python. API support for SMS, Voice, Text-to-Speech, Numbers, Verify (2FA) and more.
Stars: ✭ 134 (-3.6%)
msla2014wherein I implement several substructural logics in Agda
Stars: ✭ 24 (-82.73%)
voicesmacOS CLI for changing the default TTS (text-to-speech) voice and printing information about and speaking text with multiple voices.
Stars: ✭ 53 (-61.87%)
Pytorch Image ModelsPyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
Stars: ✭ 15,232 (+10858.27%)
mimic2Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.
Stars: ✭ 537 (+286.33%)
Neural Backed Decision TreesMaking decision trees competitive with neural networks on CIFAR10, CIFAR100, TinyImagenet200, Imagenet
Stars: ✭ 411 (+195.68%)
BIRADS classifierHigh-resolution breast cancer screening with multi-view deep convolutional neural networks
Stars: ✭ 122 (-12.23%)
LibraryCollection of papers in the field of distributed systems, game theory, cryptography, cryptoeconomics, zero knowledge
Stars: ✭ 100 (-28.06%)
data-at-hand-mobileMobile application for exploring fitness data using both speech and touch interaction.
Stars: ✭ 50 (-64.03%)
YatopiaThe Most Powerful and Feature Rich Minecraft Server Software!
Stars: ✭ 408 (+193.53%)
HotpurA fork of Purpur that aims to improve performance and add FabricMC compatibility.
Stars: ✭ 17 (-87.77%)
AutoInAgdaProof automation – for Agda, in Agda.
Stars: ✭ 38 (-72.66%)
Awesome KaldiThis is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (+182.73%)
cerberus researchResearch tools for analysing Cerberus banking trojan.
Stars: ✭ 110 (-20.86%)
MdmA TensorFlow implementation of the Mnemonic Descent Method.
Stars: ✭ 120 (-13.67%)
AdversarialAudioSeparationCode accompanying the paper "Semi-supervised adversarial audio source separation applied to singing voice extraction"
Stars: ✭ 70 (-49.64%)
pigalleryPiGallery: AI-powered Self-hosted Secure Multi-user Image Gallery and Detailed Image analysis using Machine Learning, EXIF Parsing and Geo Tagging
Stars: ✭ 35 (-74.82%)
CVCCVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)
Stars: ✭ 45 (-67.63%)
Syn SpeechSyn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-58.99%)
paper-terminalPrint Markdown to a paper in your terminal
Stars: ✭ 33 (-76.26%)
IpfsPeer-to-peer hypermedia protocol
Stars: ✭ 20,128 (+14380.58%)
EagerMOTOfficial code for "EagerMOT: 3D Multi-Object Tracking via Sensor Fusion" [ICRA 2021]
Stars: ✭ 249 (+79.14%)
Imitation Code for the paper "Generative Adversarial Imitation Learning"
Stars: ✭ 555 (+299.28%)
Nodejs SpeechNode.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.
Stars: ✭ 545 (+292.09%)
nlp-classA Natural Language Processing course taught by Professor Ghassemi
Stars: ✭ 95 (-31.65%)