Brasil TTS is a set of Brazilian Portuguese speech synthesizers that read screens aloud for people with visual impairments. It converts text to audio, giving blind and low-vision users access to the content shown on screen. Although the main target audience of text-to-speech systems such as Brasil TTS is made up of…

Stars: ✭ 34 (-73.85%)

Mutual labels: voice

Disgord

Go module for interacting with Discord's documented bot interface: Gateway, REST requests, and voice

Stars: ✭ 277 (+113.08%)

Mutual labels: voice

Discordspeechbot

A speech-to-text bot for Discord with music commands and more, built with NodeJS. Ideal for controlling your Discord server using voice commands; can also be useful for hearing-impaired people.

Stars: ✭ 35 (-73.08%)

Mutual labels: speech

lidbox

End-to-end spoken language identification out of the box.

Stars: ✭ 39 (-70%)

Mutual labels: speech

Noisetorch

Real-time microphone noise suppression on Linux.

Stars: ✭ 5,199 (+3899.23%)

Mutual labels: voice

awesome-rhasspy

Carefully curated list of projects and resources for the voice assistant Rhasspy

Stars: ✭ 50 (-61.54%)

Mutual labels: voice

Alan Sdk Pcf

Alan AI Power Apps SDK adds a voice assistant or chatbot to your Microsoft Power Apps project.

Stars: ✭ 128 (-1.54%)

Mutual labels: voice

Iter Reason

Code for Iterative Reasoning Paper (CVPR 2018)

Stars: ✭ 263 (+102.31%)

Mutual labels: recognition

JustAnotherVoiceChat

TeamSpeak 3 plugin to control 3D voice communication in games

Stars: ✭ 21 (-83.85%)

Mutual labels: voice

Wsay

Windows "say"

Stars: ✭ 36 (-72.31%)

Mutual labels: speech

karen

open-source voice assistant

Stars: ✭ 19 (-85.38%)

Mutual labels: voice

Speech Aligner

speech-aligner is a tool that generates phoneme-level time alignments between human speech and its transcription.

Stars: ✭ 259 (+99.23%)

Mutual labels: speech

useful-twilio-functions

A set of useful Twilio Functions.

Stars: ✭ 53 (-59.23%)

Mutual labels: voice

Deepspeech

A PaddlePaddle implementation of ASR.

Stars: ✭ 1,219 (+837.69%)

Mutual labels: speech

eidos-audition

Collection of auditory models.

Stars: ✭ 25 (-80.77%)

Mutual labels: speech

Amazing Python Scripts

🚀 Curated collection of amazing Python scripts, from basic to advanced, including automation task scripts.

Stars: ✭ 229 (+76.15%)

Mutual labels: speech

NBSS

The official repo of "Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training", "Multichannel Speech Separation with Narrow-band Conformer" and "NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer".

Stars: ✭ 77 (-40.77%)

Mutual labels: speech

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-76.15%)

Mutual labels: speech

africastalking-node.js

Official Node.js SDK for Africa's Talking

Stars: ✭ 113 (-13.08%)

Mutual labels: voice

Noise2Noise-audio denoising without clean training data

Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach", accepted at the INTERSPEECH 2021 conference. The paper tackles the heavy dependence of deep-learning-based audio denoising methods on clean speech data by showing that it is possible to train deep speech denoisi…

Stars: ✭ 49 (-62.31%)

Mutual labels: speech

africastalking.Net

Africa's Talking API Wrapper for C#

Stars: ✭ 16 (-87.69%)

Mutual labels: voice

Midi2voice

Singing synthesis from MIDI file

Stars: ✭ 102 (-21.54%)

Mutual labels: voice

EnglishStu

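English-learning software that integrates Youdao Translate and iFLYTEK, with translation, read-aloud examples, and reading assessment features.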

Stars: ✭ 27 (-79.23%)

Mutual labels: voice

minutes

🔭 Speaker diarization via transfer learning

Stars: ✭ 25 (-80.77%)

Mutual labels: speech

pytorch-pcen

PyTorch reimplementation of per-channel energy normalization for audio.

Stars: ✭ 80 (-38.46%)

Mutual labels: speech

Aaya

Personal Voice Assistant

Stars: ✭ 20 (-84.62%)

Mutual labels: voice

txt2speech

Convert text to speech using Google Translate API

Stars: ✭ 38 (-70.77%)

Mutual labels: speech

ruby-magic

Simple interface to libmagic for Ruby Programming Language

Stars: ✭ 23 (-82.31%)

Mutual labels: recognition

JustAnotherVoiceChat-Server

Server for the JustAnotherVoiceChat TeamSpeak 3 plugin

Stars: ✭ 17 (-86.92%)

Mutual labels: voice

Phormatics

Using A.I. and computer vision to build a virtual personal fitness trainer. (Most Startup-Viable Hack - HackNYU2018)

Stars: ✭ 79 (-39.23%)

Mutual labels: recognition

deepspeech.mxnet

An MXNet implementation of Baidu's DeepSpeech architecture

Stars: ✭ 82 (-36.92%)

Mutual labels: speech

AlexaAndroid

No description or website provided.

Stars: ✭ 15 (-88.46%)

Mutual labels: voice

Multimodal-Gesture-Recognition-with-LSTMs-and-CTC

An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from a continuous stream.

Stars: ✭ 25 (-80.77%)

Mutual labels: speech

Vc With Gan

Voice Conversion with GANs

Stars: ✭ 13 (-90%)

Mutual labels: voice

react-native-speech-bubble

💬 A speech bubble dialog component for React Native.

Stars: ✭ 50 (-61.54%)

Mutual labels: speech

Voice-Denoising-AN

A Conditional Generative Adversarial Network (cGAN) adapted for the task of source de-noising of noisy voice auditory images. The base architecture is adapted from Pix2Pix.

Stars: ✭ 42 (-67.69%)

Mutual labels: voice

idear

🎙️ Handsfree Audio Development Interface

Stars: ✭ 84 (-35.38%)

Mutual labels: speech

Tts

Text-to-Speech for Arduino

Stars: ✭ 118 (-9.23%)

Mutual labels: speech

Naver-AI-Hackathon-Speech

2019 Clova AI Hackathon : Speech - Rank 12 / Team Kai.Lib

Stars: ✭ 26 (-80%)

Mutual labels: speech

tt-vae-gan

Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source to that of another.

Stars: ✭ 37 (-71.54%)

Mutual labels: speech

lectures-all

Central repository for all lectures on deep learning at UPC ETSETB TelecomBCN.

Stars: ✭ 46 (-64.62%)

Mutual labels: speech

Xunfei Clj

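Clojure wrapper for the iFLYTEK (Xunfei) speech SDK, usable from the Emacs/Vim editors or the command line, providing voice reminders, speech recognition, speech-to-command conversion, and more.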

Stars: ✭ 26 (-80%)

Mutual labels: voice

Voiceripple

Voice record button with a ripple effect that responds to the user's voice

Stars: ✭ 379 (+191.54%)

Mutual labels: voice

D-TDNN

PyTorch implementation of Densely Connected Time Delay Neural Network

Stars: ✭ 60 (-53.85%)

Mutual labels: speech

editts

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Stars: ✭ 74 (-43.08%)

Mutual labels: speech

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (+86.15%)

Mutual labels: speech

Vonage Dotnet Sdk

Nexmo REST API client for .NET, ASP.NET, ASP.NET MVC written in C#. API support for SMS, Voice, Text-to-Speech, Numbers, Verify (2FA) and more.

Stars: ✭ 76 (-41.54%)

Mutual labels: voice

kaldi helpers

🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.

Stars: ✭ 13 (-90%)

Mutual labels: speech

download audioset

📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).