All Projects → Avpi → Similar Projects or Alternatives

427 Open source projects that are alternatives of or similar to Avpi

Vchsm
C++ 11 algorithm implementation for voice conversion using harmonic plus stochastic models
Stars: ✭ 38 (-70.77%)
Mutual labels:  voice
room-impulse-responses
A list of publicly available room impulse response datasets and scripts to download them.
Stars: ✭ 143 (+10%)
Mutual labels:  speech
Imagedetect
✂️ Detect and crop faces, barcodes and texts in image with iOS 11 Vision api.
Stars: ✭ 286 (+120%)
Mutual labels:  recognition
Hand-Digits-Recognition
Recognize your own handwritten digits with Tensorflow, embedded in a PyQT5 GUI. The Neural Network was trained on MNIST.
Stars: ✭ 11 (-91.54%)
Mutual labels:  recognition
Teaspeak
The TeaSpeak server issue tracker
Stars: ✭ 81 (-37.69%)
Mutual labels:  voice
brasiltts
Brasil TTS é um conjunto de sintetizadores de voz, em português do Brasil, que lê telas para portadores de deficiência visual. Transforma texto em áudio, permitindo que pessoas cegas ou com baixa visão tenham acesso ao conteúdo exibido na tela. Embora o principal público-alvo de sistemas de conversão texto-fala – como o Brasil TTS – seja formado…
Stars: ✭ 34 (-73.85%)
Mutual labels:  voice
Disgord
Go module for interacting with the documented Discord's bot interface; Gateway, REST requests and voice
Stars: ✭ 277 (+113.08%)
Mutual labels:  voice
Discordspeechbot
A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Stars: ✭ 35 (-73.08%)
Mutual labels:  speech
lidbox
End-to-end spoken language identification out of the box.
Stars: ✭ 39 (-70%)
Mutual labels:  speech
Noisetorch
Real-time microphone noise suppression on Linux.
Stars: ✭ 5,199 (+3899.23%)
Mutual labels:  voice
awesome-rhasspy
Carefully curated list of projects and resources for the voice assistant Rhasspy
Stars: ✭ 50 (-61.54%)
Mutual labels:  voice
Alan Sdk Pcf
Alan AI Power Apps SDK adds a voice assistant or chatbot to your Microsoft Power Apps project.
Stars: ✭ 128 (-1.54%)
Mutual labels:  voice
Iter Reason
Code for Iterative Reasoning Paper (CVPR 2018)
Stars: ✭ 263 (+102.31%)
Mutual labels:  recognition
JustAnotherVoiceChat
TeamSpeak 3 plugin to control 3D voice communication in games
Stars: ✭ 21 (-83.85%)
Mutual labels:  voice
Wsay
Windows "say"
Stars: ✭ 36 (-72.31%)
Mutual labels:  speech
karen
open-source voice assistant
Stars: ✭ 19 (-85.38%)
Mutual labels:  voice
Speech Aligner
speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription
Stars: ✭ 259 (+99.23%)
Mutual labels:  speech
useful-twilio-functions
A set of useful Twilio Functions.
Stars: ✭ 53 (-59.23%)
Mutual labels:  voice
Deepspeech
A PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (+837.69%)
Mutual labels:  speech
eidos-audition
Collection of auditory models.
Stars: ✭ 25 (-80.77%)
Mutual labels:  speech
Amazing Python Scripts
🚀 Curated collection of Amazing Python scripts from Basics to Advance with automation task scripts.
Stars: ✭ 229 (+76.15%)
Mutual labels:  speech
NBSS
The official repo of "Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training", "Multichannel Speech Separation with Narrow-band Conformer" and "NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer".
Stars: ✭ 77 (-40.77%)
Mutual labels:  speech
Lightspeech
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Stars: ✭ 31 (-76.15%)
Mutual labels:  speech
africastalking-node.js
Official Node.js SDK for Africa's Talking
Stars: ✭ 113 (-13.08%)
Mutual labels:  voice
Noise2Noise-audio denoising without clean training data
Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoisi…
Stars: ✭ 49 (-62.31%)
Mutual labels:  speech
africastalking.Net
Africa's Talking API Wrapper for C#
Stars: ✭ 16 (-87.69%)
Mutual labels:  voice
Midi2voice
Singing synthesis from MIDI file
Stars: ✭ 102 (-21.54%)
Mutual labels:  voice
EnglishStu
英语学习软件,集成有道翻译、科大讯飞,有翻译、朗读示例、阅读评测功能
Stars: ✭ 27 (-79.23%)
Mutual labels:  voice
minutes
🔭 Speaker diarization via transfer learning
Stars: ✭ 25 (-80.77%)
Mutual labels:  speech
pytorch-pcen
PyTorch reimplementation of per-channel energy normalization for audio.
Stars: ✭ 80 (-38.46%)
Mutual labels:  speech
Aaya
Personal Voice Assistant
Stars: ✭ 20 (-84.62%)
Mutual labels:  voice
txt2speech
Convert text to speech using Google Translate API
Stars: ✭ 38 (-70.77%)
Mutual labels:  speech
ruby-magic
Simple interface to libmagic for Ruby Programming Language
Stars: ✭ 23 (-82.31%)
Mutual labels:  recognition
JustAnotherVoiceChat-Server
Server for the JustAnotherVoiceChat TeamSpeak 3 plugin
Stars: ✭ 17 (-86.92%)
Mutual labels:  voice
Phormatics
Using A.I. and computer vision to build a virtual personal fitness trainer. (Most Startup-Viable Hack - HackNYU2018)
Stars: ✭ 79 (-39.23%)
Mutual labels:  recognition
deepspeech.mxnet
A MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (-36.92%)
Mutual labels:  speech
AlexaAndroid
No description or website provided.
Stars: ✭ 15 (-88.46%)
Mutual labels:  voice
Multimodal-Gesture-Recognition-with-LSTMs-and-CTC
An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from continuous stream.
Stars: ✭ 25 (-80.77%)
Mutual labels:  speech
Vc With Gan
Voice Conversion with GANs
Stars: ✭ 13 (-90%)
Mutual labels:  voice
react-native-speech-bubble
💬 A speech bubble dialog component for React Native.
Stars: ✭ 50 (-61.54%)
Mutual labels:  speech
Voice-Denoising-AN
A Conditional Generative Adverserial Network (cGAN) was adapted for the task of source de-noising of noisy voice auditory images. The base architecture is adapted from Pix2Pix.
Stars: ✭ 42 (-67.69%)
Mutual labels:  voice
idear
🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (-35.38%)
Mutual labels:  speech
Tts
Text-to-Speech for Arduino
Stars: ✭ 118 (-9.23%)
Mutual labels:  speech
Naver-AI-Hackathon-Speech
2019 Clova AI Hackathon : Speech - Rank 12 / Team Kai.Lib
Stars: ✭ 26 (-80%)
Mutual labels:  speech
tt-vae-gan
Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source to that of another.
Stars: ✭ 37 (-71.54%)
Mutual labels:  speech
lectures-all
Central repository for all lectures on deep learning at UPC ETSETB TelecomBCN.
Stars: ✭ 46 (-64.62%)
Mutual labels:  speech
Xunfei Clj
Clojure封装讯飞语音SDK, 可提供给Emacs/Vim编辑器使用,或者命令行, 实现语音提醒/语音识别/语音转为命令等
Stars: ✭ 26 (-80%)
Mutual labels:  voice
Voiceripple
Voice Record Button that has ripple effect with users voice
Stars: ✭ 379 (+191.54%)
Mutual labels:  voice
D-TDNN
PyTorch implementation of Densely Connected Time Delay Neural Network
Stars: ✭ 60 (-53.85%)
Mutual labels:  speech
editts
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech
Stars: ✭ 74 (-43.08%)
Mutual labels:  speech
Speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+86.15%)
Mutual labels:  speech
Vonage Dotnet Sdk
Nexmo REST API client for .NET, ASP.NET, ASP.NET MVC written in C#. API support for SMS, Voice, Text-to-Speech, Numbers, Verify (2FA) and more.
Stars: ✭ 76 (-41.54%)
Mutual labels:  voice
kaldi helpers
🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
Stars: ✭ 13 (-90%)
Mutual labels:  speech
download audioset
📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
Stars: ✭ 53 (-59.23%)
Mutual labels:  voice
Voc
A physical model of the human vocal tract using literate programming, based on Pink Trombone.
Stars: ✭ 129 (-0.77%)
Mutual labels:  speech
Univoice
P2P VoIP in Unity
Stars: ✭ 128 (-1.54%)
Mutual labels:  voice
Pytorch Asr
ASR with PyTorch
Stars: ✭ 124 (-4.62%)
Mutual labels:  speech
Dcnets
Implementation for <Decoupled Networks> in CVPR'18.
Stars: ✭ 115 (-11.54%)
Mutual labels:  recognition
Wavenet Enhancement
Speech Enhancement using Bayesian WaveNet
Stars: ✭ 86 (-33.85%)
Mutual labels:  speech
Syn Speech
Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-56.15%)
Mutual labels:  speech
301-360 of 427 similar projects