This application helps speechless people interact with others with ease. It detects voice and converts the input speech into a sign-language video.

Stars: ✭ 21 (-83.85%)

Mutual labels: voice, speech

web-speech-demo

Learn how to build a simple text-to-speech voice app for the web using the Web Speech API.

Stars: ✭ 19 (-85.38%)

Mutual labels: voice, speech

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+276.92%)

Mutual labels: speech, recognition

fade

A Simulation Framework for Auditory Discrimination Experiments

Stars: ✭ 12 (-90.77%)

Mutual labels: recognition, speech

spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Stars: ✭ 52 (-60%)

Mutual labels: voice, speech

Annyang

💬 Speech recognition for your site

Stars: ✭ 6,216 (+4681.54%)

Mutual labels: speech, voice

Julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

Stars: ✭ 1,258 (+867.69%)

Mutual labels: speech, recognition

Voice Gender

Gender recognition by voice and speech analysis

Stars: ✭ 248 (+90.77%)

Mutual labels: speech, voice

voice-based-email-for-blind

Emailing System for visually impaired persons

Stars: ✭ 35 (-73.08%)

Mutual labels: voice, speech

Speech Emotion Analyzer

The neural network model can detect five different emotions in male and female speech audio. (Deep Learning, NLP, Python)

Stars: ✭ 633 (+386.92%)

Mutual labels: speech, voice

Dialectid e2e

End to End Dialect Identification using Convolutional Neural Network

Stars: ✭ 40 (-69.23%)

Mutual labels: speech, recognition

Audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Stars: ✭ 1,262 (+870.77%)

Mutual labels: speech

Durian

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.

Stars: ✭ 111 (-14.62%)

Mutual labels: speech

Tts

Tools to convert text to speech 📚💬

Stars: ✭ 84 (-35.38%)

Mutual labels: speech

Segan

A PyTorch implementation of SEGAN based on INTERSPEECH 2017 paper "SEGAN: Speech Enhancement Generative Adversarial Network"

Stars: ✭ 82 (-36.92%)

Mutual labels: voice

Labelbox

Labelbox is the fastest way to annotate data to build and ship computer vision applications.

Stars: ✭ 1,588 (+1121.54%)

Mutual labels: recognition

Delta

DELTA is a deep learning based natural language and speech processing platform.

Stars: ✭ 1,479 (+1037.69%)

Mutual labels: speech

Figaro

Real-time voice-changer for voice-chat, etc. Will support many different voice-filters and features in the future. 🎵

Stars: ✭ 80 (-38.46%)

Mutual labels: voice

Holobot

HoloBot is a reusable 3D interface that allows HoloLens & VR users to interact with any bot using Mixed Reality & Speech.

Stars: ✭ 114 (-12.31%)

Mutual labels: speech

Crnn chinese characters rec

(CRNN) Chinese Characters Recognition.

Stars: ✭ 1,259 (+868.46%)

Mutual labels: recognition

Code Switching Papers

A curated list of research papers and resources on code-switching

Stars: ✭ 122 (-6.15%)

Mutual labels: speech

Ccpd

[ECCV 2018] CCPD: a diverse and well-annotated dataset for license plate detection and recognition

Stars: ✭ 1,252 (+863.08%)

Mutual labels: recognition

Python Speech recognition

A simple example of using the Baidu speech recognition API with Python.

Stars: ✭ 106 (-18.46%)

Mutual labels: speech

Teaspeak

The TeaSpeak server issue tracker

Stars: ✭ 81 (-37.69%)

Mutual labels: voice

Alan Sdk Pcf

Alan AI Power Apps SDK adds a voice assistant or chatbot to your Microsoft Power Apps project.

Stars: ✭ 128 (-1.54%)

Mutual labels: voice

Midi2voice

Singing synthesis from MIDI file

Stars: ✭ 102 (-21.54%)

Mutual labels: voice

Deepspeech

A PaddlePaddle implementation of ASR.

Stars: ✭ 1,219 (+837.69%)

Mutual labels: speech

Vokaturiandroid

Emotion recognition from speech on Android.

Stars: ✭ 79 (-39.23%)

Mutual labels: recognition

Phormatics

Using A.I. and computer vision to build a virtual personal fitness trainer. (Most Startup-Viable Hack - HackNYU2018)

Stars: ✭ 79 (-39.23%)

Mutual labels: recognition

Tts

Text-to-Speech for Arduino

Stars: ✭ 118 (-9.23%)

Mutual labels: speech

Insideheartz Whatsapp Bot

A multipurpose WhatsApp bot built on Node.js

Stars: ✭ 102 (-21.54%)

Mutual labels: voice

Vonage Java Sdk

Vonage Server SDK for Java. API support for SMS, Voice, Text-to-Speech, Numbers, Verify (2FA) and more.

Stars: ✭ 75 (-42.31%)

Mutual labels: voice

Vonage Dotnet Sdk

Nexmo REST API client for .NET, ASP.NET, ASP.NET MVC written in C#. API support for SMS, Voice, Text-to-Speech, Numbers, Verify (2FA) and more.

Stars: ✭ 76 (-41.54%)

Mutual labels: voice

Assistantjs

TypeScript framework to build cross-platform voice applications (alexa, google home, ...).

Stars: ✭ 100 (-23.08%)

Mutual labels: voice

Android Kotlin Chat App

Open-source Voice & Video Calling and Text Chat App for Kotlin (Android)

Stars: ✭ 76 (-41.54%)

Mutual labels: voice

Face recognition

Face recognition docker image to provide a web service which is able to register and recognize faces

Stars: ✭ 74 (-43.08%)

Mutual labels: recognition

Asr audio data links

A list of publicly available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (-1.54%)

Mutual labels: speech

Reconstructing faces from voices

An example of the paper "reconstructing faces from voices"

Stars: ✭ 127 (-2.31%)

Mutual labels: speech

3d Densenet

3D Dense Connected Convolutional Network (3D-DenseNet for action recognition)

Stars: ✭ 118 (-9.23%)

Mutual labels: recognition

Online place recognition

Graph-based image sequence matching for visual place recognition in changing environments.

Stars: ✭ 100 (-23.08%)

Mutual labels: recognition

Unityrtc

A Unity demo implementing real-time in-game voice chat among multiple players, based on WebRTC

Stars: ✭ 74 (-43.08%)

Mutual labels: voice

Audiomate

Python library for handling audio datasets.

Stars: ✭ 99 (-23.85%)

Mutual labels: speech

Noise Suppression For Voice

Noise suppression plugin based on Xiph's RNNoise

Stars: ✭ 1,164 (+795.38%)

Mutual labels: voice

Openasr

A PyTorch-based end-to-end speech recognition system.

Stars: ✭ 69 (-46.92%)

Mutual labels: speech

Speech And Text Unity Ios Android

Speech to text in Unity on iOS using native Speech Recognition

Stars: ✭ 117 (-10%)

Mutual labels: speech

Mad Twinnet

The code for the MaD TwinNet. Demo page:

Stars: ✭ 99 (-23.85%)

Mutual labels: voice

Audioswitch

An Android audio management library for real-time communication apps.

Stars: ✭ 69 (-46.92%)

Mutual labels: voice

Epic Kitchens 55 Action Models

EPIC-KITCHENS-55 baselines for Action Recognition

Stars: ✭ 68 (-47.69%)

Mutual labels: recognition

Wikipron

Massively multilingual pronunciation mining

Stars: ✭ 99 (-23.85%)

Mutual labels: speech

Nlp Paper

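A collection of papers (with reading notes) on dialogue and speech within natural language processing, plus reproduced models and data processing code (in both TensorFlow and PyTorch versions)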

Stars: ✭ 67 (-48.46%)

Mutual labels: speech

Mtcnn

Face detection and alignment with MTCNN

Stars: ✭ 66 (-49.23%)

Mutual labels: recognition

1-60 of 427 similar projects
