CNN-based audio segmentation toolkit. Allows to detect speech, music and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

Stars: ✭ 352 (-72.02%)

Mutual labels: speech

linear16

Converts an audio file to LINEAR16 Google-speech compatible file.

Stars: ✭ 14 (-98.89%)

Mutual labels: speech

Arcan

Arcan - [Display Server, Multimedia Framework, Game Engine] -> "Desktop Engine"

Stars: ✭ 885 (-29.65%)

Mutual labels: audio-processing

speech-transformer

Transformer implementation speciaized in speech recognition tasks using Pytorch.

Stars: ✭ 40 (-96.82%)

Mutual labels: speech

All Contributors Cli

Tool to help automate adding contributor acknowledgements according to the all-contributors specification ✨

Stars: ✭ 345 (-72.58%)

Mutual labels: recognition

farm-animal-tracking

Farm Animal Tracking (FAT)

Stars: ✭ 19 (-98.49%)

Mutual labels: recognition

Ccpd

[ECCV 2018] CCPD: a diverse and well-annotated dataset for license plate detection and recognition

Stars: ✭ 1,252 (-0.48%)

Mutual labels: recognition

TensorFlow-Powered Robot Vision

No description or website provided.

Stars: ✭ 34 (-97.3%)

Mutual labels: recognition

Dplug

Audio plugin framework. VST2/VST3/AU/AAX/LV2 for Linux/macOS/Windows.

Stars: ✭ 341 (-72.89%)

Mutual labels: audio-processing

Vst3HostDemo

Stars: ✭ 16 (-98.73%)

Mutual labels: audio-processing

Speechpy

💬 SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/

Stars: ✭ 833 (-33.78%)

Mutual labels: speech-recognition

JD-NMF

Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)

Stars: ✭ 20 (-98.41%)

Mutual labels: speech

Php Opencv Examples

Tutorial for computer vision and machine learning in PHP 7/8 by opencv (installation + examples + documentation)

Stars: ✭ 333 (-73.53%)

Mutual labels: recognition

Deep-learning-And-Paper

【仅作为交流学习使用】机器智能--相关书目及经典论文包括AutoML、情感分类、语音识别、声纹识别、语音合成实验代码等

Stars: ✭ 62 (-95.07%)

Mutual labels: speech-recognition

Sound Source Localization Algorithm doa estimation

关于语音信号声源定位DOA估计所用的一些传统算法

Stars: ✭ 58 (-95.39%)

Mutual labels: speech

Ios 10 Sampler

Code examples for new APIs of iOS 10.

Stars: ✭ 3,341 (+165.58%)

Mutual labels: speech

deep avsr

A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.

Stars: ✭ 104 (-91.73%)

Mutual labels: speech-recognition

Espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Stars: ✭ 808 (-35.77%)

Mutual labels: speech-recognition

spokestack-ios

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (-97.85%)

Mutual labels: speech-recognition

Deepspeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Stars: ✭ 18,680 (+1384.9%)

Mutual labels: speech-recognition

Awesome Web Audio

A list of resources and projects to help learn about audio

Stars: ✭ 73 (-94.2%)

Mutual labels: audio-processing

wikipron

Massively multilingual pronunciation mining

Stars: ✭ 167 (-86.72%)

Mutual labels: speech

Wave U Net

Implementation of the Wave-U-Net for audio source separation

Stars: ✭ 506 (-59.78%)

Mutual labels: audio-processing

sepia-stt-server

SEPIA server to support open-source speech recognition via WebSocket connection.

Stars: ✭ 45 (-96.42%)

Mutual labels: speech-recognition

kaldi helpers

🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.

Stars: ✭ 13 (-98.97%)

Mutual labels: speech

Alan Sdk Ios

Alan AI iOS SDK adds a voice assistant or chatbot to your app. Supports Swift, Objective-C.

Stars: ✭ 318 (-74.72%)

Mutual labels: speech-recognition

react-client

An React client library for Speechly API

Stars: ✭ 71 (-94.36%)

Mutual labels: speech-recognition

Stephanie Va

Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.

Stars: ✭ 772 (-38.63%)

Mutual labels: speech-recognition

kaldi-long-audio-alignment

Long audio alignment using Kaldi

Stars: ✭ 21 (-98.33%)

Mutual labels: speech-recognition

Vectorhub

Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc)

Stars: ✭ 317 (-74.8%)

Mutual labels: audio-processing

Dolphinattack

Inaudible Voice Commands

Stars: ✭ 57 (-95.47%)

Mutual labels: speech-recognition

etiketai

Etiketai is an online tool designed to label images, useful for training AI models

Stars: ✭ 63 (-94.99%)

Mutual labels: recognition

Identifying-Clothing-Attributes

Pretrained VGG-16 network as feature extractor for Object Recognition (Python, Keras, Scikit-Learn)

Stars: ✭ 22 (-98.25%)

Mutual labels: recognition

SpeechToText

Speech To Text in Android

Stars: ✭ 53 (-95.79%)

Mutual labels: speech-recognition

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Stars: ✭ 205 (-83.7%)

Mutual labels: speech-recognition

Dali

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

Stars: ✭ 3,624 (+188.08%)

Mutual labels: audio-processing

syn-speech-samples

An application that demostrate the usage of Syn.Speech library for Speech Recognition

Stars: ✭ 24 (-98.09%)

Mutual labels: speech-recognition

Deepspeech Websocket Server

Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments

Stars: ✭ 79 (-93.72%)

Mutual labels: speech-recognition

tenacity

Tenacity is an easy-to-use, privacy-friendly, FLOSS, cross-platform multi-track audio editor/recorder for Windows, macOS, Linux and other operating systems. Project currently on an indefinite hiatus.

Stars: ✭ 7,231 (+474.8%)

Mutual labels: audio-processing

Speech recognition

A Flutter plugin to use speech recognition on iOS & Android (Swift/Java)

Stars: ✭ 302 (-75.99%)

Mutual labels: speech-recognition

Eesen

The official repository of the Eesen project

Stars: ✭ 738 (-41.34%)

Mutual labels: speech-recognition

Tacotron

Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.