Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.

Stars: ✭ 104 (-91.73%)

Mutual labels: speech-recognition

Awesome Diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

Stars: ✭ 673 (-46.5%)

Mutual labels: speech-recognition

DCASE2020 task1

Code for DCASE 2020 task 1a and task 1b.

Stars: ✭ 72 (-94.28%)

Mutual labels: audio-processing

Iter Reason

Code for Iterative Reasoning Paper (CVPR 2018)

Stars: ✭ 263 (-79.09%)

Mutual labels: recognition

Aurio

Audio Fingerprinting & Retrieval for .NET

Stars: ✭ 84 (-93.32%)

Mutual labels: audio-processing

SpeechToText

Speech To Text in Android

Stars: ✭ 53 (-95.79%)

Mutual labels: speech-recognition

Speech Aligner

speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription

Stars: ✭ 259 (-79.41%)

Mutual labels: speech

fast-mixer

Mini recording and mixing studio for android

Stars: ✭ 47 (-96.26%)

Mutual labels: audio-processing

Speech recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Stars: ✭ 5,999 (+376.87%)

Mutual labels: speech-recognition

ml-with-audio

HF's ML for Audio study group

Stars: ✭ 104 (-91.73%)

Mutual labels: speech-recognition

HotVoice

Adds Speech Recognition support to AutoHotkey, via a C# DLL

Stars: ✭ 41 (-96.74%)

Mutual labels: speech-recognition

Tacotron

Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.

Stars: ✭ 493 (-60.81%)

Mutual labels: speech

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Stars: ✭ 73 (-94.2%)

Mutual labels: speech

End-to-End-Mandarin-ASR

End-to-end speech recognition on AISHELL dataset.

Stars: ✭ 20 (-98.41%)

Mutual labels: speech-recognition

Noise2Noise-audio denoising without clean training data

Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoisi…

Stars: ✭ 49 (-96.1%)

Mutual labels: speech

api

Speechly public API definitions and generated code

Stars: ✭ 15 (-98.81%)

Mutual labels: speech-recognition

Libreasr

💬 An On-Premises, Streaming Speech Recognition System

Stars: ✭ 633 (-49.68%)

Mutual labels: speech-recognition

rnnt decoder cuda

An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.

Stars: ✭ 60 (-95.23%)

Mutual labels: speech-recognition

hifigan-denoiser

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Stars: ✭ 88 (-93%)

Mutual labels: speech

twang

Library for pure Rust advanced audio synthesis.

Stars: ✭ 83 (-93.4%)

Mutual labels: audio-processing

Patter

speech-to-text in pytorch

Stars: ✭ 71 (-94.36%)

Mutual labels: speech-recognition

salutejs

SmartApp Framework для создания навыков семейства Виртуальных Ассистентов "Салют" на языке JavaScript

Stars: ✭ 35 (-97.22%)

Mutual labels: speech-recognition

DuME

A fast, versatile, easy-to-use and cross-platform Media Encoder based on FFmpeg

Stars: ✭ 66 (-94.75%)

Mutual labels: audio-processing

Android-TTS-STT

One line solution for Android Text to speech(TTS) & Speech to Text(STT) translation problem

Stars: ✭ 77 (-93.88%)

Mutual labels: speech-recognition

Phormatics

Using A.I. and computer vision to build a virtual personal fitness trainer. (Most Startup-Viable Hack - HackNYU2018)

Stars: ✭ 79 (-93.72%)

Mutual labels: recognition

Aca Code

Matlab scripts accompanying the book "An Introduction to Audio Content Analysis" (www.AudioContentAnalysis.org)

Stars: ✭ 67 (-94.67%)

Mutual labels: audio-processing

Pncc

A implementation of Power Normalized Cepstral Coefficients: PNCC

Stars: ✭ 40 (-96.82%)

Mutual labels: speech-recognition

Audio cat dog classification

Classification of WAV files from cats and dogs

Stars: ✭ 16 (-98.73%)

Mutual labels: audio-processing

speech-to-text

mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras

Stars: ✭ 61 (-95.15%)

Mutual labels: speech-recognition

houndify-sdk-go

The official Houndify SDK for Go

Stars: ✭ 23 (-98.17%)

Mutual labels: speech-recognition

minutes

🔭 Speaker diarization via transfer learning

Stars: ✭ 25 (-98.01%)

Mutual labels: speech

telltime

iOS application to tell the time in the British way 🇬🇧⏰

Stars: ✭ 49 (-96.1%)

Mutual labels: speech-recognition

Openimager

Image processing Toolkit in R

Stars: ✭ 45 (-96.42%)

Mutual labels: recognition

hf-experiments

Experiments with Hugging Face 🔬 🤗

Stars: ✭ 37 (-97.06%)

Mutual labels: speech-recognition

video-audio-tools

To process/edit video and audio with Python+FFmpeg. [简单实用] 基于Python+FFmpeg的视频和音频的处理/剪辑。

Stars: ✭ 164 (-86.96%)

Mutual labels: audio-processing

UnitySoundManager

Sound manager with 3 tracks, language system, pooling system, Fade in/out effects, EventTrigger system and more.

Stars: ✭ 55 (-95.63%)

Mutual labels: audio-processing

Beethoven

🎸 A maestro of pitch detection.

Stars: ✭ 601 (-52.23%)

Mutual labels: audio-processing

SimpleCompressor

Code and theory of a look-ahead compressor / limiter.

Stars: ✭ 70 (-94.44%)

Mutual labels: audio-processing

ruby-magic

Simple interface to libmagic for Ruby Programming Language

Stars: ✭ 23 (-98.17%)

Mutual labels: recognition

Chords.py

Neural networks applied in recognizing guitar chords using python, AutoML.NET with C# and .NET Core

Stars: ✭ 24 (-98.09%)

Mutual labels: recognition

Sytody

a Flutter "speech to todo" app example

Stars: ✭ 79 (-93.72%)

Mutual labels: speech-recognition

ctc-asr

End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.

Stars: ✭ 112 (-91.1%)

Mutual labels: speech-recognition

pydiogment

📣 Python library for audio augmentation

Stars: ✭ 64 (-94.91%)

Mutual labels: audio-processing

kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Stars: ✭ 456 (-63.75%)

Mutual labels: speech-recognition

Tensorflowandroiddemo

TensorFlow android demo 车道线车辆人脸动作骨架识别检测抽烟打电话闭眼睁眼

Stars: ✭ 589 (-53.18%)

Mutual labels: recognition

Mycroft Precise

A lightweight, simple-to-use, RNN wake word listener

Stars: ✭ 481 (-61.76%)

Mutual labels: speech-recognition

scim

[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.

Stars: ✭ 17 (-98.65%)

Mutual labels: speech-recognition

tsunami

A simple but powerful audio editor

Stars: ✭ 41 (-96.74%)

Mutual labels: audio-processing

Tika Python

Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.

Stars: ✭ 997 (-20.75%)

Mutual labels: recognition

C++ Library for Audio Digital Signal Processing

Stars: ✭ 481 (-61.76%)

Mutual labels: audio-processing

361-420 of 689 similar projects