All Projects → Julius → Similar Projects or Alternatives

689 Open source projects that are alternatives of or similar to Julius

scripty
Speech to text bot for Discord using Mozilla's DeepSpeech
Stars: ✭ 14 (-98.89%)
Mutual labels:  speech-recognition
Vosk Server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Stars: ✭ 277 (-77.98%)
Mutual labels:  speech-recognition
python-soxr
Fast and high quality sample-rate conversion library for Python
Stars: ✭ 25 (-98.01%)
Mutual labels:  audio-processing
Wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
Stars: ✭ 5,907 (+369.55%)
Mutual labels:  speech-recognition
Transformer-Transducer
PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)
Stars: ✭ 61 (-95.15%)
Mutual labels:  speech-recognition
Vosk Android Demo
Offline speech recognition for Android with Vosk library.
Stars: ✭ 271 (-78.46%)
Mutual labels:  speech-recognition
timit-preprocessor
Extract mfcc vectors and phones from TIMIT dataset
Stars: ✭ 14 (-98.89%)
Mutual labels:  speech-recognition
Parrots
Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine for Chinese.
Stars: ✭ 48 (-96.18%)
Mutual labels:  speech-recognition
Audio Classification using LSTM
Classification of Urban Sound Audio Dataset using LSTM-based model.
Stars: ✭ 47 (-96.26%)
Mutual labels:  audio-processing
Nara wpe
Different implementations of "Weighted Prediction Error" for speech dereverberation
Stars: ✭ 265 (-78.93%)
Mutual labels:  audio-processing
rustfst
Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Stars: ✭ 104 (-91.73%)
Mutual labels:  speech-recognition
Awesome Diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Stars: ✭ 673 (-46.5%)
Mutual labels:  speech-recognition
DCASE2020 task1
Code for DCASE 2020 task 1a and task 1b.
Stars: ✭ 72 (-94.28%)
Mutual labels:  audio-processing
Iter Reason
Code for Iterative Reasoning Paper (CVPR 2018)
Stars: ✭ 263 (-79.09%)
Mutual labels:  recognition
Aurio
Audio Fingerprinting & Retrieval for .NET
Stars: ✭ 84 (-93.32%)
Mutual labels:  audio-processing
SpeechToText
Speech To Text in Android
Stars: ✭ 53 (-95.79%)
Mutual labels:  speech-recognition
Speech Aligner
speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription
Stars: ✭ 259 (-79.41%)
Mutual labels:  speech
fast-mixer
Mini recording and mixing studio for android
Stars: ✭ 47 (-96.26%)
Mutual labels:  audio-processing
Speech recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+376.87%)
Mutual labels:  speech-recognition
ml-with-audio
HF's ML for Audio study group
Stars: ✭ 104 (-91.73%)
Mutual labels:  speech-recognition
HotVoice
Adds Speech Recognition support to AutoHotkey, via a C# DLL
Stars: ✭ 41 (-96.74%)
Mutual labels:  speech-recognition
Tacotron
Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.
Stars: ✭ 493 (-60.81%)
Mutual labels:  speech
Fre-GAN-pytorch
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
Stars: ✭ 73 (-94.2%)
Mutual labels:  speech
End-to-End-Mandarin-ASR
End-to-end speech recognition on AISHELL dataset.
Stars: ✭ 20 (-98.41%)
Mutual labels:  speech-recognition
Noise2Noise-audio denoising without clean training data
Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoisi…
Stars: ✭ 49 (-96.1%)
Mutual labels:  speech
api
Speechly public API definitions and generated code
Stars: ✭ 15 (-98.81%)
Mutual labels:  speech-recognition
Libreasr
💬 An On-Premises, Streaming Speech Recognition System
Stars: ✭ 633 (-49.68%)
Mutual labels:  speech-recognition
rnnt decoder cuda
An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
Stars: ✭ 60 (-95.23%)
Mutual labels:  speech-recognition
hifigan-denoiser
HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Stars: ✭ 88 (-93%)
Mutual labels:  speech
twang
Library for pure Rust advanced audio synthesis.
Stars: ✭ 83 (-93.4%)
Mutual labels:  audio-processing
Patter
speech-to-text in pytorch
Stars: ✭ 71 (-94.36%)
Mutual labels:  speech-recognition
salutejs
SmartApp Framework для создания навыков семейства Виртуальных Ассистентов "Салют" на языке JavaScript
Stars: ✭ 35 (-97.22%)
Mutual labels:  speech-recognition
DuME
A fast, versatile, easy-to-use and cross-platform Media Encoder based on FFmpeg
Stars: ✭ 66 (-94.75%)
Mutual labels:  audio-processing
Android-TTS-STT
One line solution for Android Text to speech(TTS) & Speech to Text(STT) translation problem
Stars: ✭ 77 (-93.88%)
Mutual labels:  speech-recognition
Phormatics
Using A.I. and computer vision to build a virtual personal fitness trainer. (Most Startup-Viable Hack - HackNYU2018)
Stars: ✭ 79 (-93.72%)
Mutual labels:  recognition
Aca Code
Matlab scripts accompanying the book "An Introduction to Audio Content Analysis" (www.AudioContentAnalysis.org)
Stars: ✭ 67 (-94.67%)
Mutual labels:  audio-processing
Pncc
A implementation of Power Normalized Cepstral Coefficients: PNCC
Stars: ✭ 40 (-96.82%)
Mutual labels:  speech-recognition
Audio cat dog classification
Classification of WAV files from cats and dogs
Stars: ✭ 16 (-98.73%)
Mutual labels:  audio-processing
speech-to-text
mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
Stars: ✭ 61 (-95.15%)
Mutual labels:  speech-recognition
houndify-sdk-go
The official Houndify SDK for Go
Stars: ✭ 23 (-98.17%)
Mutual labels:  speech-recognition
minutes
🔭 Speaker diarization via transfer learning
Stars: ✭ 25 (-98.01%)
Mutual labels:  speech
telltime
iOS application to tell the time in the British way 🇬🇧⏰
Stars: ✭ 49 (-96.1%)
Mutual labels:  speech-recognition
Openimager
Image processing Toolkit in R
Stars: ✭ 45 (-96.42%)
Mutual labels:  recognition
hf-experiments
Experiments with Hugging Face 🔬 🤗
Stars: ✭ 37 (-97.06%)
Mutual labels:  speech-recognition
video-audio-tools
To process/edit video and audio with Python+FFmpeg. [简单实用] 基于Python+FFmpeg的视频和音频的处理/剪辑。
Stars: ✭ 164 (-86.96%)
Mutual labels:  audio-processing
UnitySoundManager
Sound manager with 3 tracks, language system, pooling system, Fade in/out effects, EventTrigger system and more.
Stars: ✭ 55 (-95.63%)
Mutual labels:  audio-processing
Beethoven
🎸 A maestro of pitch detection.
Stars: ✭ 601 (-52.23%)
Mutual labels:  audio-processing
SimpleCompressor
Code and theory of a look-ahead compressor / limiter.
Stars: ✭ 70 (-94.44%)
Mutual labels:  audio-processing
ruby-magic
Simple interface to libmagic for Ruby Programming Language
Stars: ✭ 23 (-98.17%)
Mutual labels:  recognition
Chords.py
Neural networks applied in recognizing guitar chords using python, AutoML.NET with C# and .NET Core
Stars: ✭ 24 (-98.09%)
Mutual labels:  recognition
Sytody
a Flutter "speech to todo" app example
Stars: ✭ 79 (-93.72%)
Mutual labels:  speech-recognition
ctc-asr
End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
Stars: ✭ 112 (-91.1%)
Mutual labels:  speech-recognition
pydiogment
📣 Python library for audio augmentation
Stars: ✭ 64 (-94.91%)
Mutual labels:  audio-processing
kospeech
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Stars: ✭ 456 (-63.75%)
Mutual labels:  speech-recognition
Tensorflowandroiddemo
TensorFlow android demo 车道线 车辆 人脸 动作 骨架 识别 检测 抽烟 打电话 闭眼 睁眼
Stars: ✭ 589 (-53.18%)
Mutual labels:  recognition
Mycroft Precise
A lightweight, simple-to-use, RNN wake word listener
Stars: ✭ 481 (-61.76%)
Mutual labels:  speech-recognition
scim
[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.
Stars: ✭ 17 (-98.65%)
Mutual labels:  speech-recognition
tsunami
A simple but powerful audio editor
Stars: ✭ 41 (-96.74%)
Mutual labels:  audio-processing
Tika Python
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Stars: ✭ 997 (-20.75%)
Mutual labels:  recognition
Q
C++ Library for Audio Digital Signal Processing
Stars: ✭ 481 (-61.76%)
Mutual labels:  audio-processing
361-420 of 689 similar projects