362 open source projects that are alternatives to or similar to Stl

Xr3player

🎧 🎼 Advanced JavaFX Media Player

Stars: ✭ 472 (+972.73%)

Mutual labels: speech, audio-processing

Emotion Classification From Audio Files

Understanding emotions from audio files using neural networks and multiple datasets.

Stars: ✭ 189 (+329.55%)

Mutual labels: speech, audio-processing

Julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

Stars: ✭ 1,258 (+2759.09%)

Mutual labels: speech, audio-processing

audio noise clustering

An experiment with a variety of clustering (and clustering-like) techniques to reduce noise in an audio speech recording. Results: https://dodiku.github.io/audio_noise_clustering/results/

Stars: ✭ 24 (-45.45%)

Mutual labels: speech, audio-processing

Praat

Praat: Doing Phonetics By Computer

Stars: ✭ 675 (+1434.09%)

Mutual labels: speech

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+1013.64%)

Mutual labels: speech

Nnaudio

Audio processing using PyTorch 1D convolution networks

Stars: ✭ 428 (+872.73%)

Mutual labels: audio-processing

Auto Editor

Auto-Editor: Effort-free video editing!

Stars: ✭ 382 (+768.18%)

Mutual labels: audio-processing

Arcan

Arcan - [Display Server, Multimedia Framework, Game Engine] -> "Desktop Engine"

Stars: ✭ 885 (+1911.36%)

Mutual labels: audio-processing

Beethoven

🎸 A maestro of pitch detection.

Stars: ✭ 601 (+1265.91%)

Mutual labels: audio-processing

Twilio Java

A Java library for communicating with the Twilio REST API and generating TwiML.

Stars: ✭ 371 (+743.18%)

Mutual labels: telephony

Wave U Net

Implementation of the Wave-U-Net for audio source separation

Stars: ✭ 506 (+1050%)

Mutual labels: audio-processing

Annyang

💬 Speech recognition for your site

Stars: ✭ 6,216 (+14027.27%)

Mutual labels: speech

C++ Library for Audio Digital Signal Processing

Stars: ✭ 481 (+993.18%)

Mutual labels: audio-processing

Introduction To Programming With Matlab

Coursera Course: Introduction to Programming 👩‍💻 with MATLAB ~by Vanderbilt University 🎓

Stars: ✭ 23 (-47.73%)

Mutual labels: audio-processing

Specaugment

An implementation of SpecAugment, introduced by Google Brain, with TensorFlow and PyTorch

Stars: ✭ 408 (+827.27%)

Mutual labels: speech

Speech Emotion Analyzer

A neural network model capable of detecting five different male/female emotions from speech audio. (Deep Learning, NLP, Python)

Stars: ✭ 633 (+1338.64%)

Mutual labels: speech

Musig

A Shazam-like tool to store song fingerprints and retrieve them

Stars: ✭ 388 (+781.82%)

Mutual labels: audio-processing

Kfr

Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)

Stars: ✭ 985 (+2138.64%)

Mutual labels: audio-processing

Audio Visualizer Android

🎵 [Android Library] A light-weight and easy-to-use Audio Visualizer for Android.

Stars: ✭ 581 (+1220.45%)

Mutual labels: audio-processing

Voice Builder

An open-source text-to-speech (TTS) voice building tool

Stars: ✭ 362 (+722.73%)

Mutual labels: speech

Aaxaudioconverter

Convert Audible aax files to mp3 and m4a/m4b

Stars: ✭ 336 (+663.64%)

Mutual labels: audio-processing

Tts

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Stars: ✭ 5,427 (+12234.09%)

Mutual labels: speech

Baresip

Baresip is a modular SIP User-Agent with audio and video support

Stars: ✭ 817 (+1756.82%)

Mutual labels: telephony

Soundfingerprinting

Open source audio fingerprinting in .NET. An efficient algorithm for acoustic fingerprinting written purely in C#.

Stars: ✭ 554 (+1159.09%)

Mutual labels: audio-processing

Eqmac

macOS System-wide Audio Equalizer & Volume Mixer 🎧

Stars: ✭ 3,947 (+8870.45%)

Mutual labels: audio-processing

Speech Denoising Wavenet

A neural network for end-to-end speech denoising

Stars: ✭ 516 (+1072.73%)

Mutual labels: speech

Audino

Open source audio annotation tool for humans™

Stars: ✭ 740 (+1581.82%)

Mutual labels: audio-processing

Tacotron

Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.

Stars: ✭ 493 (+1020.45%)

Mutual labels: speech

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-29.55%)

Mutual labels: speech

Ffmediaelement

FFME: The Advanced WPF MediaElement (based on FFmpeg)

Stars: ✭ 733 (+1565.91%)

Mutual labels: audio-processing

Vectorhub

Vector Hub - Library for easy discovery and consumption of state-of-the-art models to turn data into vectors. (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc.)

Stars: ✭ 317 (+620.45%)

Mutual labels: audio-processing

Cboard

AAC communication system with text-to-speech for the browser

Stars: ✭ 437 (+893.18%)

Mutual labels: speech

Discordspeechbot

A speech-to-text bot for Discord with music commands and more, built with Node.js. Ideal for controlling your Discord server using voice commands; it can also be useful for hearing-impaired people.

Stars: ✭ 35 (-20.45%)

Mutual labels: speech

Mobly

E2E test framework for tests with complex environment requirements.

Stars: ✭ 424 (+863.64%)

Mutual labels: telephony

Segan

Speech Enhancement Generative Adversarial Network in TensorFlow

Stars: ✭ 661 (+1402.27%)

Mutual labels: speech

Neural sp

End-to-end ASR/LM implementation with PyTorch

Stars: ✭ 408 (+827.27%)

Mutual labels: speech

Giada

Your Hardcore Loop Machine.

Stars: ✭ 903 (+1952.27%)

Mutual labels: audio-processing

Awesome Kaldi

This is a list of features, scripts, blogs and resources for making better use of Kaldi (http://kaldi-asr.org/)

Stars: ✭ 393 (+793.18%)

Mutual labels: speech

Vad

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

Stars: ✭ 622 (+1313.64%)

Mutual labels: speech

Voice Converter Cyclegan

Voice Converter Using CycleGAN and Non-Parallel Data

Stars: ✭ 384 (+772.73%)

Mutual labels: speech

Dialectid e2e

End to End Dialect Identification using Convolutional Neural Network

Stars: ✭ 40 (-9.09%)

Mutual labels: speech

Tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Stars: ✭ 305 (+593.18%)

Mutual labels: speech

Tracktion engine

Tracktion Engine module

Stars: ✭ 587 (+1234.09%)

Mutual labels: audio-processing

Inaspeechsegmenter

CNN-based audio segmentation toolkit. Detects speech, music and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.

Stars: ✭ 352 (+700%)

Mutual labels: speech

Mlt

MLT Multimedia Framework

Stars: ✭ 836 (+1800%)

Mutual labels: audio-processing

Dplug

Audio plugin framework. VST2/VST3/AU/AAX/LV2 for Linux/macOS/Windows.

Stars: ✭ 341 (+675%)

Mutual labels: audio-processing

Klio

Smarter data pipelines for audio.

Stars: ✭ 560 (+1172.73%)

Mutual labels: audio-processing

Ios 10 Sampler

Code examples for new APIs of iOS 10.

Stars: ✭ 3,341 (+7493.18%)

Mutual labels: speech

Wsay

Windows "say"

Stars: ✭ 36 (-18.18%)

Mutual labels: speech

Surfboard

Novoic's audio feature extraction library

Stars: ✭ 318 (+622.73%)

Mutual labels: audio-processing

Chromaprint

C library for generating audio fingerprints used by AcoustID

Stars: ✭ 553 (+1156.82%)

Mutual labels: audio-processing

Sincnet

SincNet is a neural architecture for efficiently processing raw audio samples.

Stars: ✭ 764 (+1636.36%)

Mutual labels: audio-processing

Android Speech

Android speech recognition and text to speech made easy

Stars: ✭ 310 (+604.55%)

Mutual labels: speech

Nodejs Speech

Node.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.

Stars: ✭ 545 (+1138.64%)

Mutual labels: speech

Dali

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

Stars: ✭ 3,624 (+8136.36%)

Mutual labels: audio-processing

Css10

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

Stars: ✭ 302 (+586.36%)

Mutual labels: speech

Twilio Csharp

Twilio C#/.NET Helper Library for .NET Framework 3.5+ and supported .NET Core versions

Stars: ✭ 541 (+1129.55%)

Mutual labels: telephony

Dc tts

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model