Avsr Deep SpeechGoogle Summer of Code 2017 Project: Development of Speech Recognition Module for Red Hen Lab
Stars: ✭ 43 (-89.14%)
SwiftspeechA speech recognition framework designed for SwiftUI.
Stars: ✭ 149 (-62.37%)
SincnetSincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+92.93%)
LingvoLingvo
Stars: ✭ 2,361 (+496.21%)
AudiomatePython library for handling audio datasets.
Stars: ✭ 99 (-75%)
Medmnist[ISBI'21] MedMNIST Classification Decathlon: A Lightweight AutoML Benchmark for Medical Image Analysis
Stars: ✭ 338 (-14.65%)
Keras SincnetKeras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-88.13%)
Speech recognitionSpeech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+1414.9%)
Fashion MnistA MNIST-like fashion product database. Benchmark 👇
Stars: ✭ 9,675 (+2343.18%)
Esc 50ESC-50: Dataset for Environmental Sound Classification
Stars: ✭ 631 (+59.34%)
MultidigitmnistCombine multiple MNIST digits to create datasets with 100/1000 classes for few-shot learning/meta-learning
Stars: ✭ 48 (-87.88%)
MirdataPython library to work with Music Information Retrieval datasets
Stars: ✭ 170 (-57.07%)
TswechatA WeChat alternative. Written in Swift 5.
Stars: ✭ 3,674 (+827.78%)
Midiwriterjs♬ A JavaScript library which provides an API for programmatically generating and creating expressive multi-track MIDI files and JSON objects.
Stars: ✭ 381 (-3.79%)
Pytorch Mnist Celeba Gan DcganPytorch implementation of Generative Adversarial Networks (GAN) and Deep Convolutional Generative Adversarial Networks (DCGAN) for MNIST and CelebA datasets
Stars: ✭ 363 (-8.33%)
EspnetEnd-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+1044.7%)
Comma2k19A driving dataset for the development and validation of fused pose estimators and mapping algorithms
Stars: ✭ 391 (-1.26%)
DataPython related videos and metadata powering =>
Stars: ✭ 355 (-10.35%)
Rythm.jsA javascript library that makes your page dance.
Stars: ✭ 3,755 (+848.23%)
Sfml.netOfficial binding of SFML for .Net languages
Stars: ✭ 354 (-10.61%)
QtoxqTox is a chat, voice, video, and file transfer IM client using the encrypted peer-to-peer Tox protocol.
Stars: ✭ 3,843 (+870.45%)
Duckhunt JsDuckHunt ported to JS and HTML5
Stars: ✭ 390 (-1.52%)
MusigA shazam like tool to store songs fingerprints and retrieve them
Stars: ✭ 388 (-2.02%)
SubsyncSubtitle Speech Synchronizer
Stars: ✭ 379 (-4.29%)
UniversalviewerA community-developed open source project on a mission to help you share your 📚📜📰📽️📻🗿 with the 🌎
Stars: ✭ 343 (-13.38%)
EartrumpetEarTrumpet - Volume Control for Windows
Stars: ✭ 4,761 (+1102.27%)
SupercolliderjsThe JavaScript client library for SuperCollider
Stars: ✭ 381 (-3.79%)
Lissajous🎵 A tool for programmatic audio performance in the browser using Javascript.
Stars: ✭ 367 (-7.32%)
Cmu MultimodalsdkCMU MultimodalSDK is a machine learning platform for development of advanced multimodal models as well as easily accessing and processing multimodal datasets.
Stars: ✭ 388 (-2.02%)
Radiodroidradio browser app that uses www.radio-browser.info on android
Stars: ✭ 362 (-8.59%)
VpgnetVPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition (ICCV 2017)
Stars: ✭ 382 (-3.54%)
Audioreadcross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding for Python
Stars: ✭ 359 (-9.34%)
MystiqQt5/C++ FFmpeg Media Converter
Stars: ✭ 393 (-0.76%)
Howler.jsJavascript audio library for the modern web.
Stars: ✭ 19,425 (+4805.3%)
Libfaceidlibfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
Stars: ✭ 354 (-10.61%)
BeatsA command-line drum machine. Convert a beat notated in YAML into a *.wav file.
Stars: ✭ 389 (-1.77%)
Zamia SpeechOpen tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (-5.56%)
BrevitasBrevitas: quantization-aware training in PyTorch
Stars: ✭ 343 (-13.38%)
Awesome KaldiThis is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (-0.76%)
Dukemtmc Reid evaluationICCV2017 The Person re-ID Evaluation Code for DukeMTMC-reID Dataset (Including Dataset Download)
Stars: ✭ 344 (-13.13%)
TfrecordTFRecord reader for PyTorch
Stars: ✭ 377 (-4.8%)
Awesome Music ProductionA curated list of software, services and resources to create and distribute music.
Stars: ✭ 340 (-14.14%)
Alan Sdk WebAlan AI Web SDK adds a voice assistant or chatbot to your app. Supports React, Angular, Vue, Ember, JavaScript, Electron.
Stars: ✭ 368 (-7.07%)
Dsprites DatasetDataset to assess the disentanglement properties of unsupervised learning methods
Stars: ✭ 340 (-14.14%)
Eseur Code DataCode and data used to create the examples in "Evidence-based Software Engineering based on the publicly available data"
Stars: ✭ 340 (-14.14%)
SupercolliderAn audio server, programming language, and IDE for sound synthesis and algorithmic composition.
Stars: ✭ 4,036 (+919.19%)
PcamThe PatchCamelyon (PCam) deep learning classification benchmark.
Stars: ✭ 340 (-14.14%)
Deeperforensics 1.0[CVPR 2020] A Large-Scale Dataset for Real-World Face Forgery Detection
Stars: ✭ 338 (-14.65%)
NmtpytorchSequence-to-Sequence Framework in PyTorch
Stars: ✭ 392 (-1.01%)
CheetahOn-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (-3.28%)
SnapcastSynchronous multiroom audio player
Stars: ✭ 4,028 (+917.17%)
LaspvfxAudio reactive Unity VFX with LASP
Stars: ✭ 337 (-14.9%)
SpectralizerAudio visualizer plugin for obs-studio
Stars: ✭ 332 (-16.16%)