3D augmentation and transforms of 2D/3D sparse data, such as 3D triangle meshes or point clouds in Euclidean space. Extension of the Fast.ai library to train Sub-manifold Sparse Convolution Networks

Stars: ✭ 46 (+100%)

Mutual labels: data-augmentation

KoEDA

Korean Easy Data Augmentation

Stars: ✭ 62 (+169.57%)

Mutual labels: data-augmentation

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Stars: ✭ 205 (+791.3%)

Mutual labels: speech-recognition

Rus-SpeechRecognition-LSTM-CTC-VoxForge

Распознавание речи русского языка используя Tensorflow, обучаясь на базе Voxforge

Stars: ✭ 50 (+117.39%)

Mutual labels: speech-recognition

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Stars: ✭ 2,384 (+10265.22%)

Mutual labels: speech-recognition

formulas-python

Ritchie CLI formulas in Python 🐍

Stars: ✭ 17 (-26.09%)

Mutual labels: speech-recognition

pytorch audio

audio processing module for pytorch:stft, istft

Stars: ✭ 33 (+43.48%)

Mutual labels: speech-recognition

Chinese-automatic-speech-recognition

Chinese speech recognition

Stars: ✭ 147 (+539.13%)

Mutual labels: speech-recognition

pocketsphinx

Updated ROS bindings to pocketsphinx

Stars: ✭ 36 (+56.52%)

Mutual labels: speech-recognition

semantic-parsing-dual

Source code and data for ACL 2019 Long Paper ``Semantic Parsing with Dual Learning".

Stars: ✭ 17 (-26.09%)

Mutual labels: data-augmentation

scripty

Speech to text bot for Discord using Mozilla's DeepSpeech

Stars: ✭ 14 (-39.13%)

Mutual labels: speech-recognition

revai-node-sdk

Node.js SDK for the Rev AI API

Stars: ✭ 21 (-8.7%)

Mutual labels: speech-recognition

bird species classification

Supervised Classification of bird species 🐦 in high resolution images, especially for, Himalayan birds, having diverse species with fairly low amount of labelled data

Stars: ✭ 59 (+156.52%)

Mutual labels: data-augmentation

deep avsr

A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.

Stars: ✭ 104 (+352.17%)

Mutual labels: speech-recognition

quran-align

Word-accurate timestamps for Qur'anic audio.

Stars: ✭ 139 (+504.35%)

Mutual labels: speech-recognition

spokestack-ios

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (+17.39%)

Mutual labels: speech-recognition

Speech-Recognition

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

Stars: ✭ 21 (-8.7%)

Mutual labels: speech-recognition

react-client

An React client library for Speechly API

Stars: ✭ 71 (+208.7%)

Mutual labels: speech-recognition

traj-pred-irl

Official implementation codes of "Regularizing neural networks for future trajectory prediction via IRL framework"

Stars: ✭ 23 (+0%)

Mutual labels: regularization

kaldi-long-audio-alignment

Long audio alignment using Kaldi

Stars: ✭ 21 (-8.7%)

Mutual labels: speech-recognition

UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Stars: ✭ 224 (+873.91%)

Mutual labels: speech-recognition

speechless

Speech-to-text based on wav2letter built for transfer learning

Stars: ✭ 92 (+300%)

Mutual labels: speech-recognition

Regularization-Pruning

[ICLR'21] PyTorch code for our paper "Neural Pruning via Growing Regularization"

Stars: ✭ 44 (+91.3%)

Mutual labels: regularization

mongolian-nlp

Useful resources for Mongolian NLP

Stars: ✭ 119 (+417.39%)

Mutual labels: speech-recognition

NLP Toolkit

Library of state-of-the-art models (PyTorch) for NLP tasks

Stars: ✭ 92 (+300%)

Mutual labels: speech-recognition

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+3556.52%)

Mutual labels: speech-recognition

webdataset

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Stars: ✭ 816 (+3447.83%)

Mutual labels: data-augmentation

numpy-neuralnet-exercise

Implementation of key concepts of neuralnetwork via numpy

Stars: ✭ 49 (+113.04%)

Mutual labels: regularization

Machine Learning From Scratch

Machine Learning models from scratch with a better visualisation

Stars: ✭ 15 (-34.78%)

Mutual labels: regularization

Keras-MultiClass-Image-Classification

Multiclass image classification using Convolutional Neural Network

Stars: ✭ 48 (+108.7%)

Mutual labels: data-augmentation

speech to text

how to use the Google Cloud Speech API to transcribe audio/video files.

Stars: ✭ 35 (+52.17%)

Mutual labels: speech-recognition

favorite-research-papers

Listing my favorite research papers 📝 from different fields as I read them.

Stars: ✭ 12 (-47.83%)

Mutual labels: speech-recognition

soxan

Wav2Vec for speech recognition, classification, and audio classification

Stars: ✭ 113 (+391.3%)

Mutual labels: speech-recognition

VoiceDictation

迅飞语音听写 WebAPI - 把语音(≤60秒)转换成对应的文字信息，让机器能够“听懂”人类语言，相当于给机器安装上“耳朵”，使其具备“能听”的功能。

Stars: ✭ 36 (+56.52%)

Mutual labels: speech-recognition

lightning-asr

Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.

Stars: ✭ 36 (+56.52%)

Mutual labels: speech-recognition

keras-transform

Library for data augmentation

Stars: ✭ 31 (+34.78%)

Mutual labels: data-augmentation

Awesome-Few-Shot-Image-Generation

A curated list of papers, code and resources pertaining to few-shot image generation.

Stars: ✭ 209 (+808.7%)

Mutual labels: data-augmentation

Transformer-Transducer

PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)

Stars: ✭ 61 (+165.22%)

Mutual labels: speech-recognition

timit-preprocessor

Extract mfcc vectors and phones from TIMIT dataset

Stars: ✭ 14 (-39.13%)

Mutual labels: speech-recognition

vosk-asterisk

Speech Recognition in Asterisk with Vosk Server

Stars: ✭ 52 (+126.09%)

Mutual labels: speech-recognition

Tensorflow-Keyword-Spotting

Keyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.

Stars: ✭ 27 (+17.39%)

Mutual labels: speech-recognition

speech-recognition-transfer-learning

Speech command recognition DenseNet transfer learning from UrbanSound8k in keras tensorflow

Stars: ✭ 18 (-21.74%)

Mutual labels: speech-recognition

consistency

Implementation of models in our EMNLP 2019 paper: A Logic-Driven Framework for Consistency of Neural Models

Stars: ✭ 26 (+13.04%)

Mutual labels: regularization

1-60 of 460 similar projects

›

next*5