pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.

Stars: ✭ 2,097 (+8288%)

Mutual labels: speech-recognition, asr

Asr Evaluation

Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).

Stars: ✭ 190 (+660%)

Mutual labels: speech-recognition, asr

leopard

On-device speech-to-text engine powered by deep learning

Stars: ✭ 354 (+1316%)

Mutual labels: speech-recognition, asr

ASR-Audio-Data-Links

A list of publicly available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (+616%)

Mutual labels: speech-recognition, asr

speech-recognition-evaluation

Evaluate results from ASR/Speech-to-Text quickly

Stars: ✭ 25 (+0%)

Mutual labels: speech-recognition, asr

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (+112%)

Mutual labels: speech-recognition, asr

ctc-asr

End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.

Stars: ✭ 112 (+348%)

Mutual labels: speech-recognition, asr

vosk-asterisk

Speech Recognition in Asterisk with Vosk Server

Stars: ✭ 52 (+108%)

Mutual labels: speech-recognition, asr

PCPM

Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.

Stars: ✭ 21 (-16%)

Mutual labels: speech-recognition, asr

rustfst

Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.

Stars: ✭ 104 (+316%)

Mutual labels: speech-recognition, asr

Ktspeechcrawler

Automatically constructing corpus for automatic speech recognition from YouTube videos

Stars: ✭ 92 (+268%)

Mutual labels: speech-recognition, asr

Speech-Recognition

End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow

Stars: ✭ 21 (-16%)

Mutual labels: end-to-end, speech-recognition

syn-speech-samples

An application that demonstrates the usage of the Syn.Speech library for speech recognition

Stars: ✭ 24 (-4%)

Mutual labels: speech-recognition, asr

SOLQ

"SOLQ: Segmenting Objects by Learning Queries", SOLQ is an end-to-end instance segmentation framework with Transformer.

Stars: ✭ 159 (+536%)

Mutual labels: end-to-end, transformer

OverlapPredator

[CVPR 2021, Oral] PREDATOR: Registration of 3D Point Clouds with Low Overlap.

Stars: ✭ 293 (+1072%)

Mutual labels: transformer

mixup

speechpro.com/

Stars: ✭ 23 (-8%)

Mutual labels: speech-recognition

NLP Toolkit

Library of state-of-the-art models (PyTorch) for NLP tasks

Stars: ✭ 92 (+268%)

Mutual labels: speech-recognition

visualization

A collection of visualization functions

Stars: ✭ 189 (+656%)

Mutual labels: transformer

Transformer-in-Transformer

An Implementation of Transformer in Transformer in TensorFlow for image classification, attention inside local patches

Stars: ✭ 40 (+60%)

Mutual labels: transformer

Restormer

[CVPR 2022--Oral] Restormer: Efficient Transformer for High-Resolution Image Restoration. SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.

Stars: ✭ 586 (+2244%)

Mutual labels: transformer

speech-to-text-code-pattern

React app using the Watson Speech to Text service to transform voice audio into written text.

Stars: ✭ 37 (+48%)

Mutual labels: speech-recognition

formulas-python

Ritchie CLI formulas in Python 🐍

Stars: ✭ 17 (-32%)

Mutual labels: speech-recognition

Learning-Lab-C-Library

This library provides a set of basic functions for different types of deep learning (and other) algorithms in C. This deep learning library will be constantly updated.

Stars: ✭ 20 (-20%)

Mutual labels: transformer

soxan

Wav2Vec for speech recognition, classification, and audio classification

Stars: ✭ 113 (+352%)

Mutual labels: speech-recognition

laravel5-hal-json

Laravel 5 HAL+JSON API Transformer Package

Stars: ✭ 15 (-40%)

Mutual labels: transformer

CCAligner

🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.

Stars: ✭ 131 (+424%)

Mutual labels: speech-recognition

segmenter

[ICCV2021] Official PyTorch implementation of Segmenter: Transformer for Semantic Segmentation

Stars: ✭ 463 (+1752%)

Mutual labels: transformer

Walk-Transformer

From Random Walks to Transformer for Learning Node Embeddings (ECML-PKDD 2020) (In Pytorch and Tensorflow)

Stars: ✭ 26 (+4%)

Mutual labels: transformer

Relation-Extraction-Transformer

NLP: Relation extraction with position-aware self-attention transformer

Stars: ✭ 63 (+152%)

Mutual labels: transformer

DeepPhonemizer

Grapheme to phoneme conversion with deep learning.

Stars: ✭ 152 (+508%)

Mutual labels: transformer

FNet-pytorch

Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms

Stars: ✭ 204 (+716%)

Mutual labels: transformer

Tensorflow-Keyword-Spotting

Keyword spotting using various architectures, such as a convolutional VGGNet, a 1D convolutional network, and CTC.

Stars: ✭ 27 (+8%)

Mutual labels: speech-recognition

pytorch-transformer-chatbot

A simple chitchat chatbot built with the Transformer API introduced in PyTorch v1.2

Stars: ✭ 44 (+76%)

Mutual labels: transformer

FragmentVC

Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention

Stars: ✭ 134 (+436%)

Mutual labels: transformer

graphtrans

Representing Long-Range Context for Graph Neural Networks with Global Attention

Stars: ✭ 45 (+80%)

Mutual labels: transformer

golgotha

Contextualised Embeddings and Language Modelling using BERT and Friends using R

Stars: ✭ 39 (+56%)

Mutual labels: transformer

Xpersona

XPersona: Evaluating Multilingual Personalized Chatbot

Stars: ✭ 54 (+116%)

Mutual labels: transformer

Behavior-Cloning

End-to-end learning for self-driving

Stars: ✭ 25 (+0%)

Mutual labels: end-to-end

commonvoice-utils

Linguistic processing for Common Voice

Stars: ✭ 32 (+28%)

Mutual labels: asr

rosecho

Tianbot Rosecho (Tianecho), a Chinese-language voice human-computer interaction module with plug-and-play ROS support

Stars: ✭ 28 (+12%)

Mutual labels: speech-recognition

transformer

Neutron: A PyTorch-based implementation of Transformer and its variants.

Stars: ✭ 60 (+140%)

Mutual labels: transformer

enformer-pytorch

Implementation of Enformer, Deepmind's attention network for predicting gene expression, in Pytorch

Stars: ✭ 146 (+484%)

Mutual labels: transformer

laravel-scene

Laravel Transformer

Stars: ✭ 27 (+8%)

Mutual labels: transformer

PDN

The official PyTorch implementation of "Pathfinder Discovery Networks for Neural Message Passing" (WebConf '21)

Stars: ✭ 44 (+76%)

Mutual labels: transformer

tf2-transformer-chatbot

Transformer Chatbot in TensorFlow 2 with TPU support.

Stars: ✭ 94 (+276%)

Mutual labels: transformer

Transformer-ocr

Handwritten text recognition using transformers.

Stars: ✭ 92 (+268%)

Mutual labels: transformer

tutel

Tutel MoE: An Optimized Mixture-of-Experts Implementation

Stars: ✭ 183 (+632%)

Mutual labels: transformer

TS-CAM

Code for TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization.

Stars: ✭ 96 (+284%)

Mutual labels: transformer

image-classification

A collection of SOTA Image Classification Models in PyTorch

Stars: ✭ 70 (+180%)

Mutual labels: transformer

ASVspoof PA

No description or website provided.

Stars: ✭ 22 (-12%)

Mutual labels: end-to-end

RSTNet

RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words (CVPR 2021)

Stars: ✭ 71 (+184%)

Mutual labels: transformer

towhee

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

Stars: ✭ 821 (+3184%)

Mutual labels: transformer

A chronology of deep learning

Tracing back and exposing in chronological order the main ideas in the field of deep learning, to help everyone better understand the current intense research in AI.

Stars: ✭ 47 (+88%)

Mutual labels: speech-recognition

Image-Caption

Using LSTM or Transformer to solve Image Captioning in Pytorch

Stars: ✭ 36 (+44%)

Mutual labels: transformer

text2keywords

Trained T5 and T5-large model for creating keywords from text

Stars: ✭ 53 (+112%)

Mutual labels: transformer

61-120 of 750 similar projects
