668 open source projects that are alternatives to or similar to speech-transformer

Graphormer

Graphormer is a deep learning package that allows researchers and developers to train custom models for molecule modeling tasks. It aims to accelerate research and applications in AI for molecular science, such as material design and drug discovery.

Stars: ✭ 1,194 (+2885%)

Mutual labels: transformer

Dc tts

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

Stars: ✭ 1,017 (+2442.5%)

Mutual labels: speech

php-serializer

Serialize PHP variables, including objects, in any format. Supports unserializing them too.

Stars: ✭ 47 (+17.5%)

Mutual labels: transformer

ru-dalle

Generate images from text, in Russian

Stars: ✭ 1,606 (+3915%)

Mutual labels: transformer

sdk-android

Tanker client-side encryption SDK for Android

Stars: ✭ 14 (-65%)

Mutual labels: end-to-end

Annyang

💬 Speech recognition for your site

Stars: ✭ 6,216 (+15440%)

Mutual labels: speech

KitanaQA

KitanaQA: Adversarial training and data augmentation for neural question-answering models

Stars: ✭ 58 (+45%)

Mutual labels: transformer

Segan

Speech Enhancement Generative Adversarial Network in TensorFlow

Stars: ✭ 661 (+1552.5%)

Mutual labels: speech

ViTs-vs-CNNs

[NeurIPS 2021]: Are Transformers More Robust Than CNNs? (PyTorch implementation & checkpoints)

Stars: ✭ 145 (+262.5%)

Mutual labels: transformer

laravel-scene

Laravel Transformer

Stars: ✭ 27 (-32.5%)

Mutual labels: transformer

room-impulse-responses

A list of publicly available room impulse response datasets and scripts to download them.

Stars: ✭ 143 (+257.5%)

Mutual labels: speech

DSTC6-End-to-End-Conversation-Modeling

DSTC6: End-to-End Conversation Modeling Track

Stars: ✭ 56 (+40%)

Mutual labels: end-to-end

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (+1230%)

Mutual labels: speech

Representation-Learning-for-Information-Extraction

PyTorch implementation of the Google Research paper "Representation Learning for Information Extraction from Form-like Documents".

Stars: ✭ 82 (+105%)

Mutual labels: transformer

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+1125%)

Mutual labels: speech

tensorflow-ml-nlp-tf2

Practice materials for "Natural Language Processing Starting with TensorFlow 2 and Machine Learning (from Logistic Regression to BERT and GPT-3)"

Stars: ✭ 245 (+512.5%)

Mutual labels: transformer

Cboard

AAC communication system with text-to-speech for the browser

Stars: ✭ 437 (+992.5%)

Mutual labels: speech

Awesome-low-level-vision-resources

A curated list of resources for Low-level Vision Tasks

Stars: ✭ 35 (-12.5%)

Mutual labels: transformer

AESRC2020

a deep accent recognition network

Stars: ✭ 35 (-12.5%)

Mutual labels: asr

Voice Converter Cyclegan

Voice Converter Using CycleGAN and Non-Parallel Data

Stars: ✭ 384 (+860%)

Mutual labels: speech

ventib

📈 Ventib records your voice, transcribes it in realtime, and performs speech pattern analysis to give you objective statistics about how you speak.

Stars: ✭ 43 (+7.5%)

Mutual labels: speech

Voice Builder

An open-source text-to-speech (TTS) voice building tool

Stars: ✭ 362 (+805%)

Mutual labels: speech

KoLM

Korean text normalization and language preparation package for LM in Kaldi-based ASR system

Stars: ✭ 46 (+15%)

Mutual labels: asr

Tts

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Stars: ✭ 5,427 (+13467.5%)

Mutual labels: speech

bytekit

A Java utility library for byte manipulation (not a bytecode library)

Stars: ✭ 40 (+0%)

Mutual labels: transformer

Android Speech

Android speech recognition and text to speech made easy

Stars: ✭ 310 (+675%)

Mutual labels: speech

spokestack-ios

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (-32.5%)

Mutual labels: asr

Pocketsphinx Python

Python interface to CMU Sphinxbase and Pocketsphinx libraries

Stars: ✭ 298 (+645%)

Mutual labels: speech

Speech-Recognition

End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow

Stars: ✭ 21 (-47.5%)

Mutual labels: end-to-end

Sednn

Deep-learning-based speech enhancement using Keras or PyTorch, made easy to use

Stars: ✭ 288 (+620%)

Mutual labels: speech

DolboNet

Russian-language chatbot for Discord based on the Transformer architecture

Stars: ✭ 53 (+32.5%)

Mutual labels: transformer

Speech Aligner

speech-aligner is a tool that generates phoneme-level time alignments between human speech and its transcription.

Stars: ✭ 259 (+547.5%)

Mutual labels: speech

pie

Baidu Cloud streaming speech recognition client SDK

Stars: ✭ 62 (+55%)

Mutual labels: asr

Noise2Noise-audio denoising without clean training data

Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoisi…

Stars: ✭ 49 (+22.5%)

Mutual labels: speech

TransMorph Transformer for Medical Image Registration

TransMorph: Transformer for Unsupervised Medical Image Registration (PyTorch)

Stars: ✭ 130 (+225%)

Mutual labels: transformer

minutes

🔭 Speaker diarization via transfer learning

Stars: ✭ 25 (-37.5%)

Mutual labels: speech

MASTER-pytorch

Code for the paper "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021)

Stars: ✭ 263 (+557.5%)

Mutual labels: transformer

Kevinpro-NLP-demo

All NLP You Need Here. A personal collection of fun NLP demos, currently including PyTorch implementations of 13 NLP applications.

Stars: ✭ 117 (+192.5%)

Mutual labels: transformer

TabFormer

Code & Data for "Tabular Transformers for Modeling Multivariate Time Series" (ICASSP, 2021)

Stars: ✭ 209 (+422.5%)

Mutual labels: transformer

Voice Gender

Gender recognition by voice and speech analysis

Stars: ✭ 248 (+520%)

Mutual labels: speech

cypress-example-docker-circle-workflows

Cypress + Docker + CircleCI Workflows = ❤️

Stars: ✭ 29 (-27.5%)

Mutual labels: end-to-end

Speech256

An FPGA implementation of a classic '80s speech synthesizer. Done for the Retro Challenge 2017/10.

Stars: ✭ 51 (+27.5%)

Mutual labels: speech

query-selector

Long-Term Series Forecasting with Query Selector – Efficient Model of Sparse Attention

Stars: ✭ 63 (+57.5%)

Mutual labels: transformer

VAD-LTSD

Efficient voice activity detection algorithm using long-term speech information

Stars: ✭ 37 (-7.5%)

Mutual labels: speech

anycontrol

Voice control for your websites and applications

Stars: ✭ 53 (+32.5%)

Mutual labels: speech

TRAR-VQA

[ICCV 2021] TRAR: Routing the Attention Spans in Transformers for Visual Question Answering -- Official Implementation

Stars: ✭ 49 (+22.5%)

Mutual labels: transformer

Wavegrad

Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.

Stars: ✭ 245 (+512.5%)

Mutual labels: speech

SER-datasets

A collection of datasets for the purpose of emotion recognition/detection in speech.

Stars: ✭ 74 (+85%)

Mutual labels: speech

densecap

Dense video captioning in PyTorch

Stars: ✭ 37 (-7.5%)

Mutual labels: transformer

speech recognition ctc

Chinese speech recognition using CTC with Keras

Stars: ✭ 40 (+0%)

Mutual labels: speech

Quality-Estimation1

Machine translation subtask: translation quality estimation. Reproduces the results of Alibaba's WMT 2018 paper.

Stars: ✭ 19 (-52.5%)

Mutual labels: transformer

catr

Image Captioning Using Transformer

Stars: ✭ 206 (+415%)

Mutual labels: transformer

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (+505%)

Mutual labels: speech

pytorch-gpt-x

Implementation of autoregressive language model using improved Transformer and DeepSpeed pipeline parallelism.

Stars: ✭ 21 (-47.5%)

Mutual labels: transformer

ICON

(TPAMI2022) Salient Object Detection via Integrity Learning.

Stars: ✭ 125 (+212.5%)

Mutual labels: transformer

myG2P

Myanmar (Burmese) Language Grapheme to Phoneme (myG2P) Conversion Dictionary for speech recognition (ASR) and speech synthesis (TTS).