treebender
A HDPSG-inspired symbolic natural language parser written in Rust
Stars: ✭ 24 (-76.24%)
folia
FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for proces…
Stars: ✭ 56 (-44.55%)
Speech Aligner
speech-aligner is a tool that generates phoneme-level time alignments between human speech and its transcription.
Stars: ✭ 259 (+156.44%)
Espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
Stars: ✭ 808 (+700%)
speech-to-text
mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
Stars: ✭ 61 (-39.6%)
Yesterday I Learned
Brainfarts are caused by the rupturing of the cerebral sphincter.
Stars: ✭ 50 (-50.5%)
TextDatasetCleaner
🔬 Cleaning datasets of garbage (normalization, preprocessing)
Stars: ✭ 27 (-73.27%)
Awesome Kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (+289.11%)
Vosk Android Demo
Offline speech recognition for Android with Vosk library.
Stars: ✭ 271 (+168.32%)
Kaldi Io
C++ Kaldi IO lib (static and dynamic).
Stars: ✭ 22 (-78.22%)
spanish-corpora
Unannotated Spanish 3 Billion Words Corpora
Stars: ✭ 61 (-39.6%)
Dragonfire
The open-source virtual assistant for Ubuntu-based Linux distributions
Stars: ✭ 1,120 (+1008.91%)
TextGridTools
Read, write, and manipulate Praat TextGrid files with Python
Stars: ✭ 84 (-16.83%)
Eesen
The official repository of the Eesen project
Stars: ✭ 738 (+630.69%)
mystem
CGo bindings to Yandex.Mystem
Stars: ✭ 28 (-72.28%)
Plda
An LDA/PLDA estimator using KALDI in python for speaker verification tasks
Stars: ✭ 85 (-15.84%)
kaldi-timit-sre-ivector
Develop speaker recognition model based on i-vector using TIMIT database
Stars: ✭ 17 (-83.17%)
opensnips
Open source projects related to Snips https://snips.ai/.
Stars: ✭ 50 (-50.5%)
Psychopy
For running psychology and neuroscience experiments
Stars: ✭ 1,020 (+909.9%)
lameta
The Metadata Editor for Transparent Archiving of language document materials
Stars: ✭ 18 (-82.18%)
Espnet
End-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+4388.12%)
Vosk Server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Stars: ✭ 277 (+174.26%)
Theano Kaldi Rnn
THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is coupled with the Kaldi decoder.
Stars: ✭ 31 (-69.31%)
Beta
An open source reimplementation of Benny Brodda's BETA in Python
Stars: ✭ 65 (-35.64%)
rsyntaxtree
Syntax tree generator made with Ruby and RMagick
Stars: ✭ 62 (-38.61%)
Awesome Sentiment Analysis
😀😄😂😭 A curated list of Sentiment Analysis methods, implementations and misc. 😥😟😱😤
Stars: ✭ 816 (+707.92%)
dropclass speaker
DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020
Stars: ✭ 20 (-80.2%)
Flat
FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.github.io/folia), a rich XML-based format for linguistic annotation. Flat allows users to view annotated FoLiA documents and enrich these documents with new annotations; a wide variety of linguistic annotation types is supported through the FoLiA paradigm.
Stars: ✭ 93 (-7.92%)
concepticon-data
The curation repository for the data behind Concepticon.
Stars: ✭ 25 (-75.25%)
Pykaldi
A Python wrapper for Kaldi
Stars: ✭ 756 (+648.51%)
wikipron
Massively multilingual pronunciation mining
Stars: ✭ 167 (+65.35%)
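Nhyai
AI-based content review supporting pornography detection, violence and terrorism detection, language identification, sensitive text detection, and video detection, along with OCR capabilities for ID cards, driver's licenses, vehicle licenses, business licenses, bank cards, handwriting, license plates, and business cards. The features can be tried on the project's website.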
Stars: ✭ 60 (-40.59%)
OpenGNT
Open Greek New Testament Project; NA28 / NA27 Equivalent Text & Resources
Stars: ✭ 55 (-45.54%)
duree
Durée: the longest book ever written.
Stars: ✭ 67 (-33.66%)
Wikipron
Massively multilingual pronunciation mining
Stars: ✭ 99 (-1.98%)
kaldi-alligner
Scripts to align a given wave file to its transcription using models trained with Kaldi
Stars: ✭ 24 (-76.24%)
Pynlpl
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build a simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP-specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
Stars: ✭ 426 (+321.78%)
Textannotationgraphs
A modular annotation system that supports complex, interactive annotation graphs embedded on top of sequences of text.
Stars: ✭ 73 (-27.72%)
linguisticsdown
Easy Linguistics Document Writing with R Markdown
Stars: ✭ 24 (-76.24%)
Zamia Speech
Open tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (+270.3%)
kaldi helpers
🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
Stars: ✭ 13 (-87.13%)
Voxceleb Ivector
Voxceleb1 i-vector based speaker recognition system
Stars: ✭ 36 (-64.36%)
Factorized Tdnn
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Stars: ✭ 98 (-2.97%)
Ivector Xvector
Extract x-vectors and i-vectors with Kaldi
Stars: ✭ 67 (-33.66%)
Phonemes
Jason Riggle's chart of phonological features in JSON format + extras
Stars: ✭ 33 (-67.33%)
React Transcript Editor
A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
Stars: ✭ 285 (+182.18%)