All Projects → Prosody → Similar Projects or Alternatives

1301 Open source projects that are alternatives of or similar to Prosody

Ua Gec
UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
Stars: ✭ 108 (-22.3%)
Insuranceqa Corpus Zh
🚁 保险行业语料库,聊天机器人
Stars: ✭ 821 (+490.65%)
Coarij
Corpus of Annual Reports in Japan
Stars: ✭ 55 (-60.43%)
Nlp bahasa resources
A Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
Stars: ✭ 158 (+13.67%)
Fakenewscorpus
A dataset of millions of news articles scraped from a curated list of data sources.
Stars: ✭ 255 (+83.45%)
Awesome Hungarian Nlp
A curated list of NLP resources for Hungarian
Stars: ✭ 121 (-12.95%)
Dataset List
lists of text corpus and more (mainly Japanese)
Stars: ✭ 84 (-39.57%)
Mutual labels:  dataset, corpus
Indonesian Nlp Resources
data resource untuk NLP bahasa indonesia
Stars: ✭ 143 (+2.88%)
Mutual labels:  dataset, corpus
Pytorch Nlp
Basic Utilities for PyTorch Natural Language Processing (NLP)
Stars: ✭ 1,996 (+1335.97%)
Text2sql Data
A collection of datasets that pair questions with SQL queries.
Stars: ✭ 287 (+106.47%)
Gector
Official implementation of the paper “GECToR – Grammatical Error Correction: Tag, Not Rewrite” // Published on BEA15 Workshop (co-located with ACL 2020) https://www.aclweb.org/anthology/2020.bea-1.16.pdf
Stars: ✭ 287 (+106.47%)
Doccano
Open source annotation tool for machine learning practitioners.
Stars: ✭ 5,600 (+3928.78%)
Ncrfpp
NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.
Stars: ✭ 1,767 (+1171.22%)
Bond
BOND: BERT-Assisted Open-Domain Name Entity Recognition with Distant Supervision
Stars: ✭ 96 (-30.94%)
Pytreebank
😡😇 Stanford Sentiment Treebank loader in Python
Stars: ✭ 93 (-33.09%)
Mams For Absa
A Multi-Aspect Multi-Sentiment Dataset for aspect-based sentiment analysis.
Stars: ✭ 135 (-2.88%)
Efaqa Corpus Zh
❤️Emotional First Aid Dataset, 心理咨询问答、聊天机器人语料库
Stars: ✭ 170 (+22.3%)
Medical-Names-Corpus
医疗语料库。医疗机构名语料库。药品本位码。
Stars: ✭ 26 (-81.29%)
Mutual labels:  corpus, dataset
Oie Resources
A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Stars: ✭ 283 (+103.6%)
Weixin public corpus
微信公众号语料库
Stars: ✭ 465 (+234.53%)
Korean Hate Speech
Korean HateSpeech Dataset
Stars: ✭ 192 (+38.13%)
Pororo
PORORO: Platform Of neuRal mOdels for natuRal language prOcessing
Stars: ✭ 812 (+484.17%)
Company Names Corpus
公司名语料库。机构名语料库。公司简称,缩写,品牌词,企业名。可用于中文分词、机构名实体识别。
Stars: ✭ 868 (+524.46%)
Mutual labels:  dataset, corpus
Awesome Ai Services
An overview of the AI-as-a-service landscape
Stars: ✭ 133 (-4.32%)
Dialog corpus
用于训练中英文对话系统的语料库 Datasets for Training Chatbot System
Stars: ✭ 1,662 (+1095.68%)
Mutual labels:  dataset, corpus
Jsut Lab
HTS-style full-context labels for JSUT v1.1
Stars: ✭ 28 (-79.86%)
Mutual labels:  dataset, speech-synthesis
Mtnt
Code for the collection and analysis of the MTNT dataset
Stars: ✭ 48 (-65.47%)
Char Rnn Tensorflow
Multi-layer Recurrent Neural Networks for character-level language models implements by TensorFlow
Stars: ✭ 58 (-58.27%)
Hate Speech And Offensive Language
Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017
Stars: ✭ 543 (+290.65%)
Typing Assistant
Typing Assistant provides the ability to autocomplete words and suggests predictions for the next word. This makes typing faster, more intelligent and reduces effort.
Stars: ✭ 32 (-76.98%)
Clue
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Stars: ✭ 2,425 (+1644.6%)
Mutual labels:  dataset, corpus
Nlvr
Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with the truth-value of the sentence.
Stars: ✭ 192 (+38.13%)
Wikisql
A large annotated semantic parsing corpus for developing natural language interfaces.
Stars: ✭ 965 (+594.24%)
Neuronblocks
NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego
Stars: ✭ 1,356 (+875.54%)
Species-Names-Corpus
物种名称语料库。植物名,动物名。
Stars: ✭ 23 (-83.45%)
Mutual labels:  corpus, dataset
Chinese Names Corpus
中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。
Stars: ✭ 3,053 (+2096.4%)
Mutual labels:  dataset, corpus
Chazutsu
The tool to make NLP datasets ready to use
Stars: ✭ 238 (+71.22%)
Anago
Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.
Stars: ✭ 1,392 (+901.44%)
Spokestack Python
Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (-25.9%)
Awesome Persian Nlp Ir
Curated List of Persian Natural Language Processing and Information Retrieval Tools and Resources
Stars: ✭ 460 (+230.94%)
Neuronlp2
Deep neural models for core NLP tasks (Pytorch version)
Stars: ✭ 397 (+185.61%)
Cluepretrainedmodels
高质量中文预训练模型集合:最先进大模型、最快小模型、相似度专门模型
Stars: ✭ 493 (+254.68%)
Mutual labels:  dataset, corpus
Nlp chinese corpus
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
Stars: ✭ 6,656 (+4688.49%)
Mutual labels:  dataset, corpus
Cluener2020
CLUENER2020 中文细粒度命名实体识别 Fine Grained Named Entity Recognition
Stars: ✭ 689 (+395.68%)
Mutual labels:  dataset, sequence-labeling
Quanteda
An R package for the Quantitative Analysis of Textual Data
Stars: ✭ 647 (+365.47%)
Seqeval
A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)
Stars: ✭ 508 (+265.47%)
Ja.text8
Japanese text8 corpus for word embedding.
Stars: ✭ 79 (-43.17%)
Flair
A very simple framework for state-of-the-art Natural Language Processing (NLP)
Stars: ✭ 11,065 (+7860.43%)
Gossiping Chinese Corpus
PTT 八卦版問答中文語料
Stars: ✭ 137 (-1.44%)
Mutual labels:  dataset, corpus
Prenlp
Preprocessing Library for Natural Language Processing
Stars: ✭ 130 (-6.47%)
Sluice Networks
Code for Sluice networks: Learning what to share between loosely related tasks
Stars: ✭ 135 (-2.88%)
Textacy
NLP, before and after spaCy
Stars: ✭ 1,849 (+1230.22%)
Legacy straight
A vocoder framework which had been widely used in research community since 1999.
Stars: ✭ 130 (-6.47%)
Mutual labels:  speech-synthesis
Datasets
🎁 3,000,000+ Unsplash images made available for research and machine learning
Stars: ✭ 1,805 (+1198.56%)
Mutual labels:  dataset
Rasa
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
Stars: ✭ 13,219 (+9410.07%)
Tvqa
[EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering
Stars: ✭ 130 (-6.47%)
Mutual labels:  dataset
Konoha
🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.
Stars: ✭ 130 (-6.47%)
Chars2vec
Character-based word embeddings model based on RNN for handling real world texts
Stars: ✭ 130 (-6.47%)
Hpatches Benchmark
Python & Matlab code for local feature descriptor evaluation with the HPatches dataset.
Stars: ✭ 129 (-7.19%)
Mutual labels:  dataset
Kaggle Crowdflower
1st Place Solution for CrowdFlower Product Search Results Relevance Competition on Kaggle.
Stars: ✭ 1,708 (+1128.78%)
1-60 of 1301 similar projects