Official implementation of the paper “GECToR – Grammatical Error Correction: Tag, Not Rewrite” // Published on BEA15 Workshop (co-located with ACL 2020) https://www.aclweb.org/anthology/2020.bea-1.16.pdf

Stars: ✭ 287 (+106.47%)

Mutual labels: natural-language-processing, sequence-labeling

Doccano

Open source annotation tool for machine learning practitioners.

Stars: ✭ 5,600 (+3928.78%)

Mutual labels: dataset, natural-language-processing

Ncrfpp

NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.

Stars: ✭ 1,767 (+1171.22%)

Mutual labels: natural-language-processing, sequence-labeling

Bond

BOND: BERT-Assisted Open-Domain Name Entity Recognition with Distant Supervision

Stars: ✭ 96 (-30.94%)

Mutual labels: dataset, natural-language-processing

Pytreebank

😡😇 Stanford Sentiment Treebank loader in Python

Stars: ✭ 93 (-33.09%)

Mutual labels: dataset, natural-language-processing

Mams For Absa

A Multi-Aspect Multi-Sentiment Dataset for aspect-based sentiment analysis.

Stars: ✭ 135 (-2.88%)

Mutual labels: dataset, natural-language-processing

Efaqa Corpus Zh

❤️Emotional First Aid Dataset, 心理咨询问答、聊天机器人语料库

Stars: ✭ 170 (+22.3%)

Mutual labels: corpus, natural-language-processing

Medical-Names-Corpus

医疗语料库。医疗机构名语料库。药品本位码。

Stars: ✭ 26 (-81.29%)

Mutual labels: corpus, dataset

Oie Resources

A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.

Stars: ✭ 283 (+103.6%)

Mutual labels: dataset, natural-language-processing

Weixin public corpus

微信公众号语料库

Stars: ✭ 465 (+234.53%)

Mutual labels: corpus, natural-language-processing

Korean Hate Speech

Korean HateSpeech Dataset

Stars: ✭ 192 (+38.13%)

Mutual labels: dataset, natural-language-processing

Pororo

PORORO: Platform Of neuRal mOdels for natuRal language prOcessing

Stars: ✭ 812 (+484.17%)

Mutual labels: natural-language-processing, speech-synthesis

Company Names Corpus

公司名语料库。机构名语料库。公司简称,缩写,品牌词,企业名。可用于中文分词、机构名实体识别。

Stars: ✭ 868 (+524.46%)

Mutual labels: dataset, corpus

Awesome Ai Services

An overview of the AI-as-a-service landscape

Stars: ✭ 133 (-4.32%)

Mutual labels: natural-language-processing, speech-synthesis

Dialog corpus

用于训练中英文对话系统的语料库 Datasets for Training Chatbot System

Stars: ✭ 1,662 (+1095.68%)

Mutual labels: dataset, corpus

Jsut Lab

HTS-style full-context labels for JSUT v1.1

Stars: ✭ 28 (-79.86%)

Mutual labels: dataset, speech-synthesis

Mtnt

Code for the collection and analysis of the MTNT dataset

Stars: ✭ 48 (-65.47%)

Mutual labels: dataset, natural-language-processing

Char Rnn Tensorflow

Multi-layer Recurrent Neural Networks for character-level language models implements by TensorFlow

Stars: ✭ 58 (-58.27%)

Mutual labels: dataset, natural-language-processing

Hate Speech And Offensive Language

Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017

Stars: ✭ 543 (+290.65%)

Mutual labels: dataset, natural-language-processing

Typing Assistant

Typing Assistant provides the ability to autocomplete words and suggests predictions for the next word. This makes typing faster, more intelligent and reduces effort.

Stars: ✭ 32 (-76.98%)

Mutual labels: corpus, natural-language-processing

Clue

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Stars: ✭ 2,425 (+1644.6%)

Mutual labels: dataset, corpus

Nlvr

Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with the truth-value of the sentence.

Stars: ✭ 192 (+38.13%)

Mutual labels: corpus, natural-language-processing

Wikisql

A large annotated semantic parsing corpus for developing natural language interfaces.

Stars: ✭ 965 (+594.24%)

Mutual labels: dataset, natural-language-processing

Neuronblocks

NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego

Stars: ✭ 1,356 (+875.54%)

Mutual labels: natural-language-processing, sequence-labeling

Species-Names-Corpus

物种名称语料库。植物名,动物名。

Stars: ✭ 23 (-83.45%)

Mutual labels: corpus, dataset

Chinese Names Corpus

中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。

Stars: ✭ 3,053 (+2096.4%)

Mutual labels: dataset, corpus

Chazutsu

The tool to make NLP datasets ready to use

Stars: ✭ 238 (+71.22%)

Mutual labels: dataset, natural-language-processing

Anago

Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.

Stars: ✭ 1,392 (+901.44%)

Mutual labels: natural-language-processing, sequence-labeling

Spokestack Python

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.

Stars: ✭ 103 (-25.9%)

Mutual labels: natural-language-processing, speech-synthesis

Awesome Persian Nlp Ir

Curated List of Persian Natural Language Processing and Information Retrieval Tools and Resources

Stars: ✭ 460 (+230.94%)

Mutual labels: corpus, natural-language-processing

Neuronlp2

Deep neural models for core NLP tasks (Pytorch version)

Stars: ✭ 397 (+185.61%)

Mutual labels: natural-language-processing, sequence-labeling

Cluepretrainedmodels

高质量中文预训练模型集合：最先进大模型、最快小模型、相似度专门模型

Stars: ✭ 493 (+254.68%)

Mutual labels: dataset, corpus

Nlp chinese corpus

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

Stars: ✭ 6,656 (+4688.49%)

Mutual labels: dataset, corpus

Cluener2020

CLUENER2020 中文细粒度命名实体识别 Fine Grained Named Entity Recognition

Stars: ✭ 689 (+395.68%)

Mutual labels: dataset, sequence-labeling

Quanteda

An R package for the Quantitative Analysis of Textual Data

Stars: ✭ 647 (+365.47%)

Mutual labels: corpus, natural-language-processing

Seqeval

A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)

Stars: ✭ 508 (+265.47%)

Mutual labels: natural-language-processing, sequence-labeling

Ja.text8

Japanese text8 corpus for word embedding.

Stars: ✭ 79 (-43.17%)

Mutual labels: corpus, natural-language-processing

Flair

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Stars: ✭ 11,065 (+7860.43%)

Mutual labels: natural-language-processing, sequence-labeling

Gossiping Chinese Corpus

PTT 八卦版問答中文語料

Stars: ✭ 137 (-1.44%)

Mutual labels: dataset, corpus

Prenlp

Preprocessing Library for Natural Language Processing

Stars: ✭ 130 (-6.47%)

Mutual labels: natural-language-processing

Sluice Networks

Code for Sluice networks: Learning what to share between loosely related tasks

Stars: ✭ 135 (-2.88%)

Mutual labels: natural-language-processing

Textacy

NLP, before and after spaCy

Stars: ✭ 1,849 (+1230.22%)

Mutual labels: natural-language-processing

Legacy straight

A vocoder framework which had been widely used in research community since 1999.

Stars: ✭ 130 (-6.47%)

Mutual labels: speech-synthesis

Datasets

🎁 3,000,000+ Unsplash images made available for research and machine learning

Stars: ✭ 1,805 (+1198.56%)

Mutual labels: dataset

Rasa

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Stars: ✭ 13,219 (+9410.07%)

Mutual labels: natural-language-processing

Tvqa

[EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering

Stars: ✭ 130 (-6.47%)

Mutual labels: dataset

Konoha

🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.

Stars: ✭ 130 (-6.47%)

Mutual labels: natural-language-processing

Chars2vec

Character-based word embeddings model based on RNN for handling real world texts

Stars: ✭ 130 (-6.47%)

Mutual labels: natural-language-processing

Hpatches Benchmark

Python & Matlab code for local feature descriptor evaluation with the HPatches dataset.

Stars: ✭ 129 (-7.19%)

Mutual labels: dataset

Kaggle Crowdflower

1st Place Solution for CrowdFlower Product Search Results Relevance Competition on Kaggle.

Stars: ✭ 1,708 (+1128.78%)

Mutual labels: natural-language-processing

1-60 of 1301 similar projects

›

next*5