Text2sql DataA collection of datasets that pair questions with SQL queries.
Stars: ✭ 287 (+112.59%)
Mutual labels: dataset, natural-language-processing
WikisqlA large annotated semantic parsing corpus for developing natural language interfaces.
Stars: ✭ 965 (+614.81%)
Mutual labels: dataset, natural-language-processing
DoccanoOpen source annotation tool for machine learning practitioners.
Stars: ✭ 5,600 (+4048.15%)
Mutual labels: dataset, natural-language-processing
ChazutsuThe tool to make NLP datasets ready to use
Stars: ✭ 238 (+76.3%)
Mutual labels: dataset, natural-language-processing
Pytreebank😡😇 Stanford Sentiment Treebank loader in Python
Stars: ✭ 93 (-31.11%)
Mutual labels: dataset, natural-language-processing
FakenewscorpusA dataset of millions of news articles scraped from a curated list of data sources.
Stars: ✭ 255 (+88.89%)
Mutual labels: dataset, natural-language-processing
Insuranceqa Corpus Zh🚁 保险行业语料库,聊天机器人
Stars: ✭ 821 (+508.15%)
Mutual labels: dataset, natural-language-processing
ProsodyHelsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
Stars: ✭ 139 (+2.96%)
Mutual labels: dataset, natural-language-processing
Char Rnn TensorflowMulti-layer Recurrent Neural Networks for character-level language models implements by TensorFlow
Stars: ✭ 58 (-57.04%)
Mutual labels: dataset, natural-language-processing
CoarijCorpus of Annual Reports in Japan
Stars: ✭ 55 (-59.26%)
Mutual labels: dataset, natural-language-processing
Korean Hate SpeechKorean HateSpeech Dataset
Stars: ✭ 192 (+42.22%)
Mutual labels: dataset, natural-language-processing
Ua GecUA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
Stars: ✭ 108 (-20%)
Mutual labels: dataset, natural-language-processing
Nlp bahasa resourcesA Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
Stars: ✭ 158 (+17.04%)
Mutual labels: dataset, natural-language-processing
Oie ResourcesA curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Stars: ✭ 283 (+109.63%)
Mutual labels: dataset, natural-language-processing
Pytorch NlpBasic Utilities for PyTorch Natural Language Processing (NLP)
Stars: ✭ 1,996 (+1378.52%)
Mutual labels: dataset, natural-language-processing
Hate Speech And Offensive LanguageRepository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017
Stars: ✭ 543 (+302.22%)
Mutual labels: dataset, natural-language-processing
MtntCode for the collection and analysis of the MTNT dataset
Stars: ✭ 48 (-64.44%)
Mutual labels: dataset, natural-language-processing
BondBOND: BERT-Assisted Open-Domain Name Entity Recognition with Distant Supervision
Stars: ✭ 96 (-28.89%)
Mutual labels: dataset, natural-language-processing
Awesome Hungarian NlpA curated list of NLP resources for Hungarian
Stars: ✭ 121 (-10.37%)
Mutual labels: dataset, natural-language-processing
Konoha🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.
Stars: ✭ 130 (-3.7%)
Mutual labels: natural-language-processing