Ua GecUA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
Stars: ✭ 108 (-54.62%)
WikisqlA large annotated semantic parsing corpus for developing natural language interfaces.
Stars: ✭ 965 (+305.46%)
Char Rnn TensorflowMulti-layer Recurrent Neural Networks for character-level language models implements by TensorFlow
Stars: ✭ 58 (-75.63%)
Mams For AbsaA Multi-Aspect Multi-Sentiment Dataset for aspect-based sentiment analysis.
Stars: ✭ 135 (-43.28%)
MtntCode for the collection and analysis of the MTNT dataset
Stars: ✭ 48 (-79.83%)
Oie ResourcesA curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Stars: ✭ 283 (+18.91%)
Nlp bahasa resourcesA Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
Stars: ✭ 158 (-33.61%)
FakenewscorpusA dataset of millions of news articles scraped from a curated list of data sources.
Stars: ✭ 255 (+7.14%)
Pytreebank😡😇 Stanford Sentiment Treebank loader in Python
Stars: ✭ 93 (-60.92%)
ProsodyHelsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
Stars: ✭ 139 (-41.6%)
Text2sql DataA collection of datasets that pair questions with SQL queries.
Stars: ✭ 287 (+20.59%)
BondBOND: BERT-Assisted Open-Domain Name Entity Recognition with Distant Supervision
Stars: ✭ 96 (-59.66%)
DoccanoOpen source annotation tool for machine learning practitioners.
Stars: ✭ 5,600 (+2252.94%)
CoarijCorpus of Annual Reports in Japan
Stars: ✭ 55 (-76.89%)
Pytorch NlpBasic Utilities for PyTorch Natural Language Processing (NLP)
Stars: ✭ 1,996 (+738.66%)
Aidl kbA Knowledge Base for the FB Group Artificial Intelligence and Deep Learning (AIDL)
Stars: ✭ 219 (-7.98%)
Machine Learning ResourcesA curated list of awesome machine learning frameworks, libraries, courses, books and many more.
Stars: ✭ 226 (-5.04%)
Awesome Financial NlpResearches for Natural Language Processing for Financial Domain
Stars: ✭ 220 (-7.56%)
CollectionCollection Data for Cooper Hewitt, Smithsonian Design Museum
Stars: ✭ 214 (-10.08%)
Datasetssource{d} datasets ("big code") for source code analysis and machine learning on source code
Stars: ✭ 231 (-2.94%)
Catalyst🚀 Catalyst is a C# Natural Language Processing library built for speed. Inspired by spaCy's design, it brings pre-trained models, out-of-the box support for training word and document embeddings, and flexible entity recognition models.
Stars: ✭ 224 (-5.88%)
Bccd datasetBCCD (Blood Cell Count and Detection) Dataset is a small-scale dataset for blood cells detection.
Stars: ✭ 216 (-9.24%)
Practical 1Oxford Deep NLP 2017 course - Practical 1: word2vec
Stars: ✭ 220 (-7.56%)
Stocknet DatasetA comprehensive dataset for stock movement prediction from tweets and historical stock prices.
Stars: ✭ 228 (-4.2%)
Awesome NlprojectsList of projects related to Natural Language Processing (NLP) that make a geek smile for they exist
Stars: ✭ 219 (-7.98%)
DataladKeep code, data, containers under control with git and git-annex
Stars: ✭ 234 (-1.68%)
LitThe Language Interpretability Tool: Interactively analyze NLP models for model understanding in an extensible and framework agnostic interface.
Stars: ✭ 2,721 (+1043.28%)
Visdial[CVPR 2017] Torch code for Visual Dialog
Stars: ✭ 215 (-9.66%)
Covid Chestxray DatasetWe are building an open database of COVID-19 cases with chest X-ray or CT images.
Stars: ✭ 2,759 (+1059.24%)
TorchdataPyTorch dataset extended with map, cache etc. (tensorflow.data like)
Stars: ✭ 226 (-5.04%)
DatatableA go in-memory table
Stars: ✭ 215 (-9.66%)
Dataset SerializeJSON to DataSet and DataSet to JSON converter for Delphi and Lazarus (FPC)
Stars: ✭ 213 (-10.5%)
DialogrptEMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"
Stars: ✭ 216 (-9.24%)
Structured3d[ECCV'20] Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling
Stars: ✭ 224 (-5.88%)
Text summarization with tensorflowImplementation of a seq2seq model for summarization of textual data. Demonstrated on amazon reviews, github issues and news articles.
Stars: ✭ 226 (-5.04%)
Short Jokes DatasetPython scripts for building 'Short Jokes' dataset, featured on Kaggle
Stars: ✭ 215 (-9.66%)
Ava downloader⏬ Download AVA dataset (A Large-Scale Database for Aesthetic Visual Analysis)
Stars: ✭ 214 (-10.08%)
StationaryGet hourly meteorological data from one of thousands of global stations
Stars: ✭ 225 (-5.46%)
Spacy LookupNamed Entity Recognition based on dictionaries
Stars: ✭ 212 (-10.92%)
Neat VisionNeat (Neural Attention) Vision, is a visualization tool for the attention mechanisms of deep-learning models for Natural Language Processing (NLP) tasks. (framework-agnostic)
Stars: ✭ 213 (-10.5%)
Covid 19 Repo DataData archive of identifiable COVID-19 related public projects on GitHub
Stars: ✭ 236 (-0.84%)
Deepnlp Models PytorchPytorch implementations of various Deep NLP models in cs-224n(Stanford Univ)
Stars: ✭ 2,760 (+1059.66%)
Spacy Services💫 REST microservices for various spaCy-related tasks
Stars: ✭ 230 (-3.36%)
CatalystAccelerated deep learning R&D
Stars: ✭ 2,804 (+1078.15%)
OmnianomalyKDD 2019: Robust Anomaly Detection for Multivariate Time Series through Stochastic Recurrent Neural Network
Stars: ✭ 208 (-12.61%)
CharlatanCreate fake data in R
Stars: ✭ 209 (-12.18%)
Prodigy Recipes🍳 Recipes for the Prodigy, our fully scriptable annotation tool
Stars: ✭ 229 (-3.78%)
Spacy Api DockerspaCy REST API, wrapped in a Docker container.
Stars: ✭ 222 (-6.72%)
Nlp RoadmapROADMAP(Mind Map) and KEYWORD for students those who have interest in learning NLP
Stars: ✭ 2,653 (+1014.71%)
ShifteratorInterpretable data visualizations for understanding how texts differ at the word level
Stars: ✭ 209 (-12.18%)
Mini Imagenet ToolsTools for generating mini-ImageNet dataset and processing batches
Stars: ✭ 209 (-12.18%)