BondBOND: BERT-Assisted Open-Domain Name Entity Recognition with Distant Supervision
Stars: ✭ 96 (-50%)
CoarijCorpus of Annual Reports in Japan
Stars: ✭ 55 (-71.35%)
WikisqlA large annotated semantic parsing corpus for developing natural language interfaces.
Stars: ✭ 965 (+402.6%)
DoccanoOpen source annotation tool for machine learning practitioners.
Stars: ✭ 5,600 (+2816.67%)
FakenewscorpusA dataset of millions of news articles scraped from a curated list of data sources.
Stars: ✭ 255 (+32.81%)
Pytorch NlpBasic Utilities for PyTorch Natural Language Processing (NLP)
Stars: ✭ 1,996 (+939.58%)
ChazutsuThe tool to make NLP datasets ready to use
Stars: ✭ 238 (+23.96%)
Char Rnn TensorflowMulti-layer Recurrent Neural Networks for character-level language models implements by TensorFlow
Stars: ✭ 58 (-69.79%)
Mams For AbsaA Multi-Aspect Multi-Sentiment Dataset for aspect-based sentiment analysis.
Stars: ✭ 135 (-29.69%)
Ua GecUA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
Stars: ✭ 108 (-43.75%)
Oie ResourcesA curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Stars: ✭ 283 (+47.4%)
Pytreebank😡😇 Stanford Sentiment Treebank loader in Python
Stars: ✭ 93 (-51.56%)
Text2sql DataA collection of datasets that pair questions with SQL queries.
Stars: ✭ 287 (+49.48%)
MtntCode for the collection and analysis of the MTNT dataset
Stars: ✭ 48 (-75%)
ProsodyHelsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
Stars: ✭ 139 (-27.6%)
Nlp bahasa resourcesA Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
Stars: ✭ 158 (-17.71%)
MutualA Dataset for Multi-Turn Dialogue Reasoning
Stars: ✭ 181 (-5.73%)
Deep Survey Text ClassificationThe project surveys 16+ Natural Language Processing (NLP) research papers that propose novel Deep Neural Network Models for Text Classification, based on Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN). It also implements each of the models using Tensorflow and Keras.
Stars: ✭ 187 (-2.6%)
StopwordsDefault English stopword lists from many different sources
Stars: ✭ 179 (-6.77%)
Cookiecutter Spacy FastapiCookiecutter API for creating Custom Skills for Azure Search using Python and Docker
Stars: ✭ 179 (-6.77%)
DostoevskySentiment analysis library for russian language
Stars: ✭ 191 (-0.52%)
NelEntity linking framework
Stars: ✭ 176 (-8.33%)
Deeptoxictop 1% solution to toxic comment classification challenge on Kaggle.
Stars: ✭ 180 (-6.25%)
Nlp profilerA simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
Stars: ✭ 181 (-5.73%)
Data Setstate driven all in one data process for data visualization
Stars: ✭ 191 (-0.52%)
Bert Vocab BuilderBuilds wordpiece(subword) vocabulary compatible for Google Research's BERT
Stars: ✭ 187 (-2.6%)
Cs224n 2019My completed implementation solutions for CS224N 2019
Stars: ✭ 178 (-7.29%)
Pyss3A Python package implementing a new machine learning model for text classification with visualization tools for Explainable AI
Stars: ✭ 191 (-0.52%)
FastnlpfastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
Stars: ✭ 2,441 (+1171.35%)
CleannlpR package providing annotators and a normalized data model for natural language processing
Stars: ✭ 174 (-9.37%)
PhrasalA large-scale statistical machine translation system written in Java.
Stars: ✭ 190 (-1.04%)
NeuralqaNeuralQA: A Usable Library for Question Answering on Large Datasets with BERT
Stars: ✭ 185 (-3.65%)
SiceLearning a Deep Single Image Contrast Enhancer from Multi-Exposure Images (TIP 2018)
Stars: ✭ 175 (-8.85%)
MsmarcoUtilities, Baselines, Statistics and Descriptions Related to the MSMARCO DATASET
Stars: ✭ 175 (-8.85%)
Transformers.jlJulia Implementation of Transformer models
Stars: ✭ 173 (-9.9%)
NlvrCornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with the truth-value of the sentence.
Stars: ✭ 192 (+0%)
Displacy Ent💥 displaCy-ent.js: An open-source named entity visualiser for the modern web
Stars: ✭ 191 (-0.52%)
ArxivnotesIssuesにNLP(自然言語処理)に関連するの論文を読んだまとめを書いています.雑です.🚧 マークは編集中の論文です(事実上放置のものも多いです).🍡 マークは概要のみ書いてます(早く見れる的な意味で団子).
Stars: ✭ 190 (-1.04%)
Datasets For GoodList of datasets to apply stats/machine learning/technology to the world of social good.
Stars: ✭ 174 (-9.37%)
GladGlobal-Locally Self-Attentive Dialogue State Tracker
Stars: ✭ 185 (-3.65%)
Deep Math Machine Learning.aiA blog which talks about machine learning, deep learning algorithms and the Math. and Machine learning algorithms written from scratch.
Stars: ✭ 173 (-9.9%)
GermanwordembeddingsToolkit to obtain and preprocess german corpora, train models using word2vec (gensim) and evaluate them with generated testsets
Stars: ✭ 189 (-1.56%)
Dkpro CoreCollection of software components for natural language processing (NLP) based on the Apache UIMA framework.
Stars: ✭ 184 (-4.17%)
Hand pose actionDataset and code for the paper "First-Person Hand Action Benchmark with RGB-D Videos and 3D Hand Pose Annotations", CVPR 2018.
Stars: ✭ 173 (-9.9%)
Knockknock🚪✊Knock Knock: Get notified when your training ends with only two additional lines of code
Stars: ✭ 2,304 (+1100%)
HntitlenatorTest your HN title against a neural network
Stars: ✭ 184 (-4.17%)
Spark NlpState of the Art Natural Language Processing
Stars: ✭ 2,518 (+1211.46%)