Mtnt: Code for the collection and analysis of the MTNT dataset
Stars: ✭ 48 (-97.6%)
Chazutsu: The tool to make NLP datasets ready to use
Stars: ✭ 238 (-88.08%)
user quality: Dataset for Software Evolution and Quality Improvement
Stars: ✭ 27 (-98.65%)
Coarij: Corpus of Annual Reports in Japan
Stars: ✭ 55 (-97.24%)
Parallax: Tool for interactive embeddings visualization
Stars: ✭ 192 (-90.38%)
Catalyst: 🚀 Catalyst is a C# Natural Language Processing library built for speed. Inspired by spaCy's design, it brings pre-trained models, out-of-the-box support for training word and document embeddings, and flexible entity recognition models.
Stars: ✭ 224 (-88.78%)
Wikisql: A large annotated semantic parsing corpus for developing natural language interfaces.
Stars: ✭ 965 (-51.65%)
Spacy Course: 👩‍🏫 Advanced NLP with spaCy: A free online course
Stars: ✭ 1,920 (-3.81%)
Pytorch Sentiment Analysis: Tutorials on getting started with PyTorch and TorchText for sentiment analysis.
Stars: ✭ 3,209 (+60.77%)
Fakenewscorpus: A dataset of millions of news articles scraped from a curated list of data sources.
Stars: ✭ 255 (-87.22%)
Text2sql Data: A collection of datasets that pair questions with SQL queries.
Stars: ✭ 287 (-85.62%)
Cesi: WWW 2018: CESI: Canonicalizing Open Knowledge Bases using Embeddings and Side Information
Stars: ✭ 85 (-95.74%)
Ngx Dynamic Dashboard Framework: A JSON-driven, Angular x based dashboard framework inspired by JIRA's dashboard implementation and https://github.com/raulgomis/angular-dashboard-framework
Stars: ✭ 160 (-91.98%)
Nlp bahasa resources: A Curated List of Datasets and Usable Library Resources for NLP in Bahasa Indonesia
Stars: ✭ 158 (-92.08%)
Awesome Embedding Models: A curated list of awesome embedding model tutorials, projects and communities.
Stars: ✭ 1,486 (-25.55%)
Awesome Persian Nlp Ir: Curated List of Persian Natural Language Processing and Information Retrieval Tools and Resources
Stars: ✭ 460 (-76.95%)
Doccano: Open source annotation tool for machine learning practitioners.
Stars: ✭ 5,600 (+180.56%)
Bpemb: Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)
Stars: ✭ 909 (-54.46%)
Speedtorch: Library for faster pinned CPU <-> GPU transfer in PyTorch
Stars: ✭ 615 (-69.19%)
Magnitude: A fast, efficient universal vector embedding utility package.
Stars: ✭ 1,394 (-30.16%)
Ua Gec: UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
Stars: ✭ 108 (-94.59%)
Bond: BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision
Stars: ✭ 96 (-95.19%)
Mams For Absa: A Multi-Aspect Multi-Sentiment Dataset for aspect-based sentiment analysis.
Stars: ✭ 135 (-93.24%)
Vec4ir: Word Embeddings for Information Retrieval
Stars: ✭ 188 (-90.58%)
Pytreebank: 😡😇 Stanford Sentiment Treebank loader in Python
Stars: ✭ 93 (-95.34%)
Ner Lstm: Named Entity Recognition using multilayered bidirectional LSTM
Stars: ✭ 532 (-73.35%)
Oie Resources: A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Stars: ✭ 283 (-85.82%)
Wikipedia2vec: A tool for learning vector representations of words and entities from Wikipedia
Stars: ✭ 655 (-67.18%)
Text: Data loaders and abstractions for text and NLP
Stars: ✭ 2,915 (+46.04%)
Char Rnn Tensorflow: Multi-layer Recurrent Neural Networks for character-level language models, implemented in TensorFlow
Stars: ✭ 58 (-97.09%)
Chars2vec: Character-based word embeddings model based on RNN for handling real-world texts
Stars: ✭ 130 (-93.49%)
Prosody: Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
Stars: ✭ 139 (-93.04%)
Postagga: A library to parse natural language in pure Clojure and ClojureScript
Stars: ✭ 152 (-92.38%)
Swagaf: Repository for paper "SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference"
Stars: ✭ 156 (-92.18%)
Quickdraw Appendix: Dataset of 25k penises: an appendix to the Quick, Draw! Dataset
Stars: ✭ 153 (-92.33%)
Sourced Ce: source{d} Community Edition (CE)
Stars: ✭ 153 (-92.33%)
Awesome Pytorch List: A comprehensive list of PyTorch-related content on GitHub, such as different models, implementations, helper libraries, tutorials, etc.
Stars: ✭ 12,475 (+525%)
Maskedface Net: MaskedFace-Net is a dataset of human faces with correctly and incorrectly worn masks, based on the Flickr-Faces-HQ (FFHQ) dataset.
Stars: ✭ 152 (-92.38%)
Crf Layer On The Top Of Bilstm: The CRF layer was implemented using Chainer 2.0. For more details, see: https://createmomo.github.io/2017/09/12/CRF_Layer_on_the_Top_of_BiLSTM_1/
Stars: ✭ 148 (-92.59%)
Opbeat Node: DEPRECATED - See Elastic APM instead: https://github.com/elastic/apm-agent-nodejs
Stars: ✭ 155 (-92.23%)
Chineseblue: Chinese Biomedical Language Understanding Evaluation benchmark (ChineseBLUE)
Stars: ✭ 149 (-92.54%)
Finnlp Progress: NLP progress in Fintech. A repository to track progress in Natural Language Processing (NLP) for the finance domain, including datasets, papers, and current state-of-the-art results for the most popular tasks.
Stars: ✭ 148 (-92.59%)
Omr Datasets: Collection of datasets used for Optical Music Recognition
Stars: ✭ 158 (-92.08%)
Sling: SLING - A natural language frame semantics parser
Stars: ✭ 1,892 (-5.21%)
Snape: Snape is a convenient artificial dataset generator that wraps sklearn's make_classification and make_regression and then adds in 'realism' features such as complex formatting, varying scales, categorical variables, and missing values.
Stars: ✭ 155 (-92.23%)
Spacymoji: 💙 Emoji handling and meta data for spaCy with custom extension attributes
Stars: ✭ 151 (-92.43%)
Embedding As Service: One-stop solution to encode sentences into fixed-length vectors using various embedding techniques
Stars: ✭ 151 (-92.43%)
Dem.net: Digital Elevation Model library in C#. 3D terrain models, line/point elevations, intervisibility reports
Stars: ✭ 153 (-92.33%)
Goose: Load testing tool, inspired by Locust
Stars: ✭ 151 (-92.43%)
Visdial Rl: PyTorch code for Learning Cooperative Visual Dialog Agents using Deep Reinforcement Learning
Stars: ✭ 157 (-92.13%)
Pythonrouge: Python wrapper for evaluating summarization quality with the ROUGE package
Stars: ✭ 155 (-92.23%)