Given a sentence automatically generate reading comprehension style factual questions from that sentence, such that the sentence contains answers to those questions.

Stars: ✭ 100 (-34.64%)

Mutual labels: nlp-machine-learning

Wiki Split

One million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.

Stars: ✭ 95 (-37.91%)

Mutual labels: nlp-machine-learning

Remo Python

🐰 Python lib for remo - the app for annotations and images management in Computer Vision

Stars: ✭ 138 (-9.8%)

Mutual labels: datasets

Aesthetics

Image Aesthetics Toolkit - includes Fisher Vector implementation, AVA (Image Aesthetic Visual Analysis) dataset and fast multi-threaded downloader

Stars: ✭ 113 (-26.14%)

Mutual labels: datasets

Datascience

It consists of examples, assignments discussed in data science course taken at algorithmica.

Stars: ✭ 92 (-39.87%)

Mutual labels: nlp-machine-learning

Textaugmentation Gpt2

Fine-tuned pre-trained GPT2 for custom topic specific text generation. Such system can be used for Text Augmentation.

Stars: ✭ 104 (-32.03%)

Mutual labels: nlp-machine-learning

Pipedream

Connect APIs, remarkably fast. Free for developers.

Stars: ✭ 2,068 (+1251.63%)

Mutual labels: datasets

Mrc book

《机器阅读理解：算法与实践》代码

Stars: ✭ 102 (-33.33%)

Mutual labels: nlp-machine-learning

Onnxt5

Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.

Stars: ✭ 143 (-6.54%)

Mutual labels: nlp-machine-learning

Doppelganger

[IMC 2020 (Best Paper Finalist)] Using GANs for Sharing Networked Time Series Data: Challenges, Initial Promise, and Open Questions

Stars: ✭ 97 (-36.6%)

Mutual labels: datasets

Bird Recognition Review

A list of useful resources in the bird sound (song and calls) recognition, such as datasets, papers, links to open source projects and competitions

Stars: ✭ 116 (-24.18%)

Mutual labels: datasets

Nottingham Dataset

Cleaned version of the Nottingham dataset

Stars: ✭ 94 (-38.56%)

Mutual labels: datasets

Zzz Retired openstt

RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:

Stars: ✭ 146 (-4.58%)

Mutual labels: nlp-machine-learning

Bertqa Attention On Steroids

BertQA - Attention on Steroids

Stars: ✭ 112 (-26.8%)

Mutual labels: nlp-machine-learning

Crossweigh

CrossWeigh: Training Named Entity Tagger from Imperfect Annotations

Stars: ✭ 91 (-40.52%)

Mutual labels: datasets

Text classification

Text Classification Algorithms: A Survey

Stars: ✭ 1,276 (+733.99%)

Mutual labels: nlp-machine-learning

Dareblopy

Data Reading Blocks for Python

Stars: ✭ 82 (-46.41%)

Mutual labels: datasets

Seq2seq tutorial

Code For Medium Article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models"

Stars: ✭ 132 (-13.73%)

Mutual labels: nlp-machine-learning

Cholera

R Package for Analyzing John Snow's 1854 Cholera Map

Stars: ✭ 110 (-28.1%)

Mutual labels: datasets

Gopup

数据接口：百度、谷歌、头条、微博指数,宏观数据，利率数据，货币汇率，千里马、独角兽公司，新闻联播文字稿，影视票房数据，高校名单，疫情数据…

Stars: ✭ 1,229 (+703.27%)

Mutual labels: datasets

Nlu datasets with task oriented dialogue

datasets of natural language understanding and dialogue state tracking

Stars: ✭ 104 (-32.03%)

Mutual labels: datasets

Nlp Pretrained Model

A collection of Natural language processing pre-trained models.

Stars: ✭ 122 (-20.26%)

Mutual labels: nlp-machine-learning

Repo 2016

R, Python and Mathematica Codes in Machine Learning, Deep Learning, Artificial Intelligence, NLP and Geolocation

Stars: ✭ 103 (-32.68%)

Mutual labels: nlp-machine-learning

Pix2code

pix2code: Generating Code from a Graphical User Interface Screenshot

Stars: ✭ 11,349 (+7317.65%)

Mutual labels: datasets

Multi object datasets

Multi-object image datasets with ground-truth segmentation masks and generative factors.

Stars: ✭ 121 (-20.92%)

Mutual labels: datasets

Photogrammetry datasets

Collection of 250+ datasets for photogrammetry

Stars: ✭ 76 (-50.33%)

Mutual labels: datasets

Transitland Datastore

Transitland's centralized web service API for both querying and editing aggregated transit data from around the world

Stars: ✭ 101 (-33.99%)

Mutual labels: datasets

Pins

Pin, Discover and Share Resources

Stars: ✭ 149 (-2.61%)

Mutual labels: datasets

Exposure correction

Reference code for the paper "Learning Multi-Scale Photo Exposure Correction", CVPR 2021.

Stars: ✭ 98 (-35.95%)

Mutual labels: datasets

G Reader

2018年机器阅读理解技术竞赛模型，国内外1000多支队伍中BLEU-4评分排名第6， ROUGE-L评分排名第14。（未ensemble，未嵌入训练好的词向量，无dropout）

Stars: ✭ 117 (-23.53%)

Mutual labels: nlp-machine-learning

Monkeylearn

⛔️ ARCHIVED ⛔️ 🐒 R package for text analysis with Monkeylearn 🐒

Stars: ✭ 95 (-37.91%)

Mutual labels: nlp-machine-learning

Lazy

Lazy, AI chatbot service.

Stars: ✭ 141 (-7.84%)

Mutual labels: nlp-machine-learning

Persian Swear Words

دیتاست کلمات نامناسب و بد فارسی برای فیلتر کردن متن ها

Stars: ✭ 95 (-37.91%)

Mutual labels: datasets

Aspect Based Sentiment Analysis

Aspect-Based Sentiment Analysis Experiments

Stars: ✭ 115 (-24.84%)

Mutual labels: datasets

Writeup Frontend

Beat Writer's Block with AI

Stars: ✭ 94 (-38.56%)

Mutual labels: nlp-machine-learning

Idenprof

IdenProf dataset is a collection of images of identifiable professionals. It is been collected to enable the development of AI systems that can serve by identifying people and the nature of their job by simply looking at an image, just like humans can do.

Stars: ✭ 149 (-2.61%)

Mutual labels: datasets

Doc2vec

📓 Long(er) text representation and classification using Doc2Vec embeddings

Stars: ✭ 92 (-39.87%)

Mutual labels: nlp-machine-learning

Lingo

package lingo provides the data structures and algorithms required for natural language processing

Stars: ✭ 113 (-26.14%)

Mutual labels: nlp-machine-learning

Lda Topic Modeling

A PureScript, browser-based implementation of LDA topic modeling.

Stars: ✭ 91 (-40.52%)

Mutual labels: nlp-machine-learning

Complete Life Cycle Of A Data Science Project

Complete-Life-Cycle-of-a-Data-Science-Project

Stars: ✭ 140 (-8.5%)

Mutual labels: datasets

Summarus

Models for automatic abstractive summarization

Stars: ✭ 83 (-45.75%)

Mutual labels: nlp-machine-learning

Firstcoursenetworkscience

Tutorials, datasets, and other material associated with textbook "A First Course in Network Science" by Menczer, Fortunato & Davis

Stars: ✭ 111 (-27.45%)

Mutual labels: datasets

Openml R

R package to interface with OpenML

Stars: ✭ 81 (-47.06%)

Mutual labels: datasets

Hands On Natural Language Processing With Python

This repository is for my students of Udemy. You can find all lecture codes along with mentioned files for reading in here. So, feel free to clone it and if you have any problem just raise a question.

Stars: ✭ 146 (-4.58%)

Mutual labels: nlp-machine-learning

Atis dataset

The ATIS (Airline Travel Information System) Dataset

Stars: ✭ 81 (-47.06%)

Mutual labels: datasets

Atnre

Adversarial Training for Neural Relation Extraction

Stars: ✭ 108 (-29.41%)

Mutual labels: nlp-machine-learning

Cluedatasetsearch

搜索所有中文NLP数据集，附常用英文NLP数据集