Joke DatasetA dataset of 200k English plaintext jokes.
Stars: ✭ 447 (+438.55%)
TR-TPBSA Dataset for Thai Text Summarization with over 310K articles.
Stars: ✭ 25 (-69.88%)
Pysgs📈 Python interface for the Brazilian Central Bank's Time Series Management System (SGS)
Stars: ✭ 60 (-27.71%)
portfolioPersonal portfolio (2018)
Stars: ✭ 388 (+367.47%)
Quickdraw DatasetDocumentation on how to access and use the Quick, Draw! Dataset.
Stars: ✭ 4,622 (+5468.67%)
teanaps자연어 처리와 텍스트 분석을 위한 오픈소스 파이썬 라이브러리 입니다.
Stars: ✭ 91 (+9.64%)
TitleStylistSource code for our "TitleStylist" paper at ACL 2020
Stars: ✭ 72 (-13.25%)
Squad ExplorerVisually Explore the Stanford Question Answering Dataset
Stars: ✭ 421 (+407.23%)
DeepChannelThe pytorch implementation of paper "DeepChannel: Salience Estimation by Contrastive Learning for Extractive Document Summarization"
Stars: ✭ 24 (-71.08%)
Esc 50ESC-50: Dataset for Environmental Sound Classification
Stars: ✭ 631 (+660.24%)
BIRLBIRL: Benchmark on Image Registration methods with Landmark validations
Stars: ✭ 66 (-20.48%)
factsummFactSumm: Factual Consistency Scorer for Abstractive Summarization
Stars: ✭ 83 (+0%)
Wuhan 2019 Ncov2019-nCoV 新冠状病毒 2019-12-01至今国家、省、市三级每日统计数据(支持接口读取)
Stars: ✭ 414 (+398.8%)
storymap-swipeA storytelling template that enables users to reveal a layer of a web map or another web map using a vertical bar or a spy glass.
Stars: ✭ 45 (-45.78%)
WikisqlA large annotated semantic parsing corpus for developing natural language interfaces.
Stars: ✭ 965 (+1062.65%)
DialogueGraphOpen-source node-based tool for developing branching conversation trees
Stars: ✭ 133 (+60.24%)
Maskrcnn ModanetA Mask R-CNN Keras implementation with Modanet annotations on the Paperdoll dataset
Stars: ✭ 59 (-28.92%)
FocusSeq2Seq[EMNLP 2019] Mixture Content Selection for Diverse Sequence Generation (Question Generation / Abstractive Summarization)
Stars: ✭ 109 (+31.33%)
Comma2k19A driving dataset for the development and validation of fused pose estimators and mapping algorithms
Stars: ✭ 391 (+371.08%)
pn-summaryA well-structured summarization dataset for the Persian language!
Stars: ✭ 29 (-65.06%)
Elastic dataElasticsearch datasets ready for bulk loading
Stars: ✭ 30 (-63.86%)
SRBCode for "Improving Semantic Relevance for Sequence-to-Sequence Learning of Chinese Social Media Text Summarization"
Stars: ✭ 41 (-50.6%)
VpgnetVPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition (ICCV 2017)
Stars: ✭ 382 (+360.24%)
TimelinestorytellerAn expressive visual storytelling environment for presenting timelines on the web and in Power BI. Developed at Microsoft Research.
Stars: ✭ 244 (+193.98%)
MmsaCH-SIMS: A Chinese Multimodal Sentiment Analysis Dataset with Fine-grained Annotations of Modality (ACL2020)
Stars: ✭ 70 (-15.66%)
Storymap TourThe Story Map Tour is ideal when you want to present a linear, place-based narrative featuring images or videos.
Stars: ✭ 146 (+75.9%)
MeldMELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversation
Stars: ✭ 373 (+349.4%)
NarrowsOnline storytelling system
Stars: ✭ 109 (+31.33%)
Dns Lots Of Lookupsdnslol is a command line tool for performing lots of DNS lookups.
Stars: ✭ 30 (-63.86%)
Storymap CascadeThe Story Map Cascade℠ app lets you combine narrative text with maps, images, and multimedia content in an engaging, full-screen scrolling experience.
Stars: ✭ 92 (+10.84%)
Nlp Projectsword2vec, sentence2vec, machine reading comprehension, dialog system, text classification, pretrained language model (i.e., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, information extraction (i.e., entity, relation and event extraction), knowledge graph, text generation, network embedding
Stars: ✭ 360 (+333.73%)
Chinese Names Corpus中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。
Stars: ✭ 3,053 (+3578.31%)
Stevens Vlp16 DatasetThis dataset is captured using a Velodyne VLP-16, which is mounted on an UGV - Clearpath Jackal, on Stevens Institute of Technology campus
Stars: ✭ 58 (-30.12%)
Cities.jsonCities of the world in Json, based on GeoNames Gazetteer
Stars: ✭ 251 (+202.41%)
DataPython related videos and metadata powering =>
Stars: ✭ 355 (+327.71%)
Recommendersystem DatasetThis repository contains some datasets that I have collected in Recommender Systems.
Stars: ✭ 249 (+200%)
FeversymmetricSymmetric evaluation set based on the FEVER (fact verification) dataset
Stars: ✭ 29 (-65.06%)
Taco🌮 Trash Annotations in Context Dataset Toolkit
Stars: ✭ 243 (+192.77%)
Medmnist[ISBI'21] MedMNIST Classification Decathlon: A Lightweight AutoML Benchmark for Medical Image Analysis
Stars: ✭ 338 (+307.23%)
ChazutsuThe tool to make NLP datasets ready to use
Stars: ✭ 238 (+186.75%)
UrbannavdatasetUrbanNav: an Open-Sourcing Localization Data Collected in Asian Urban Canyons, Including Tokyo and Hong Kong
Stars: ✭ 79 (-4.82%)
Covid Chestxray DatasetWe are building an open database of COVID-19 cases with chest X-ray or CT images.
Stars: ✭ 2,759 (+3224.1%)
Eseur Code DataCode and data used to create the examples in "Evidence-based Software Engineering based on the publicly available data"
Stars: ✭ 340 (+309.64%)
University1652 BaselineACM Multimedia2020 University-1652: A Multi-view Multi-source Benchmark for Drone-based Geo-localization 🚁 annotates 1652 buildings in 72 universities around the world.
Stars: ✭ 232 (+179.52%)
Jsut LabHTS-style full-context labels for JSUT v1.1
Stars: ✭ 28 (-66.27%)
Datasetssource{d} datasets ("big code") for source code analysis and machine learning on source code
Stars: ✭ 231 (+178.31%)
Deeperforensics 1.0[CVPR 2020] A Large-Scale Dataset for Real-World Face Forgery Detection
Stars: ✭ 338 (+307.23%)
WeatherbenchA benchmark dataset for data-driven weather forecasting
Stars: ✭ 227 (+173.49%)
Storymap SeriesThe Story Map Series lets you present a series of maps via tabs, numbered bullets, or a side accordion.
Stars: ✭ 57 (-31.33%)
snorkelingExtracting biomedical relationships from literature with Snorkel 🏊
Stars: ✭ 56 (-32.53%)
NndialNNDial is an open source toolkit for building end-to-end trainable task-oriented dialogue models. It is released by Tsung-Hsien (Shawn) Wen from Cambridge Dialogue Systems Group under Apache License 2.0.
Stars: ✭ 332 (+300%)
Vidvrd HelperTo keep updates with VRU Grand Challenge, please use https://github.com/NExTplusplus/VidVRD-helper
Stars: ✭ 81 (-2.41%)
Color NamesLarge list of handpicked color names 🌈
Stars: ✭ 1,198 (+1343.37%)
Convai Bot 1337NIPS Conversational Intelligence Challenge 2017 Winner System: Skill-based Conversational Agent with Supervised Dialog Manager
Stars: ✭ 65 (-21.69%)