Taco - 🌮 Trash Annotations in Context Dataset Toolkit
Stars: ✭ 243 (+834.62%)
Datasets - source{d} datasets ("big code") for source code analysis and machine learning on source code
Stars: ✭ 231 (+788.46%)
Cities.json - Cities of the world in JSON, based on GeoNames Gazetteer
Stars: ✭ 251 (+865.38%)
H36m Fetch - Human 3.6M 3D human pose dataset fetcher
Stars: ✭ 220 (+746.15%)
BIRL - BIRL: Benchmark on Image Registration methods with Landmark validations
Stars: ✭ 66 (+153.85%)
Covid Chestxray Dataset - We are building an open database of COVID-19 cases with chest X-ray or CT images.
Stars: ✭ 2,759 (+10511.54%)
TVQAplus - [ACL 2020] PyTorch code for TVQA+: Spatio-Temporal Grounding for Video Question Answering
Stars: ✭ 99 (+280.77%)
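Chinese Names Corpus - Chinese personal name corpus and name generator. Covers Chinese full names, surnames, given names, forms of address, Japanese names, transliterated names, and English names. Useful for Chinese word segmentation and person-name entity recognition.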
Stars: ✭ 3,053 (+11642.31%)
Dataset Serialize - JSON to DataSet and DataSet to JSON converter for Delphi and Lazarus (FPC)
Stars: ✭ 213 (+719.23%)
pump-and-dump-dataset - Additional material for paper: Pump and Dumps in the Bitcoin Era: Real Time Detection of Cryptocurrency Market Manipulations, ICCCN '20
Stars: ✭ 66 (+153.85%)
Recommendersystem Dataset - This repository contains some datasets that I have collected for Recommender Systems.
Stars: ✭ 249 (+857.69%)
BugZoo - Keep your bugs contained. A platform for studying historical software bugs.
Stars: ✭ 49 (+88.46%)
Chazutsu - The tool to make NLP datasets ready to use
Stars: ✭ 238 (+815.38%)
University1652 Baseline - ACM Multimedia 2020 University-1652: A Multi-view Multi-source Benchmark for Drone-based Geo-localization 🚁 annotates 1652 buildings in 72 universities around the world.
Stars: ✭ 232 (+792.31%)
AITQA - Resources for the IBM Airlines Table-Question-Answering Benchmark
Stars: ✭ 12 (-53.85%)
Weatherbench - A benchmark dataset for data-driven weather forecasting
Stars: ✭ 227 (+773.08%)
ACVR2017 - An Innovative Salient Object Detection Using Center-Dark Channel Prior
Stars: ✭ 20 (-23.08%)
Stationary - Get hourly meteorological data from one of thousands of global stations
Stars: ✭ 225 (+765.38%)
HJDataset - A Large Dataset of Historical Japanese Documents with Complex Layouts
Stars: ✭ 19 (-26.92%)
Bccd dataset - BCCD (Blood Cell Count and Detection) Dataset is a small-scale dataset for blood cell detection.
Stars: ✭ 216 (+730.77%)
climateR - An R 📦 for getting point and gridded climate data by AOI
Stars: ✭ 93 (+257.69%)
cmor - Climate Model Output Rewriter
Stars: ✭ 42 (+61.54%)
Dialogrpt - EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"
Stars: ✭ 216 (+730.77%)
squad-v1.1-pt - Portuguese translation of the SQuAD dataset
Stars: ✭ 13 (-50%)
Datasets - TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
Stars: ✭ 3,094 (+11800%)
OTT-QA - Code and Data for ICLR2021 Paper "Open Question Answering over Tables and Text"
Stars: ✭ 92 (+253.85%)
Text - Data loaders and abstractions for text and NLP
Stars: ✭ 2,915 (+11111.54%)
Audio-Classification-using-CNN-MLP - Multi-class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi-class classifier to identify the sound of a bee, a cricket, or noise.
Stars: ✭ 36 (+38.46%)
Cocostuff10k - The official homepage of the (outdated) COCO-Stuff 10K dataset.
Stars: ✭ 248 (+853.85%)
Retriever - Quickly download, clean up, and install public datasets into a database management system
Stars: ✭ 241 (+826.92%)
snorkeling - Extracting biomedical relationships from literature with Snorkel 🏊
Stars: ✭ 56 (+115.38%)
Covid 19 Repo Data - Data archive of identifiable COVID-19 related public projects on GitHub
Stars: ✭ 236 (+807.69%)
covid19-data-greece - Datasets and analysis of the Novel Coronavirus (COVID-19) outbreak in Greece
Stars: ✭ 16 (-38.46%)
Datalad - Keep code, data, containers under control with git and git-annex
Stars: ✭ 234 (+800%)
icedata - IceData: Datasets Hub for the *IceVision* Framework
Stars: ✭ 41 (+57.69%)
Structured3d - [ECCV'20] Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling
Stars: ✭ 224 (+761.54%)
HAR - Recognize one of six human activities such as standing, sitting, and walking using a Softmax Classifier trained on mobile phone sensor data.
Stars: ✭ 18 (-30.77%)
Stocknet Dataset - A comprehensive dataset for stock movement prediction from tweets and historical stock prices.
Stars: ✭ 228 (+776.92%)
user quality - Dataset for Software Evolution and Quality Improvement
Stars: ✭ 27 (+3.85%)
Torchdata - PyTorch dataset extended with map, cache, etc. (similar to tensorflow.data)
Stars: ✭ 226 (+769.23%)
recurrent-defocus-deblurring-synth-dual-pixel - Reference GitHub repository for the paper "Learning to Reduce Defocus Blur by Realistically Modeling Dual-Pixel Data". We propose a procedure to generate realistic DP data synthetically. Our synthesis approach mimics the optical image formation found on DP sensors and can be applied to virtual scenes rendered with standard computer software. Lev…
Stars: ✭ 30 (+15.38%)
Open-korean-corpora - Open Korean NLP Dataset Curation for the Users All Around the Globe
Stars: ✭ 82 (+215.38%)
Collection - Collection Data for Cooper Hewitt, Smithsonian Design Museum
Stars: ✭ 214 (+723.08%)
Datatable - A Go in-memory table
Stars: ✭ 215 (+726.92%)
MaskedFaceRepresentation - Masked face recognition focuses on identifying people using their facial features while they are wearing masks. We introduce benchmarks on face verification based on masked face images for the development of COVID-safe protocols in airports.
Stars: ✭ 17 (-34.62%)
dbcollection - A collection of popular datasets for deep learning.
Stars: ✭ 26 (+0%)
StrayVisualizer - Visualize Data From Stray Scanner https://keke.dev/blog/2021/03/10/Stray-Scanner.html
Stars: ✭ 30 (+15.38%)
tracing-vs-freehand - Tracing Versus Freehand for Evaluating Computer-Generated Drawings (SIGGRAPH 2021)
Stars: ✭ 21 (-19.23%)
mxmortalitydb - A data-only R package containing all injury intent deaths registered in Mexico from 2004 to 2019
Stars: ✭ 20 (-23.08%)
prepublish - Simplifies the prepare step (bundling, transpiling, rebasing) when publishing npm packages.
Stars: ✭ 21 (-19.23%)