RetrieverQuickly download, clean up, and install public datasets into a database management system
Stars: ✭ 241 (-11.4%)
MaskedFaceRepresentationMasked face recognition focuses on identifying people using their facial features while they are wearing masks. We introduce benchmarks on face verification based on masked face images for the development of COVID-safe protocols in airports.
Stars: ✭ 17 (-93.75%)
recurrent-defocus-deblurring-synth-dual-pixelReference github repository for the paper "Learning to Reduce Defocus Blur by Realistically Modeling Dual-Pixel Data". We propose a procedure to generate realistic DP data synthetically. Our synthesis approach mimics the optical image formation found on DP sensors and can be applied to virtual scenes rendered with standard computer software. Lev…
Stars: ✭ 30 (-88.97%)
Structured3d[ECCV'20] Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling
Stars: ✭ 224 (-17.65%)
covid19-data-greeceDatasets and analysis of Novel Coronavirus (COVID-19) outbreak in Greece
Stars: ✭ 16 (-94.12%)
TextData loaders and abstractions for text and NLP
Stars: ✭ 2,915 (+971.69%)
icedataIceData: Datasets Hub for the *IceVision* Framework
Stars: ✭ 41 (-84.93%)
Audio-Classification-using-CNN-MLPMulti class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to identify sound of a bee, cricket or noise.
Stars: ✭ 36 (-86.76%)
HARRecognize one of six human activities such as standing, sitting, and walking using a Softmax Classifier trained on mobile phone sensor data.
Stars: ✭ 18 (-93.38%)
TorchdataPyTorch dataset extended with map, cache etc. (tensorflow.data like)
Stars: ✭ 226 (-16.91%)
OTT-QACode and Data for ICLR2021 Paper "Open Question Answering over Tables and Text"
Stars: ✭ 92 (-66.18%)
NLPrep🍳 NLPrep - dataset tool for many natural language processing task
Stars: ✭ 26 (-90.44%)
DatasetsTFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
Stars: ✭ 3,094 (+1037.5%)
user qualityDataset for Software Evolution and Quality Improvement
Stars: ✭ 27 (-90.07%)
Cocostuff10kThe official homepage of the (outdated) COCO-Stuff 10K dataset.
Stars: ✭ 248 (-8.82%)
JschemaA simple, easy to use data modeling framework for JavaScript
Stars: ✭ 261 (-4.04%)
Covid 19 Repo DataData archive of identifiable COVID-19 related public projects on GitHub
Stars: ✭ 236 (-13.24%)
squad-v1.1-ptPortuguese translation of the SQuAD dataset
Stars: ✭ 13 (-95.22%)
DataladKeep code, data, containers under control with git and git-annex
Stars: ✭ 234 (-13.97%)
Stocknet DatasetA comprehensive dataset for stock movement prediction from tweets and historical stock prices.
Stars: ✭ 228 (-16.18%)
snorkelingExtracting biomedical relationships from literature with Snorkel 🏊
Stars: ✭ 56 (-79.41%)
StationaryGet hourly meteorological data from one of thousands of global stations
Stars: ✭ 225 (-17.28%)
tracing-vs-freehandTracing Versus Freehand for Evaluating Computer-Generated Drawings (SIGGRAPH 2021)
Stars: ✭ 21 (-92.28%)
ACVR2017An Innovative Salient Object Detection Using Center-Dark Channel Prior
Stars: ✭ 20 (-92.65%)
FakenewscorpusA dataset of millions of news articles scraped from a curated list of data sources.
Stars: ✭ 255 (-6.25%)
TVQAplus[ACL 2020] PyTorch code for TVQA+: Spatio-Temporal Grounding for Video Question Answering
Stars: ✭ 99 (-63.6%)
BugZooKeep your bugs contained. A platform for studying historical software bugs.
Stars: ✭ 49 (-81.99%)
climateRAn R 📦 for getting point and gridded climate data by AOI
Stars: ✭ 93 (-65.81%)
Game Datasets🎮 A curated list of awesome game datasets, and tools to artificial intelligence in games
Stars: ✭ 261 (-4.04%)
Chinese Names Corpus中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。
Stars: ✭ 3,053 (+1022.43%)
Cities.jsonCities of the world in Json, based on GeoNames Gazetteer
Stars: ✭ 251 (-7.72%)
dbcollectionA collection of popular datasets for deep learning.
Stars: ✭ 26 (-90.44%)
Recommendersystem DatasetThis repository contains some datasets that I have collected in Recommender Systems.
Stars: ✭ 249 (-8.46%)
HJDatasetA Large Dataset of Historical Japanese Documents with Complex Layouts
Stars: ✭ 19 (-93.01%)
Taco🌮 Trash Annotations in Context Dataset Toolkit
Stars: ✭ 243 (-10.66%)
Ergo🧠 A tool that makes AI easier.
Stars: ✭ 264 (-2.94%)
ChazutsuThe tool to make NLP datasets ready to use
Stars: ✭ 238 (-12.5%)
mxmortalitydbA data only R package containing all injury intent deaths registered in Mexico from 2004 to 2019
Stars: ✭ 20 (-92.65%)
Covid Chestxray DatasetWe are building an open database of COVID-19 cases with chest X-ray or CT images.
Stars: ✭ 2,759 (+914.34%)
StrayVisualizerVisualize Data From Stray Scanner https://keke.dev/blog/2021/03/10/Stray-Scanner.html
Stars: ✭ 30 (-88.97%)
University1652 BaselineACM Multimedia2020 University-1652: A Multi-view Multi-source Benchmark for Drone-based Geo-localization 🚁 annotates 1652 buildings in 72 universities around the world.
Stars: ✭ 232 (-14.71%)
pump-and-dump-datasetAdditional material for paper: Pump and Dumps in the Bitcoin Era: Real Time Detection of Cryptocurrency Market Manipulations, ICCCN '20
Stars: ✭ 66 (-75.74%)
Datasetssource{d} datasets ("big code") for source code analysis and machine learning on source code
Stars: ✭ 231 (-15.07%)
Dataset ApiThe ApolloScape Open Dataset for Autonomous Driving and its Application.
Stars: ✭ 260 (-4.41%)
WeatherbenchA benchmark dataset for data-driven weather forecasting
Stars: ✭ 227 (-16.54%)
BIRLBIRL: Benchmark on Image Registration methods with Landmark validations
Stars: ✭ 66 (-75.74%)
AITQAresources for the IBM Airlines Table-Question-Answering Benchmark
Stars: ✭ 12 (-95.59%)
Datagear数据可视化分析平台,使用Java语言开发,采用浏览器/服务器架构,支持SQL、CSV、Excel、HTTP接口、JSON等多种数据源
Stars: ✭ 266 (-2.21%)
HubDataset format for AI. Build, manage, & visualize datasets for deep learning. Stream data real-time to PyTorch/TensorFlow & version-control it. https://activeloop.ai
Stars: ✭ 4,003 (+1371.69%)
Awesome MsrA curated repository of software engineering repository mining data sets
Stars: ✭ 257 (-5.51%)
Open-korean-corporaOpen Korean NLP Dataset Curation for the Users All Around the Globe
Stars: ✭ 82 (-69.85%)