Retriever
Quickly download, clean up, and install public datasets into a database management system
Stars: 241 (+322.81%)
Eseur Code Data
Code and data used to create the examples in "Evidence-based Software Engineering based on the publicly available data"
Stars: 340 (+496.49%)
Covid 19 Repo Data
Data archive of identifiable COVID-19 related public projects on GitHub
Stars: 236 (+314.04%)
Pcam
The PatchCamelyon (PCam) deep learning classification benchmark.
Stars: 340 (+496.49%)
Covid Chestxray Dataset
We are building an open database of COVID-19 cases with chest X-ray or CT images.
Stars: 2,759 (+4740.35%)
Structured3d
[ECCV'20] Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling
Stars: 224 (+292.98%)
Whylogs
Profile and monitor your ML data pipeline end-to-end
Stars: 328 (+475.44%)
Stocknet Dataset
A comprehensive dataset for stock movement prediction from tweets and historical stock prices.
Stars: 228 (+300%)
Datastream.io
An open-source framework for real-time anomaly detection using Python, ElasticSearch and Kibana
Stars: 814 (+1328.07%)
Torchdata
PyTorch dataset extended with map, cache etc. (tensorflow.data like)
Stars: 226 (+296.49%)
Caserecommender
Case Recommender: A Flexible and Extensible Python Framework for Recommender Systems
Stars: 318 (+457.89%)
Dataconfs
A list of conferences connected with data worldwide.
Stars: 36 (-36.84%)
Collection
Collection Data for Cooper Hewitt, Smithsonian Design Museum
Stars: 214 (+275.44%)
Datatable
A Go in-memory table
Stars: 215 (+277.19%)
Dialogrpt
EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"
Stars: 216 (+278.95%)
Toflow
TOFlow: Video Enhancement with Task-Oriented Flow
Stars: 314 (+450.88%)
Ava downloader
Download AVA dataset (A Large-Scale Database for Aesthetic Visual Analysis)
Stars: 214 (+275.44%)
Omnianomaly
KDD 2019: Robust Anomaly Detection for Multivariate Time Series through Stochastic Recurrent Neural Network
Stars: 208 (+264.91%)
Css10
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
Stars: 302 (+429.82%)
Charlatan
Create fake data in R
Stars: 209 (+266.67%)
Nlp chinese corpus
Large Scale Chinese Corpus for NLP
Stars: 6,656 (+11577.19%)
Cryptocmd
Cryptocurrency historical price data library in Python. Data from https://coinmarketcap.com.
Stars: 299 (+424.56%)
Split Folders
Split folders with files (i.e. images) into training, validation and test (dataset) folders
Stars: 203 (+256.14%)
Multi Plier
An unsupervised transfer learning approach for rare disease transcriptomics
Stars: 33 (-42.11%)
Semantic Segmentation Suite
Semantic Segmentation Suite in TensorFlow. Implement, train, and test new Semantic Segmentation models easily!
Stars: 2,395 (+4101.75%)
Gval
Expression evaluation in golang
Stars: 297 (+421.05%)
Awesome Json Datasets
A curated list of awesome JSON datasets that don't require authentication.
Stars: 2,421 (+4147.37%)
Caffenet Benchmark
Evaluation of the CNN design choices performance on ImageNet-2012.
Stars: 700 (+1128.07%)
Surface Defect Detection
Constantly summarizing open source datasets and critical papers in the field of surface defect research.
Stars: 287 (+403.51%)
Data Set
State-driven, all-in-one data processing for data visualization
Stars: 191 (+235.09%)
Pycm
Multi-class confusion matrix library in Python
Stars: 1,076 (+1787.72%)
Deeperforensics 1.0
[CVPR 2020] A Large-Scale Dataset for Real-World Face Forgery Detection
Stars: 338 (+492.98%)
University1652 Baseline
ACM Multimedia 2020 University-1652: A Multi-view Multi-source Benchmark for Drone-based Geo-localization. Annotates 1652 buildings in 72 universities around the world.
Stars: 232 (+307.02%)
Chatito
Generate datasets for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL!
Stars: 678 (+1089.47%)
Sice
Learning a Deep Single Image Contrast Enhancer from Multi-Exposure Images (TIP 2018)
Stars: 175 (+207.02%)
Realsr
Toward Real-World Single Image Super-Resolution: A New Benchmark and A New Model (ICCV 2019)
Stars: 282 (+394.74%)
Rstudioconf tweets
A repository for tracking tweets about rstudio::conf
Stars: 32 (-43.86%)
Hand pose action
Dataset and code for the paper "First-Person Hand Action Benchmark with RGB-D Videos and 3D Hand Pose Annotations", CVPR 2018.
Stars: 173 (+203.51%)
Meglass
An eyeglass face dataset collected and cleaned for face recognition evaluation, CCBR 2018.
Stars: 281 (+392.98%)
Faker
Faker is a Python package that generates fake data for you.
Stars: 13,401 (+23410.53%)
Person search
Joint Detection and Identification Feature Learning for Person Search
Stars: 666 (+1068.42%)
Datalad
Keep code, data, containers under control with git and git-annex
Stars: 234 (+310.53%)
Covidnet Ct
COVID-Net Open Source Initiative - Models and Data for COVID-19 Detection in Chest CT
Stars: 57 (+0%)
Php Ml
PHP-ML - Machine Learning library for PHP
Stars: 7,900 (+13759.65%)
Okutama Action
Okutama-Action: An Aerial View Video Dataset for Concurrent Human Action Detection
Stars: 36 (-36.84%)
Covid Ct
COVID-CT-Dataset: A CT Scan Dataset about COVID-19
Stars: 820 (+1338.6%)