Top 765 dataset open source projects

metal dataset
metal lyrics and band names dataset (raw)
UrbanLoco
UrbanLoco: A Full Sensor Suite Dataset for Mapping and Localization in Urban Scenes
2020a SSH mapping NATL60
A challenge on the mapping of satellite altimeter sea surface height data organised by MEOM@IGE, Ocean-Next and CLS.
VisDrone-dataset-python-toolkit
This repository provides a basic Pythonic toolkit for the VisDrone-Dataset (2018).
awesome-climate-data
Data sources, programming libraries and open source organisations that are working on the climate emergency
berlin corona cases
Scraper for the official dashboard with current Corona case numbers, traffic light indicators ("Corona-Ampel") and vaccination situation for Berlin.
RecSysDatasets
This is a repository of public data sources for Recommender Systems (RS).
DVQA dataset
DVQA Dataset: A Bar chart question answering dataset presented at CVPR 2018
DeT
Dataset and Code for the paper "DepthTrack: Unveiling the Power of RGBD Tracking" (ICCV2021), and "Depth-only Object Tracking" (BMVC2021)
THE-SPARKS-FOUNDATION
📌 This repo contains basic to advanced level machine learning and business analysis projects. 👨‍💻
sp-subway-scraper
🚆 This web scraper builds a dataset of São Paulo subway operation status
ABSADatasets
Public & Community-shared datasets for Aspect-based sentiment analysis and Text Classification
The-SUSTech-SYSU-dataset-for-automatically-segmenting-and-classifying-corneal-ulcers
This is an official repository of corneal ulcer classification and segmentation for our Sci Data paper "The SUSTech-SYSU dataset for automatically segmenting and classifying corneal ulcers". https://doi.org/10.1038/s41597-020-0360-7
multi-contact-grasping
This project implements a simulated grasp-and-lift process in V-REP using the Barrett Hand, with an interface through a Python remote API.
attention-target-detection
[CVPR2020] "Detecting Attended Visual Targets in Video"
auk
Working with eBird data in R
coronavirus-mask-image-dataset
Image dataset from Instagram of people wearing medical masks, no mask, or a non-medical (DIY) mask
torchvideo
🎥 Datasets, transforms and samplers for video in PyTorch
HydroData
An R 📦 for finding and getting geospatial earth systems data
NewsMTSC
Target-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k sentences and a state-of-the-art classification model.
EduData
EduData: datasets in education and a convenient interface for downloading and preprocessing them
adage
Data and code related to the paper "ADAGE-Based Integration of Publicly Available Pseudomonas aeruginosa..." Jie Tan, et al · mSystems · 2016
WikiTableQuestions
A dataset of complex questions on semi-structured Wikipedia tables
Legal-Entity-Recognition
A Dataset of German Legal Documents for Named Entity Recognition
covid-19
Current and historical coronavirus covid-19 confirmed, recovered, deaths and active case counts segmented by country and region. Includes csv, json and sqlite data along with an interactive website explorer.
AudioCaption
Dataset and baseline for the first Audiocaption task
mnist1d
A 1D analogue of the MNIST dataset for measuring spatial biases and answering "science of deep learning" questions.
midi degradation toolkit
A toolkit for generating datasets of MIDI files which have been degraded to be 'un-musical'.
wider-face-pascal-voc-annotations
WIDER FACE annotations converted to the Pascal VOC XML format
M2DGR
M2DGR: a Multi-modal and Multi-scenario Dataset for Ground Robots
permuted-bAbI-dialog-tasks
Dataset for 'Learning End-to-End Goal-Oriented Dialog with Multiple Answers' EMNLP 2018
PororoQA
PororoQA, https://arxiv.org/abs/1707.00836
VISO
[IEEE TGRS 2021] Detecting and Tracking Small and Dense Moving Objects in Satellite Videos: A Benchmark
state-codes
Brazilian states 2-letter codes (ISO 3166-2:BR), official abbreviations throughout the country's history
pedx
Python tools for working with the PedX dataset.
dataspice
🌶️ Create lightweight schema.org descriptions of your datasets
nhlplaybyplay-node
Fetch and Convert NHL Play by Play game data