Top 765 dataset open source projects

bilkent-turkish-writings-dataset
Turkish writings dataset that promotes creativity, content, composition, grammar, spelling and punctuation.
Fall-Detection-Dataset
FUKinect-Fall dataset was created using Kinect V1. The dataset includes walking, bending, sitting, squatting, lying and falling actions performed by 21 subjects between 19-72 years of age.
UAV-Human
[CVPR2021] UAV-Human: A Large Benchmark for Human Behavior Understanding with Unmanned Aerial Vehicles
BOVText-Benchmark
BOVText: A Large-Scale, Multidimensional Multilingual Dataset for Video Text Spotting
covid-19-image-repository
Anonymized dataset of COVID-19 cases with a focus on radiological imaging. This includes images (X-ray / CT) with extensive metadata, such as admission, ICU, laboratory, and patient master data.
Coursera-Machine-Learning-Andrew-NG
Repository of my assignments for the Coursera Machine Learning course by Stanford, taught by Andrew Ng.
TVCaption
[ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset
CMID
Chinese Medical Intent Dataset
epl mysql db
Free/open English Premier League results database from 1993-2017. Dumps are provided in MySQL and SQLite formats.
Essential-Solar-Energy-and-Storage-Software-Resources
Curated links to APIs, SDKs, platforms and tools relevant to solar energy and battery storage
TypeNet
A hierarchical type system for fine-grained entity typing
When-in-Rome
A meta-corpus of functional harmonic analysis.
RSCD
[CVPR2021] Towards Rolling Shutter Correction and Deblurring in Dynamic Scenes
meta-coronavirus-dataset
MetaCOVID: META-Coronavirus dataset repository
img2dataset
Easily turn large sets of image URLs into an image dataset. Can download, resize and package 100M URLs in 20h on one machine.
LabelPropagation
A NetworkX implementation of Label Propagation from "Near Linear Time Algorithm to Detect Community Structures in Large-Scale Networks" (Physical Review E 2008).
powerai-vision-object-detection
Use deep learning to create a model and a REST endpoint to allow your app to detect, locate and count your product on store shelves
DiscordLists
Tracking Discord data
HandyNet
Akshay Rangesh and Mohan M. Trivedi, "HandyNet: A One-stop Solution to Detect, Segment, Localize & Analyze Driver Hands," IEEE Conference on Computer Vision and Pattern Recognition - 3D HUMANS Workshop, 2018.
datacatalog
Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flyte's memoization system.
car-logos-dataset
Collection of 374 car logo images in a few size variations, with a JSON file for better usability.
ASH-IR-Dataset
An impulse response dataset for binaural synthesis of spatial audio systems on headphones
nytwit
New York Times Word Innovation Types dataset
EconData
R package containing a host of datasets useful for economic research. Complete with raw data and cleaning functions.
transformers-lightning
A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transformers.
Sequoia
A neural network for Counter-Strike: Global Offensive character detection and classification. Built on a custom-made dataset (csgo-data-collector)
scipp
Multi-dimensional data arrays with labeled dimensions
Molecules Dataset Collection
Collection of molecule datasets for validating property inference
dialogre
Dialogue-Based Relation Extraction
ckanext-datarequests
A plugin that allows users to request data that is not published yet
inferring-hidden-structure-retinal-circuits
Data and example scripts used in the paper `Inferring hidden structure in multilayered neural circuits`
fastdownload
Easily download, verify, and extract archives
emotion dataset
😄 Dataset for Emotion Classification
ava downloader
⏬ Download AVA dataset (A Large-Scale Database for Aesthetic Visual Analysis)
auctus
Dataset search engine, discovering data from a variety of sources, profiling it, and allowing advanced queries on the index
BadMedicine
Library and CLI for randomly generating medical data like you might get out of an Electronic Health Records (EHR) system
biomechanics dataset
Information on publicly available datasets for biomechanics.
pylabel
Python library for computer vision labeling tasks. The core functionality is to translate bounding box annotations between different formats, for example, from COCO to YOLO.