exams-qaA Multi-subject High School Examinations Dataset for Cross-lingual and Multilingual Question Answering
BillSumUS Bill Summarization Corpus
Combinatorial-3D-Shape-GenerationThe official repository for the paper "Combinatorial 3D Shape Generation via Sequential Assembly", presented at the NeurIPS 2020 Workshop on Machine Learning for Engineering Modeling, Simulation, and Design
shrec17Supplementary code for SHREC 2017 RGB-D Object-to-CAD Retrieval track
RamaNetPerforms de novo protein design using machine learning and PyRosetta to generate a novel protein structure
ETDatasetThe Electricity Transformer dataset, collected to support further investigation of the long sequence forecasting problem.
dataset-ssvep-exoskeletonSSVEP-based BCI recordings of 12 subjects operating an upper limb exoskeleton during a shared control task. The exoskeleton is controlled either with a touchless interface that detects hand poses or with the BCI.
clip-italianCLIP (Contrastive Language–Image Pre-training) for Italian
awesome-indoor-farmingA curated list of awesome datasets, technologies, companies, and media about Indoor Farming.
SpatialSenseAn Adversarially Crowdsourced Benchmark for Spatial Relation Recognition
budongsanR package providing real datasets and functions for analyzing actual housing transaction prices
void-datasetVisual Odometry with Inertial and Depth (VOID) dataset
pyreportspyreports is a Python library that lets you create complex reports from various sources
vedaiThis repository is for training TensorFlow models. The dataset is based on Vehicle Detection in Aerial Imagery (VEDAI)
tape-neurips2019Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different domains of protein biology. (DEPRECATED)
Cross-Language-DatasetA multilingual, multi-style and multi-granularity dataset for cross-language textual similarity detection
neuspellNeuSpell: A Neural Spelling Correction Toolkit
labelme2DatasetsPython scripts to convert labelme-generated JSON files to VOC/COCO-style datasets.
cia🐱💻 CIA Factbook data analysis and dataset reconstruction, modification, and tuning go here.
doccano-transformerThe official tool for transforming doccano format into common dataset formats.
conferencias matutinas amloCSVs of the stenographic transcripts of the morning press conferences of President Andrés Manuel López Obrador (Mañaneras AMLO)
Phishing-DatasetPhishing dataset with more than 88,000 instances and 111 features. Web application available at https://gregavrbancic.github.io/Phishing-Dataset/
FLOBOTEU-funded Horizon 2020 project
metadatMeta-analytic datasets for R
docker-datasetDocker database images with pre-populated data for testing and/or practice.
Neural-Scam-ArtistWeb Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset.
FacialEmotionRecognitionClassifies basic human facial emotion expressions with an artificial neural network (ANN), using the Extended Cohn-Kanade AU-Coded Facial Expression Database
TCPDThe Turing Change Point Dataset - A collection of time series for the evaluation and development of change point detection algorithms
sidechainnetAn all-atom protein structure dataset for machine learning.
beirA Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
chronistLong-term analysis of emotion, age, and sentiment using Lifeslice and text records.
RGBDAcquisitionA uniform library wrapper for input from V4L2, Freenect, OpenNI, OpenNI2, DepthSense, Intel Realsense, OpenGL simulations, and other types of video and depth input.
FedScaleFedScale is a scalable and extensible open-source federated learning (FL) platform.
SQLiteHelper🗄 This project comes in handy when you want to write SQL statements more easily and smartly.
recsys slates datasetFINN.no Slate Dataset for Recommender Systems. A dataset containing all interactions (viewed items + response (clicked item / no click)) for users over a longer time horizon.
MVDet[ECCV 2020] Code and MultiviewX dataset for "Multiview Detection with Feature Perspective Transformation".
PlantDoc-DatasetDataset used in "PlantDoc: A Dataset for Visual Plant Disease Detection", accepted at CODS-COMAD 2020
CrowdFlowOptical Flow Dataset and Benchmark for Visual Crowd Analysis
punks.attributescryptopunks.csv - All 10 000 CryptoPunks by ID with Type, Accessories & More