WatermarkrecoPytorch implementation of the paper "Large-Scale Historical Watermark Recognition: dataset and a new consistency-based approach"
LetsgodatasetThis repository makes the integral Let's Go dataset publicly available.
Covid CtsetLarge Covid-19 CT scans dataset from paper: https://doi.org/10.1101/2020.06.08.20121541
Qriyou're invited to a data party!
PtsQuantized Mesh Terrain Data Generator and Server for CesiumJS Library
Okutama ActionOkutama-Action: An Aerial View Video Dataset for Concurrent Human Action Detection
DataconfsA list of conferences connected with data worldwide.
Multi PlierAn unsupervised transfer learning approach for rare disease transcriptomics
WikisqlA large annotated semantic parsing corpus for developing natural language interfaces.
FeversymmetricSymmetric evaluation set based on the FEVER (fact verification) dataset
Jsut LabHTS-style full-context labels for JSUT v1.1
TedsdsApache Spark - Turbofan Engine Degradation Simulation Data Set example in Apache Spark
Khayyam106 Omar Khayyam quatrains in YAML format.
FacerankFaceRank - Rank Face by CNN Model based on TensorFlow (add keras version). FaceRank-人脸打分基于 TensorFlow (新增 Keras 版本) 的 CNN 模型(QQ群:167122861)。技术支持:http://tensorflow123.com
Musical Onset EfficientSupplementary information and code for the paper: An efficient deep learning model for musical onset detection
MobiusC# and F# language binding and extensions to Apache Spark
Cophy"CoPhy: Counterfactual Learning of Physical Dynamics", F. Baradel, N. Neverova, J. Mille, G. Mori, C. Wolf, ICLR'2020
Imagenetscraper👁 Bulk-download all thumbnails from an ImageNet synset, with optional rescaling
RdhsAPI Client and Data Munging for the Demographic and Health Survey Data
Covid CtCOVID-CT-Dataset: A CT Scan Dataset about COVID-19
Datastream.ioAn open-source framework for real-time anomaly detection using Python, ElasticSearch and Kibana
Osint collectionMaintained collection of OSINT related resources. (All Free & Actionable)
Clusterdatacluster data collected from production clusters in Alibaba for cluster management research
Cluener2020CLUENER2020 中文细粒度命名实体识别 Fine Grained Named Entity Recognition
Chatito🎯🗯 Generate datasets for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL!
Person searchJoint Detection and Identification Feature Learning for Person Search
ProteinnetStandardized data set for machine learning of protein structure
Devblogs+2600 developer-related blogs and publications.
Esc 50ESC-50: Dataset for Environmental Sound Classification
Gensim DataData repository for pretrained NLP models and NLP corpora.
Label StudioLabel Studio is a multi-type data labeling and annotation tool with standardized output format
CvatPowerful and efficient Computer Vision Annotation Tool (CVAT)
Total Text DatasetTotal Text Dataset. It consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one of a kind.