Datagear数据可视化分析平台,使用Java语言开发,采用浏览器/服务器架构,支持SQL、CSV、Excel、HTTP接口、JSON等多种数据源
Ergo🧠 A tool that makes AI easier.
Game Datasets🎮 A curated list of awesome game datasets, and tools to artificial intelligence in games
JschemaA simple, easy to use data modeling framework for JavaScript
Dataset ApiThe ApolloScape Open Dataset for Autonomous Driving and its Application.
Awesome MsrA curated repository of software engineering repository mining data sets
FakenewscorpusA dataset of millions of news articles scraped from a curated list of data sources.
NLPrep🍳 NLPrep - dataset tool for many natural language processing task
dbcollectionA collection of popular datasets for deep learning.
icedataIceData: Datasets Hub for the *IceVision* Framework
StrayVisualizerVisualize Data From Stray Scanner https://keke.dev/blog/2021/03/10/Stray-Scanner.html
AITQAresources for the IBM Airlines Table-Question-Answering Benchmark
tracing-vs-freehandTracing Versus Freehand for Evaluating Computer-Generated Drawings (SIGGRAPH 2021)
OTT-QACode and Data for ICLR2021 Paper "Open Question Answering over Tables and Text"
BugZooKeep your bugs contained. A platform for studying historical software bugs.
user qualityDataset for Software Evolution and Quality Improvement
HJDatasetA Large Dataset of Historical Japanese Documents with Complex Layouts
MaskedFaceRepresentationMasked face recognition focuses on identifying people using their facial features while they are wearing masks. We introduce benchmarks on face verification based on masked face images for the development of COVID-safe protocols in airports.
mxmortalitydbA data only R package containing all injury intent deaths registered in Mexico from 2004 to 2019
pump-and-dump-datasetAdditional material for paper: Pump and Dumps in the Bitcoin Era: Real Time Detection of Cryptocurrency Market Manipulations, ICCCN '20
Audio-Classification-using-CNN-MLPMulti class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to identify sound of a bee, cricket or noise.
BIRLBIRL: Benchmark on Image Registration methods with Landmark validations
snorkelingExtracting biomedical relationships from literature with Snorkel 🏊
HARRecognize one of six human activities such as standing, sitting, and walking using a Softmax Classifier trained on mobile phone sensor data.
ACVR2017An Innovative Salient Object Detection Using Center-Dark Channel Prior
recurrent-defocus-deblurring-synth-dual-pixelReference github repository for the paper "Learning to Reduce Defocus Blur by Realistically Modeling Dual-Pixel Data". We propose a procedure to generate realistic DP data synthetically. Our synthesis approach mimics the optical image formation found on DP sensors and can be applied to virtual scenes rendered with standard computer software. Lev…
TVQAplus[ACL 2020] PyTorch code for TVQA+: Spatio-Temporal Grounding for Video Question Answering
climateRAn R 📦 for getting point and gridded climate data by AOI
Complete-Blood-Cell-Count-DatasetThe complete blood count (CBC) dataset contains a total of 360 blood smear images of red blood cells (RBCs), white blood cells (WBCs), and Platelets with annotations.
DeepSentiPersRepository for the experiments described in the paper named "DeepSentiPers: Novel Deep Learning Models Trained Over Proposed Augmented Persian Sentiment Corpus"
download audioset📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
WeFEND-AAAI20Dataset for paper "Weak Supervision for Fake News Detection via Reinforcement Learning" published in AAAI'2020.
COVID-19-DatasetsNovel Coronavirus (COVID-19) Cases for India, provided by University of Kalyani.
KVQAKorean Visual Question Answering
cspan dataA repo for tracking the number of followers of Congress, the Cabinet, and Governors
ebe-datasetEvidence-based Explanation Dataset (AACL-IJCNLP 2020)
grasp multiObjectRobotic grasp dataset for multi-object multi-grasp evaluation with RGB-D data. This dataset is annotated using the same protocal as Cornell Dataset, and can be used as multi-object extension of Cornell Dataset.
drone-nethttps://towardsdatascience.com/tutorial-build-an-object-detection-system-using-yolo-9a930513643a
uctfUnsupervised Controllable Text Generation (Applied to text Formalization)
msmdA Multimodal Audio Sheet Music Dataset