Knyfeknyfe is a python utility for rapid exploration of datasets.
Stars: ✭ 54 (-62.24%)
Dataset ApiThe ApolloScape Open Dataset for Autonomous Driving and its Application.
Stars: ✭ 260 (+81.82%)
Awesome MsrA curated repository of software engineering repository mining data sets
Stars: ✭ 257 (+79.72%)
Covid 19Novel Coronavirus 2019 time series data on cases
Stars: ✭ 1,060 (+641.26%)
FakenewscorpusA dataset of millions of news articles scraped from a curated list of data sources.
Stars: ✭ 255 (+78.32%)
DeepweedsA Multiclass Weed Species Image Dataset for Deep Learning
Stars: ✭ 96 (-32.87%)
dbcollectionA collection of popular datasets for deep learning.
Stars: ✭ 26 (-81.82%)
CourseraforumsAnonymized versions of the discussion threads from the forums of 60 Coursera MOOCs
Stars: ✭ 50 (-65.03%)
hereRR package that provides an interface to the HERE REST APIs: Geocoder API, Routing API, Traffic API, Public Transit API and Destination Weather API. Locations and routes are returned as 'sf' objects.
Stars: ✭ 72 (-49.65%)
Netcdf FortranOfficial GitHub repository for netCDF-Fortran libraries, which depend on the netCDF C library. Install the netCDF C library first.
Stars: ✭ 141 (-1.4%)
Distil💧 In memory dataset filtering, inspired by snikch/aggro
Stars: ✭ 49 (-65.73%)
BondBOND: BERT-Assisted Open-Domain Name Entity Recognition with Distant Supervision
Stars: ✭ 96 (-32.87%)
Open-korean-corporaOpen Korean NLP Dataset Curation for the Users All Around the Globe
Stars: ✭ 82 (-42.66%)
MtntCode for the collection and analysis of the MTNT dataset
Stars: ✭ 48 (-66.43%)
OTT-QACode and Data for ICLR2021 Paper "Open Question Answering over Tables and Text"
Stars: ✭ 92 (-35.66%)
Scrna.seq.datasetsCollection of public scRNA-Seq datasets used by our group
Stars: ✭ 118 (-17.48%)
covid19-data-greeceDatasets and analysis of Novel Coronavirus (COVID-19) outbreak in Greece
Stars: ✭ 16 (-88.81%)
Deep trafficMIT DeepTraffic top 2% solution (75.01 mph) 🚗.
Stars: ✭ 47 (-67.13%)
user qualityDataset for Software Evolution and Quality Improvement
Stars: ✭ 27 (-81.12%)
Persian Swear Wordsدیتاست کلمات نامناسب و بد فارسی برای فیلتر کردن متن ها
Stars: ✭ 95 (-33.57%)
MaskedFaceRepresentationMasked face recognition focuses on identifying people using their facial features while they are wearing masks. We introduce benchmarks on face verification based on masked face images for the development of COVID-safe protocols in airports.
Stars: ✭ 17 (-88.11%)
LetsgodatasetThis repository makes the integral Let's Go dataset publicly available.
Stars: ✭ 41 (-71.33%)
mxmortalitydbA data only R package containing all injury intent deaths registered in Mexico from 2004 to 2019
Stars: ✭ 20 (-86.01%)
HakeHAKE: Human Activity Knowledge Engine (CVPR'18/19/20, NeurIPS'20)
Stars: ✭ 132 (-7.69%)
SocketHookSocket hook is an injector based on EasyHook (win only) which redirect the traffic to your local server.
Stars: ✭ 38 (-73.43%)
Qriyou're invited to a data party!
Stars: ✭ 1,003 (+601.4%)
Audio-Classification-using-CNN-MLPMulti class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to identify sound of a bee, cricket or noise.
Stars: ✭ 36 (-74.83%)
Ml PyxisTool for reading and writing datasets of tensors in a Lightning Memory-Mapped Database (LMDB). Designed to manage machine learning datasets with fast reading speeds.
Stars: ✭ 93 (-34.97%)
snorkelingExtracting biomedical relationships from literature with Snorkel 🏊
Stars: ✭ 56 (-60.84%)
HARRecognize one of six human activities such as standing, sitting, and walking using a Softmax Classifier trained on mobile phone sensor data.
Stars: ✭ 18 (-87.41%)
PtsQuantized Mesh Terrain Data Generator and Server for CesiumJS Library
Stars: ✭ 36 (-74.83%)
recurrent-defocus-deblurring-synth-dual-pixelReference github repository for the paper "Learning to Reduce Defocus Blur by Realistically Modeling Dual-Pixel Data". We propose a procedure to generate realistic DP data synthetically. Our synthesis approach mimics the optical image formation found on DP sensors and can be applied to virtual scenes rendered with standard computer software. Lev…
Stars: ✭ 30 (-79.02%)
Core50CORe50: a new Dataset and Benchmark for Continual Learning
Stars: ✭ 91 (-36.36%)
DataconfsA list of conferences connected with data worldwide.
Stars: ✭ 36 (-74.83%)
Datasets🎁 3,000,000+ Unsplash images made available for research and machine learning
Stars: ✭ 1,805 (+1162.24%)
wotopWeb on top of any protocol
Stars: ✭ 118 (-17.48%)
Multi PlierAn unsupervised transfer learning approach for rare disease transcriptomics
Stars: ✭ 33 (-76.92%)
noddosNoddos client
Stars: ✭ 78 (-45.45%)
Stgcnimplementation of STGCN for traffic prediction in IJCAI2018
Stars: ✭ 87 (-39.16%)
storm-traffic使用Storm实时处理交通大数据(数据源:kafka,集群管理:zookeeper)
Stars: ✭ 34 (-76.22%)
Rstudioconf tweets🖥 A repository for tracking tweets about rstudio::conf
Stars: ✭ 32 (-77.62%)
Linux-adminShell scripts to automate download of GitHub traffic statistics, cluster administration, and create an animated GIF.
Stars: ✭ 23 (-83.92%)
Clue中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Stars: ✭ 2,425 (+1595.8%)
TriggernerTriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition (ACL 2020)
Stars: ✭ 141 (-1.4%)
ProsodyHelsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
Stars: ✭ 139 (-2.8%)
Perf Tools⏱→ 🚀A set of tools for improving performance your application (balancer, performance, PerfKeeper, LazyPromise).
Stars: ✭ 135 (-5.59%)
Dbg PdsDeutsche Boerse's Financial Trading Public Data Set
Stars: ✭ 124 (-13.29%)
Ua GecUA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
Stars: ✭ 108 (-24.48%)
Imdb FaceA new large-scale noise-controlled face recognition dataset.
Stars: ✭ 399 (+179.02%)