Knyfe - knyfe is a Python utility for rapid exploration of datasets.
Stars: ✭ 54 (-64.71%)
Open-korean-corpora - Open Korean NLP Dataset Curation for the Users All Around the Globe
Stars: ✭ 82 (-46.41%)
Covid 19 - Novel Coronavirus 2019 time series data on cases
Stars: ✭ 1,060 (+592.81%)
OTT-QA - Code and Data for ICLR2021 Paper "Open Question Answering over Tables and Text"
Stars: ✭ 92 (-39.87%)
covid19-data-greece - Datasets and analysis of Novel Coronavirus (COVID-19) outbreak in Greece
Stars: ✭ 16 (-89.54%)
Courseraforums - Anonymized versions of the discussion threads from the forums of 60 Coursera MOOCs
Stars: ✭ 50 (-67.32%)
user quality - Dataset for Software Evolution and Quality Improvement
Stars: ✭ 27 (-82.35%)
Lapa Dataset - A large-scale dataset for face parsing (AAAI2020)
Stars: ✭ 149 (-2.61%)
MaskedFaceRepresentation - Masked face recognition focuses on identifying people using their facial features while they are wearing masks. We introduce benchmarks on face verification based on masked face images for the development of COVID-safe protocols in airports.
Stars: ✭ 17 (-88.89%)
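Lantern - Official Lantern release downloads. Keywords: Lantern, firewall circumvention, proxy, open Internet access, overseas Internet, accelerator, ladder, router, lantern proxy vpn censorship-circumvention censorship gfw accelerator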
Stars: ✭ 10,238 (+6591.5%)
squad-v1.1-pt - Portuguese translation of the SQuAD dataset
Stars: ✭ 13 (-91.5%)
Objectron - Objectron is a dataset of short, object-centric video clips. The videos also contain AR session metadata, including camera poses, sparse point clouds, and planes. In each video, the camera moves around and above the object and captures it from different views. Each object is annotated with a 3D bounding box that describes the object’s position, orientation, and dimensions. The dataset contains about 15K annotated video clips and 4M annotated images in the following categories: bikes, books, bottles, cameras, cereal boxes, chairs, cups, laptops, and shoes.
Stars: ✭ 1,352 (+783.66%)
Audio-Classification-using-CNN-MLP - Multi-class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi-class classifier that identifies the sound of a bee, a cricket, or noise.
Stars: ✭ 36 (-76.47%)
snorkeling - Extracting biomedical relationships from literature with Snorkel 🏊
Stars: ✭ 56 (-63.4%)
Deepecg - ECG classification programs based on ML/DL methods
Stars: ✭ 124 (-18.95%)
Multidigitmnist - Combine multiple MNIST digits to create datasets with 100/1000 classes for few-shot learning/meta-learning
Stars: ✭ 48 (-68.63%)
HAR - Recognize one of six human activities such as standing, sitting, and walking using a Softmax Classifier trained on mobile phone sensor data.
Stars: ✭ 18 (-88.24%)
Cubicasa5k - CubiCasa5k floor plan dataset
Stars: ✭ 98 (-35.95%)
recurrent-defocus-deblurring-synth-dual-pixel - Reference github repository for the paper "Learning to Reduce Defocus Blur by Realistically Modeling Dual-Pixel Data". We propose a procedure to generate realistic DP data synthetically. Our synthesis approach mimics the optical image formation found on DP sensors and can be applied to virtual scenes rendered with standard computer software. Lev…
Stars: ✭ 30 (-80.39%)
Filternet Php - A simple utility to check whether the given url/domain is blocked in Iran.
Stars: ✭ 41 (-73.2%)
Dataspice - 🌶 Create lightweight schema.org descriptions of your datasets
Stars: ✭ 137 (-10.46%)
Covid Ctset - Large Covid-19 CT scans dataset from paper: https://doi.org/10.1101/2020.06.08.20121541
Stars: ✭ 40 (-73.86%)
SecurityHeaders GovUK - A scan of all .gov.uk sites for the most common security headers, or the lack of them
Stars: ✭ 14 (-90.85%)
Deepweeds - A Multiclass Weed Species Image Dataset for Deep Learning
Stars: ✭ 96 (-37.25%)
updns - Public Adfree DNS over HTTPS Server
Stars: ✭ 23 (-84.97%)
NouBan-js - Detects whether a text contains words that Douban treats as sensitive (JavaScript version). Nouban is an anti-censorship project aiming to record censored words in Douban, a Chinese social network platform. It is merely a glimpse of the situation in Chinese 'Innernet'.
Stars: ✭ 58 (-62.09%)
Ember Impagination - An Ember Addon that puts the fun back in asynchronous, paginated datasets
Stars: ✭ 123 (-19.61%)
kit-censura - Software used to censor the Internet in Italy
Stars: ✭ 22 (-85.62%)
Pts - Quantized Mesh Terrain Data Generator and Server for CesiumJS Library
Stars: ✭ 36 (-76.47%)
Bond - BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision
Stars: ✭ 96 (-37.25%)
russian-blackout - The RKN caused problems all over the Russian Internet. This is a list of services that suffered from RKN blocking activity.
Stars: ✭ 18 (-88.24%)
Dataconfs - A list of conferences connected with data worldwide.
Stars: ✭ 36 (-76.47%)
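Chinese Names Corpus - Chinese personal names corpus and name generator. Covers Chinese full names, surnames, given names, forms of address, Japanese names, transliterated names, and English names. Usable for Chinese word segmentation and person-name entity recognition.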
Stars: ✭ 3,053 (+1895.42%)
Baidutraffic - This repo includes introduction, code and dataset of our paper Deep Sequence Learning with Auxiliary Information for Traffic Prediction (KDD 2018).
Stars: ✭ 143 (-6.54%)
Cities.json - Cities of the world in JSON, based on GeoNames Gazetteer
Stars: ✭ 251 (+64.05%)
Multi Plier - An unsupervised transfer learning approach for rare disease transcriptomics
Stars: ✭ 33 (-78.43%)
Recommendersystem Dataset - This repository contains some datasets that I have collected for recommender systems.
Stars: ✭ 249 (+62.75%)
Persian Swear Words - Dataset of inappropriate and offensive Persian words for filtering text
Stars: ✭ 95 (-37.91%)
Taco - 🌮 Trash Annotations in Context Dataset Toolkit
Stars: ✭ 243 (+58.82%)
Rstudioconf tweets - 🖥 A repository for tracking tweets about rstudio::conf
Stars: ✭ 32 (-79.08%)
Chazutsu - The tool to make NLP datasets ready to use
Stars: ✭ 238 (+55.56%)
Onepiece Kg - A knowledge graph project for ONEPIECE (One Piece knowledge graph)
Stars: ✭ 123 (-19.61%)
Maskedface Net - MaskedFace-Net is a dataset of human faces with a correctly and incorrectly worn mask based on the dataset Flickr-Faces-HQ (FFHQ).
Stars: ✭ 152 (-0.65%)
Netcdf Fortran - Official GitHub repository for netCDF-Fortran libraries, which depend on the netCDF C library. Install the netCDF C library first.
Stars: ✭ 141 (-7.84%)
Hake - HAKE: Human Activity Knowledge Engine (CVPR'18/19/20, NeurIPS'20)
Stars: ✭ 132 (-13.73%)
Toronto 3d - A Large-scale Mobile LiDAR Dataset for Semantic Segmentation of Urban Roadways
Stars: ✭ 69 (-54.9%)
Cmu Multimodalsdk - CMU MultimodalSDK is a machine learning platform for development of advanced multimodal models as well as easily accessing and processing multimodal datasets.
Stars: ✭ 388 (+153.59%)