Retriever
Quickly download, clean up, and install public datasets into a database management system
Stars: ✭ 241 (+197.53%)
Wb srgb
White balance camera-rendered sRGB images (CVPR 2019) [Matlab & Python]
Stars: ✭ 101 (+24.69%)
Persian Swear Words
A dataset of inappropriate and offensive Persian words for filtering text
Stars: ✭ 95 (+17.28%)
recurrent-defocus-deblurring-synth-dual-pixel
Reference GitHub repository for the paper "Learning to Reduce Defocus Blur by Realistically Modeling Dual-Pixel Data". We propose a procedure to generate realistic DP data synthetically. Our synthesis approach mimics the optical image formation found on DP sensors and can be applied to virtual scenes rendered with standard computer software. Lev…
Stars: ✭ 30 (-62.96%)
Meglass
An eyeglass face dataset collected and cleaned for face recognition evaluation, CCBR 2018.
Stars: ✭ 281 (+246.91%)
Awesome Json Datasets
A curated list of awesome JSON datasets that don't require authentication.
Stars: ✭ 2,421 (+2888.89%)
Chatito
🎯🗯 Generate datasets for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL!
Stars: ✭ 678 (+737.04%)
Datasets
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
Stars: ✭ 3,094 (+3719.75%)
Hub
Dataset format for AI. Build, manage, & visualize datasets for deep learning. Stream data real-time to PyTorch/TensorFlow & version-control it. https://activeloop.ai
Stars: ✭ 4,003 (+4841.98%)
Colour
Colour Science for Python
Stars: ✭ 1,131 (+1296.3%)
HINT3
This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild", accepted at EMNLP-2020's Insights Workshop (https://insights-workshop.github.io/). A preprint of the paper is available at https://arxiv.org/abs/2009.13833
Stars: ✭ 27 (-66.67%)
Clue
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Stars: ✭ 2,425 (+2893.83%)
Exposure correction
Reference code for the paper "Learning Multi-Scale Photo Exposure Correction", CVPR 2021.
Stars: ✭ 98 (+20.99%)
Voice datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).
Stars: ✭ 494 (+509.88%)
Crd3
The repo containing the Critical Role Dungeons and Dragons Dataset.
Stars: ✭ 83 (+2.47%)
Indonlu
The first-ever vast natural language processing benchmark for the Indonesian language. We provide multiple downstream tasks, pre-trained IndoBERT models, and starter code! (AACL-IJCNLP 2020)
Stars: ✭ 198 (+144.44%)
Datasets
source{d} datasets ("big code") for source code analysis and machine learning on source code
Stars: ✭ 231 (+185.19%)
dbcollection
A collection of popular datasets for deep learning.
Stars: ✭ 26 (-67.9%)
Wisty.js
🧚♀️ Chatbot library turning conversations into actions, locally, in the browser.
Stars: ✭ 24 (-70.37%)
Label Studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Stars: ✭ 7,264 (+8867.9%)
Openml R
R package to interface with OpenML
Stars: ✭ 81 (+0%)
Doccano
Open source annotation tool for machine learning practitioners.
Stars: ✭ 5,600 (+6813.58%)
Aesthetics
Image Aesthetics Toolkit - includes Fisher Vector implementation, AVA (Image Aesthetic Visual Analysis) dataset and fast multi-threaded downloader
Stars: ✭ 113 (+39.51%)
Oie Resources
A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Stars: ✭ 283 (+249.38%)
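Chatbot cn
A chatbot for the finance and legal domains (with some chit-chat capability). Its main modules include information extraction, NLU, NLG, and a knowledge graph. The front-end display is integrated using Django, and RESTful interfaces for the NLP and KG components have been wrapped.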
Stars: ✭ 791 (+876.54%)
Letsgodataset
This repository makes the integral Let's Go dataset publicly available.
Stars: ✭ 41 (-49.38%)
Wikipedia ner
📖 Labeled examples from wiki dumps in Python
Stars: ✭ 61 (-24.69%)
Mmsa
CH-SIMS: A Chinese Multimodal Sentiment Analysis Dataset with Fine-grained Annotations of Modality (ACL 2020)
Stars: ✭ 70 (-13.58%)
Pysgs
📈 Python interface for the Brazilian Central Bank's Time Series Management System (SGS)
Stars: ✭ 60 (-25.93%)
La3dm
Learning-aided 3D mapping
Stars: ✭ 77 (-4.94%)
Raccoon dataset
The dataset is used to train my own raccoon detector and I blogged about it on Medium
Stars: ✭ 1,177 (+1353.09%)
Dream
DREAM: A Challenge Dataset and Models for Dialogue-Based Reading Comprehension
Stars: ✭ 60 (-25.93%)
Botsharp
The Open Source AI Chatbot Platform Builder in 100% C#, running in .NET Core with machine learning algorithms.
Stars: ✭ 1,103 (+1261.73%)
Covid19
JSON time-series of coronavirus cases (confirmed, deaths and recovered) per country - updated daily
Stars: ✭ 1,177 (+1353.09%)
Maskrcnn Modanet
A Mask R-CNN Keras implementation with Modanet annotations on the Paperdoll dataset
Stars: ✭ 59 (-27.16%)
Char Rnn Tensorflow
Multi-layer Recurrent Neural Networks for character-level language models, implemented in TensorFlow
Stars: ✭ 58 (-28.4%)
Dialogue Understanding
This repository contains a PyTorch implementation of the baseline models from the paper "Utterance-level Dialogue Understanding: An Empirical Study"
Stars: ✭ 77 (-4.94%)
Csvpack
csvpack library / gem - tools 'n' scripts for working with tabular data packages using comma-separated values (CSV) datafiles in text with meta info (that is, schema, datatypes, ..) in datapackage.json. Download, read into, and query CSV datafiles with your SQL database of choice (e.g. SQLite, PostgreSQL, ...), and much more
Stars: ✭ 71 (-12.35%)
Stevens Vlp16 Dataset
This dataset was captured using a Velodyne VLP-16 mounted on a UGV (Clearpath Jackal) on the Stevens Institute of Technology campus
Stars: ✭ 58 (-28.4%)
Geodata Br
Free open public domain geographic data of Brazil available in multiple languages and formats.
Stars: ✭ 57 (-29.63%)
Cherubnlp
Natural Language Processing in .NET Core
Stars: ✭ 71 (-12.35%)
Animegan
A simple PyTorch implementation of Generative Adversarial Networks, focusing on anime face drawing.
Stars: ✭ 1,095 (+1251.85%)
View Finding Network
A deep ranking network that learns to find good compositions in a photograph.
Stars: ✭ 57 (-29.63%)
Facegrab
A tool to collect public images from Facebook and create an image dataset for training computer vision applications such as gender recognition and face detection
Stars: ✭ 76 (-6.17%)
Toronto 3d
A Large-scale Mobile LiDAR Dataset for Semantic Segmentation of Urban Roadways
Stars: ✭ 69 (-14.81%)
Easynlu
Simple embedded NLU for mobile apps
Stars: ✭ 57 (-29.63%)
Deep Segmentation
CNNs for semantic segmentation using the Keras library
Stars: ✭ 69 (-14.81%)
Pointclouddatasets
3D point cloud datasets in HDF5 format, containing 2048 uniformly sampled points per shape.
Stars: ✭ 80 (-1.23%)