CrossweighCrossWeigh: Training Named Entity Tagger from Imperfect Annotations
Stars: ✭ 91 (-57.67%)
IdenprofIdenProf dataset is a collection of images of identifiable professionals. It is been collected to enable the development of AI systems that can serve by identifying people and the nature of their job by simply looking at an image, just like humans can do.
Stars: ✭ 149 (-30.7%)
FirstcoursenetworkscienceTutorials, datasets, and other material associated with textbook "A First Course in Network Science" by Menczer, Fortunato & Davis
Stars: ✭ 111 (-48.37%)
Unify Emotion DatasetsA Survey and Experiments on Annotated Corpora for Emotion Classification in Text
Stars: ✭ 169 (-21.4%)
Exposure correctionReference code for the paper "Learning Multi-Scale Photo Exposure Correction", CVPR 2021.
Stars: ✭ 98 (-54.42%)
Aidl kbA Knowledge Base for the FB Group Artificial Intelligence and Deep Learning (AIDL)
Stars: ✭ 219 (+1.86%)
Atis datasetThe ATIS (Airline Travel Information System) Dataset
Stars: ✭ 81 (-62.33%)
Remo Python🐰 Python lib for remo - the app for annotations and images management in Computer Vision
Stars: ✭ 138 (-35.81%)
DatasaurusR Package 📦 Containing the Datasaurus Dozen datasets 📊
Stars: ✭ 193 (-10.23%)
CodesearchnetDatasets, tools, and benchmarks for representation learning of code.
Stars: ✭ 1,378 (+540.93%)
Awesome Nlp PolishA curated list of resources dedicated to Natural Language Processing (NLP) in polish. Models, tools, datasets.
Stars: ✭ 153 (-28.84%)
Persian Swear Wordsدیتاست کلمات نامناسب و بد فارسی برای فیلتر کردن متن ها
Stars: ✭ 95 (-55.81%)
Openml RR package to interface with OpenML
Stars: ✭ 81 (-62.33%)
Gekko DatasetsGekko Trading Bot dataset dumps. Ready to use and download history files in SQLite format.
Stars: ✭ 146 (-32.09%)
Coco Annotator✏️ Web-based image segmentation tool for object detection, localization, and keypoints
Stars: ✭ 1,138 (+429.3%)
MolaA Modular Optimization framework for Localization and mApping (MOLA)
Stars: ✭ 206 (-4.19%)
Awesome Earth Artificial IntelligenceA curated list of Earth Science's Artificial Intelligence (AI) tutorials, notebooks, software, datasets, courses, books, video lectures and papers. Contributions most welcome.
Stars: ✭ 44 (-79.53%)
Bird Recognition ReviewA list of useful resources in the bird sound (song and calls) recognition, such as datasets, papers, links to open source projects and competitions
Stars: ✭ 116 (-46.05%)
Nlp datasetsMy NLP datasets for Russian language
Stars: ✭ 198 (-7.91%)
AestheticsImage Aesthetics Toolkit - includes Fisher Vector implementation, AVA (Image Aesthetic Visual Analysis) dataset and fast multi-threaded downloader
Stars: ✭ 113 (-47.44%)
Machine Learning ResourcesA curated list of awesome machine learning frameworks, libraries, courses, books and many more.
Stars: ✭ 226 (+5.12%)
CholeraR Package for Analyzing John Snow's 1854 Cholera Map
Stars: ✭ 110 (-48.84%)
3d PointcloudPapers and Datasets about Point Cloud.
Stars: ✭ 179 (-16.74%)
ChineseglueLanguage Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard
Stars: ✭ 1,548 (+620%)
DatasetsTFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
Stars: ✭ 3,094 (+1339.07%)
Wb srgbWhite balance camera-rendered sRGB images (CVPR 2019) [Matlab & Python]
Stars: ✭ 101 (-53.02%)
CorusLinks to Russian corpora + Python functions for loading and parsing
Stars: ✭ 154 (-28.37%)
Transitland DatastoreTransitland's centralized web service API for both querying and editing aggregated transit data from around the world
Stars: ✭ 101 (-53.02%)
Zr ObpOpen Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation
Stars: ✭ 219 (+1.86%)
Doppelganger[IMC 2020 (Best Paper Finalist)] Using GANs for Sharing Networked Time Series Data: Challenges, Initial Promise, and Open Questions
Stars: ✭ 97 (-54.88%)
Datasets for MLDatasets list for various computer vision tasks
Stars: ✭ 16 (-92.56%)
DareblopyData Reading Blocks for Python
Stars: ✭ 82 (-61.86%)
PinsPin, Discover and Share Resources
Stars: ✭ 149 (-30.7%)
Gopup数据接口:百度、谷歌、头条、微博指数,宏观数据,利率数据,货币汇率,千里马、独角兽公司,新闻联播文字稿,影视票房数据,高校名单,疫情数据…
Stars: ✭ 1,229 (+471.63%)
Ner DatasetsDatasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)
Stars: ✭ 220 (+2.33%)
Pix2codepix2code: Generating Code from a Graphical User Interface Screenshot
Stars: ✭ 11,349 (+5178.6%)
ColourColour Science for Python
Stars: ✭ 1,131 (+426.05%)
RetrieverQuickly download, clean up, and install public datasets into a database management system
Stars: ✭ 241 (+12.09%)
PersonasDatasets for Deep learning Personas
Stars: ✭ 49 (-77.21%)
Pytorch CppC++ Implementation of PyTorch Tutorials for Everyone
Stars: ✭ 1,014 (+371.63%)
IndonluThe first-ever vast natural language processing benchmark for Indonesian Language. We provide multiple downstream tasks, pre-trained IndoBERT models, and a starter code! (AACL-IJCNLP 2020)
Stars: ✭ 198 (-7.91%)
PipedreamConnect APIs, remarkably fast. Free for developers.
Stars: ✭ 2,068 (+861.86%)
COVID-NetLaunched in March 2020 in response to the coronavirus disease 2019 (COVID-19) pandemic, COVID-Net is a global open source, open access initiative dedicated to accelerating advancement in machine learning to aid front-line healthcare workers and clinical institutions around the world fighting the continuing pandemic. Towards this goal, our global…
Stars: ✭ 41 (-80.93%)
Datasetssource{d} datasets ("big code") for source code analysis and machine learning on source code
Stars: ✭ 231 (+7.44%)
Awesome Json DatasetsA curated list of awesome JSON datasets that don't require authentication.
Stars: ✭ 2,421 (+1026.05%)
Multi object datasetsMulti-object image datasets with ground-truth segmentation masks and generative factors.
Stars: ✭ 121 (-43.72%)