ml-datasets🌊 Machine learning dataset loaders for testing and example scripts
Stars: ✭ 40 (+207.69%)
RetrieverQuickly download, clean up, and install public datasets into a database management system
Stars: ✭ 241 (+1753.85%)
IndonluThe first-ever vast natural language processing benchmark for the Indonesian language. We provide multiple downstream tasks, pre-trained IndoBERT models, and starter code! (AACL-IJCNLP 2020)
Stars: ✭ 198 (+1423.08%)
Datasets for MLList of datasets for various computer vision tasks
Stars: ✭ 16 (+23.08%)
data.world-pyPython package for data.world
Stars: ✭ 98 (+653.85%)
Zr ObpOpen Bandit Pipeline: a Python library for bandit algorithms and off-policy evaluation
Stars: ✭ 219 (+1584.62%)
mlxMachine Learning eXchange (MLX). Data and AI Assets Catalog and Execution Engine
Stars: ✭ 132 (+915.38%)
thermostatCollection of NLP model explanations and accompanying analysis tools
Stars: ✭ 126 (+869.23%)
3d PointcloudPapers and Datasets about Point Cloud.
Stars: ✭ 179 (+1276.92%)
Robotcar Dataset SdkSoftware Development Kit for the Oxford Robotcar Dataset
Stars: ✭ 151 (+1061.54%)
spoken-command-recognitionA large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable word recognition
Stars: ✭ 59 (+353.85%)
DatasetsTFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
Stars: ✭ 3,094 (+23700%)
Machine Learning ResourcesA curated list of awesome machine learning frameworks, libraries, courses, books and many more.
Stars: ✭ 226 (+1638.46%)
Ner DatasetsDatasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)
Stars: ✭ 220 (+1592.31%)
torchgeoTorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
Stars: ✭ 1,125 (+8553.85%)
Nlp datasetsMy NLP datasets for Russian language
Stars: ✭ 198 (+1423.08%)
scrapeOPA Python package for scraping oddsportal.com
Stars: ✭ 99 (+661.54%)
CorusLinks to Russian corpora + Python functions for loading and parsing
Stars: ✭ 154 (+1084.62%)
dagpiDagpi is a powerful and fast API that performs image manipulation and serves datasets. It is written in Rust and Python. Perfect for Discord bots, social media apps, camera apps, and more.
Stars: ✭ 25 (+92.31%)
awesome-dynamic-graphsA collection of resources on dynamic/streaming/temporal/evolving graph processing systems, databases, data structures, datasets, and related academic and industrial work
Stars: ✭ 89 (+584.62%)
PinsPin, Discover and Share Resources
Stars: ✭ 149 (+1046.15%)
Pix2codepix2code: Generating Code from a Graphical User Interface Screenshot
Stars: ✭ 11,349 (+87200%)
humanflow2Official repository of Learning Multi-Human Optical Flow (IJCV 2019)
Stars: ✭ 37 (+184.62%)
COVID-NetLaunched in March 2020 in response to the coronavirus disease 2019 (COVID-19) pandemic, COVID-Net is a global open source, open access initiative dedicated to accelerating advancement in machine learning to aid front-line healthcare workers and clinical institutions around the world fighting the continuing pandemic. Towards this goal, our global…
Stars: ✭ 41 (+215.38%)
geodaDataData package for accessing GeoDa datasets using R
Stars: ✭ 15 (+15.38%)
bugrepoA collection of publicly available bug reports
Stars: ✭ 93 (+615.38%)
Datasetssource{d} datasets ("big code") for source code analysis and machine learning on source code
Stars: ✭ 231 (+1676.92%)
delitos-caba🚓 Crime dataset for the City of Buenos Aires, Argentina
Stars: ✭ 44 (+238.46%)
biomechanics datasetInformation on publicly available data sets for biomechanics.
Stars: ✭ 31 (+138.46%)
Aidl kbA Knowledge Base for the FB Group Artificial Intelligence and Deep Learning (AIDL)
Stars: ✭ 219 (+1584.62%)
industrial-ml-datasetsA curated list of publicly available datasets for machine learning research in the area of manufacturing
Stars: ✭ 45 (+246.15%)
MolaA Modular Optimization framework for Localization and mApping (MOLA)
Stars: ✭ 206 (+1484.62%)
Awesome Json DatasetsA curated list of awesome JSON datasets that don't require authentication.
Stars: ✭ 2,421 (+18523.08%)
git-rdmA research data management plugin for the Git version control system.
Stars: ✭ 34 (+161.54%)
DatasaurusR Package 📦 Containing the Datasaurus Dozen datasets 📊
Stars: ✭ 193 (+1384.62%)
CHRSIXray: A Large-scale Security Inspection X-ray Benchmark in CVPR 2019
Stars: ✭ 78 (+500%)
Unify Emotion DatasetsA Survey and Experiments on Annotated Corpora for Emotion Classification in Text
Stars: ✭ 169 (+1200%)
morghulisNo description or website provided.
Stars: ✭ 18 (+38.46%)
Awesome Nlp PolishA curated list of resources dedicated to Natural Language Processing (NLP) in Polish. Models, tools, datasets.
Stars: ✭ 153 (+1076.92%)
DiscEvalDiscourse Based Evaluation of Language Understanding
Stars: ✭ 18 (+38.46%)
IdenprofIdenProf dataset is a collection of images of identifiable professionals. It has been collected to enable the development of AI systems that can identify people and the nature of their job simply by looking at an image, just as humans can.
Stars: ✭ 149 (+1046.15%)
Gekko DatasetsGekko Trading Bot dataset dumps. Ready to use and download history files in SQLite format.
Stars: ✭ 146 (+1023.08%)
dh-coreFunctional data science
Stars: ✭ 123 (+846.15%)
Remo Python🐰 Python lib for remo - the app for annotation and image management in computer vision
Stars: ✭ 138 (+961.54%)
isarn-sketches-sparkRoutines and data structures for using isarn-sketches idiomatically in Apache Spark
Stars: ✭ 28 (+115.38%)
big-data-exploration[Archive] Intern project - Big Data Exploration using MongoDB - This Repository is NOT a supported MongoDB product
Stars: ✭ 43 (+230.77%)
rs datasetsTool for autodownloading recommendation systems datasets
Stars: ✭ 22 (+69.23%)
metadatMeta-analytic datasets for R
Stars: ✭ 21 (+61.54%)