Label Studio - Label Studio is a multi-type data labeling and annotation tool with a standardized output format
Stars: ✭ 7,264 (+14724.49%)
SER-datasets - A collection of datasets for the purpose of emotion recognition/detection in speech.
Stars: ✭ 74 (+51.02%)
Dataframes.jl - In-memory tabular data in Julia
Stars: ✭ 951 (+1840.82%)
kaggle-code - A repository for some of the code I used in Kaggle data science & machine learning tasks.
Stars: ✭ 100 (+104.08%)
Animal Matting - GitHub repository for the paper End-to-end Animal Image Matting
Stars: ✭ 363 (+640.82%)
PharmacoGx - R package to analyze large-scale pharmacogenomic datasets.
Stars: ✭ 42 (-14.29%)
Chakin - Simple downloader for pre-trained word vectors
Stars: ✭ 323 (+559.18%)
AIODrive - Official Python/PyTorch Implementation for "All-In-One Drive: A Large-Scale Comprehensive Perception Dataset with High-Density Long-Range Point Clouds"
Stars: ✭ 32 (-34.69%)
traj-pred-irl - Official implementation code for "Regularizing neural networks for future trajectory prediction via IRL framework"
Stars: ✭ 23 (-53.06%)
Datasets - Machine learning datasets used in tutorials on MachineLearningMastery.com
Stars: ✭ 536 (+993.88%)
HINT3 - This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild", accepted at EMNLP-2020's Insights Workshop (https://insights-workshop.github.io/). A preprint of the paper is available at https://arxiv.org/abs/2009.13833
Stars: ✭ 27 (-44.9%)
Cleora - Cleora AI is a general-purpose model for efficient, scalable learning of stable and inductive entity embeddings for heterogeneous relational data.
Stars: ✭ 303 (+518.37%)
let-it-be - A dataset on the mental health status of China's higher-education population
Stars: ✭ 28 (-42.86%)
Entity Recognition Datasets - A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
Stars: ✭ 891 (+1718.37%)
farabio - 🤖 PyTorch toolkit for biomedical imaging ❤️
Stars: ✭ 48 (-2.04%)
Tdc - Therapeutics Data Commons: Machine Learning Datasets and Tasks for Therapeutics
Stars: ✭ 291 (+493.88%)
11K-Hands - Two-stream CNN for gender classification and biometric identification using a dataset of 11K hand images.
Stars: ✭ 44 (-10.2%)
covid19-datasets - A list of high-quality open datasets for COVID-19 data analysis
Stars: ✭ 56 (+14.29%)
Meglass - An eyeglass face dataset collected and cleaned for face recognition evaluation, CCBR 2018.
Stars: ✭ 281 (+473.47%)
Few-Shot-Intent-Detection - Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines and results.
Stars: ✭ 63 (+28.57%)
Pytorch Cpp - C++ Implementation of PyTorch Tutorials for Everyone
Stars: ✭ 1,014 (+1969.39%)
ml-datasets - 🌊 Machine learning dataset loaders for testing and example scripts
Stars: ✭ 40 (-18.37%)
Hub - Dataset format for AI. Build, manage, & visualize datasets for deep learning. Stream data in real time to PyTorch/TensorFlow & version-control it. https://activeloop.ai
Stars: ✭ 4,003 (+8069.39%)
datasets - 🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Stars: ✭ 13,870 (+28206.12%)
Dataset-Sentimen-Analisis-Bahasa-Indonesia - This repository is a collection of datasets for Indonesian-language sentiment analysis. If you use the datasets in this repository for research, please cite the journal article associated with each dataset. The available datasets have been used in several studies, and the results have been published…
Stars: ✭ 38 (-22.45%)
dbcollection - A collection of popular datasets for deep learning.
Stars: ✭ 26 (-46.94%)
Ogb - Benchmark datasets, data loaders, and evaluators for graph machine learning
Stars: ✭ 799 (+1530.61%)
datasets - The primary repository for all of the CORGIS Datasets
Stars: ✭ 19 (-61.22%)
datasets - TFDS data loaders for sign language datasets.
Stars: ✭ 17 (-65.31%)
json2python-models - Generate Python model classes (pydantic, attrs, dataclasses) based on JSON datasets with typing module support
Stars: ✭ 119 (+142.86%)
Openml - Open Machine Learning
Stars: ✭ 489 (+897.96%)
dw-jdbc - JDBC driver for data.world
Stars: ✭ 17 (-65.31%)
covid-19-data-cleanup - Scripts to clean up data from https://github.com/CSSEGISandData/COVID-19
Stars: ✭ 25 (-48.98%)
bumblebee - 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
Stars: ✭ 120 (+144.9%)
Healthcheck - Health Check ✔ is a machine learning web application built with Flask that predicts three main diseases: diabetes, heart disease, and cancer.
Stars: ✭ 35 (-28.57%)
mindsdb-examples - Usage examples for MindsDB https://www.mindsdb.com/
Stars: ✭ 25 (-48.98%)
kaggledatasets - Collection of Kaggle datasets ready to use for everyone (looking for contributors)
Stars: ✭ 44 (-10.2%)
big-data-exploration - [Archive] Intern project - Big Data Exploration using MongoDB - This Repository is NOT a supported MongoDB product
Stars: ✭ 43 (-12.24%)
podium - Podium: a framework-agnostic Python NLP library for data loading and preprocessing
Stars: ✭ 55 (+12.24%)
rs datasets - Tool for automatically downloading recommender system datasets
Stars: ✭ 22 (-55.1%)
Awesome Transit - Community list of transit APIs, apps, datasets, research, and software 🚌🌟🚋🌟🚂
Stars: ✭ 713 (+1355.1%)
TSForecasting - This repository contains the implementations of experiments on a set of publicly available datasets used in time series forecasting research.
Stars: ✭ 53 (+8.16%)
Awesome Earth Artificial Intelligence - A curated list of Earth Science Artificial Intelligence (AI) tutorials, notebooks, software, datasets, courses, books, video lectures and papers. Contributions are most welcome.
Stars: ✭ 44 (-10.2%)
Commons - ⛲️ Commons Marketplace client & server to explore, download, and publish open data sets in the Ocean Protocol Network.
Stars: ✭ 34 (-30.61%)
Easypr - An easy, flexible, and accurate plate recognition project for Chinese licenses in unconstrained situations.
Stars: ✭ 6,046 (+12238.78%)
ml4se - A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering
Stars: ✭ 46 (-6.12%)