cifair
A duplicate-free variant of the CIFAR test set.
Stars: ✭ 13 (-69.05%)
parlitools
A collection of useful tools for UK politics
Stars: ✭ 22 (-47.62%)
NLP PEMDC
NLP Pretrained Embeddings, Models and Datasets Collections (NLP_PEMDC). The collection will keep updating.
Stars: ✭ 58 (+38.1%)
humanflow2
Official repository of Learning Multi-Human Optical Flow (IJCV 2019)
Stars: ✭ 37 (-11.9%)
awesome-forests
🌳 A curated list of ground-truth forest datasets for the machine learning and forestry community.
Stars: ✭ 111 (+164.29%)
bugrepo
A collection of publicly available bug reports
Stars: ✭ 93 (+121.43%)
Three-Filters-to-Normal
Three-Filters-to-Normal: An Accurate and Ultrafast Surface Normal Estimator (RAL+ICRA'21)
Stars: ✭ 41 (-2.38%)
CHR
SIXray: A Large-scale Security Inspection X-ray Benchmark in CVPR 2019
Stars: ✭ 78 (+85.71%)
ake-datasets
Large, curated set of benchmark datasets for evaluating automatic keyphrase extraction algorithms.
Stars: ✭ 125 (+197.62%)
datumaro
Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage Computer Vision datasets.
Stars: ✭ 274 (+552.38%)
delitos-caba
🚓 Crime dataset for the City of Buenos Aires, Argentina
Stars: ✭ 44 (+4.76%)
akshare
AKShare is an elegant and simple financial data interface library for Python, built for human beings! An open-source financial data interface library.
Stars: ✭ 5,155 (+12173.81%)
Spatio-Temporal-papers
This project is a collection of recent research in areas such as new infrastructure and urban computing, including white papers, academic papers, AI labs, and datasets.
Stars: ✭ 180 (+328.57%)
Text-Summarization-Repo
A repository that organizes the main research topics, must-read papers, and available models and data in the field of text summarization, together with recommended resources.
Stars: ✭ 213 (+407.14%)
dataset
dataset is a command line tool, Go package, shared library and Python package for working with JSON objects as collections
Stars: ✭ 21 (-50%)
DiscEval
Discourse Based Evaluation of Language Understanding
Stars: ✭ 18 (-57.14%)
PharmacoDB
Search across publicly available datasets to find instances where a drug or cell line of interest has been profiled.
Stars: ✭ 38 (-9.52%)
biomechanics dataset
Information on publicly available datasets for biomechanics.
Stars: ✭ 31 (-26.19%)
newt
Natural World Tasks
Stars: ✭ 24 (-42.86%)
dh-core
Functional data science
Stars: ✭ 123 (+192.86%)
masader
The largest public catalogue for Arabic NLP and speech datasets. There are 250+ datasets annotated with more than 25 attributes.
Stars: ✭ 66 (+57.14%)
geodaData
Data package for accessing GeoDa datasets using R
Stars: ✭ 15 (-64.29%)
spectrochempy
SpectroChemPy is a framework for processing, analyzing and modeling spectroscopic data for chemistry with Python
Stars: ✭ 34 (-19.05%)
dw-jdbc
JDBC driver for data.world
Stars: ✭ 17 (-59.52%)
covid19-datasets
A list of high-quality open datasets for COVID-19 data analysis
Stars: ✭ 56 (+33.33%)
bumblebee
🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
Stars: ✭ 120 (+185.71%)
HINT3
This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild", accepted at EMNLP-2020's Insights Workshop (https://insights-workshop.github.io/). A preprint of the paper is available at https://arxiv.org/abs/2009.13833
Stars: ✭ 27 (-35.71%)
mindsdb-examples
Examples of MindsDB usage (https://www.mindsdb.com/)
Stars: ✭ 25 (-40.48%)
Few-Shot-Intent-Detection
Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines and results.
Stars: ✭ 63 (+50%)
kaggledatasets
Collection of Kaggle datasets ready to use for everyone (looking for contributors)
Stars: ✭ 44 (+4.76%)
AIODrive
Official Python/PyTorch Implementation for "All-In-One Drive: A Large-Scale Comprehensive Perception Dataset with High-Density Long-Range Point Clouds"
Stars: ✭ 32 (-23.81%)
big-data-exploration
[Archive] Intern project - Big Data Exploration using MongoDB - This Repository is NOT a supported MongoDB product
Stars: ✭ 43 (+2.38%)
ml-datasets
🌊 Machine learning dataset loaders for testing and example scripts
Stars: ✭ 40 (-4.76%)
rs datasets
Tool for automatically downloading recommendation system datasets
Stars: ✭ 22 (-47.62%)
let-it-be
A dataset on the mental health status of China's higher-education population
Stars: ✭ 28 (-33.33%)
torchgeo
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
Stars: ✭ 1,125 (+2578.57%)
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Stars: ✭ 13,870 (+32923.81%)
Dataset-Sentimen-Analisis-Bahasa-Indonesia
This repository is a collection of datasets for Indonesian-language sentiment analysis. If you use the datasets in this repository for research, please cite the journal article associated with each dataset. The available datasets have been used in several studies, and the results have been published…
Stars: ✭ 38 (-9.52%)
mlx
Machine Learning eXchange (MLX). Data and AI Assets Catalog and Execution Engine
Stars: ✭ 132 (+214.29%)
farabio
🤖 PyTorch toolkit for biomedical imaging ❤️
Stars: ✭ 48 (+14.29%)
dagpi
Dagpi is a powerful and fast API that performs image manipulation and serves datasets. It is written in Rust and Python. Perfect for Discord bots, social media apps, camera apps and more.
Stars: ✭ 25 (-40.48%)
metadat
Meta-analytic datasets for R
Stars: ✭ 21 (-50%)
traj-pred-irl
Official implementation of "Regularizing neural networks for future trajectory prediction via IRL framework"
Stars: ✭ 23 (-45.24%)
datasets
The primary repository for all of the CORGIS Datasets
Stars: ✭ 19 (-54.76%)
data.world-py
Python package for data.world
Stars: ✭ 98 (+133.33%)
11K-Hands
Two-stream CNN for gender classification and biometric identification using a dataset of 11K hand images.
Stars: ✭ 44 (+4.76%)
json2python-models
Generate Python model classes (pydantic, attrs, dataclasses) based on JSON datasets with typing module support
Stars: ✭ 119 (+183.33%)
systematic-review-datasets
A collection of fully labeled systematic review datasets (title-abstract screening)
Stars: ✭ 25 (-40.48%)
allie
🤖 A machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers).
Stars: ✭ 93 (+121.43%)
napkinXC
Extremely simple and fast extreme multi-class and multi-label classifiers.
Stars: ✭ 38 (-9.52%)
Clustering-Datasets
This repository contains a collection of UCI (real-life) datasets and synthetic (artificial) datasets (with cluster labels and MATLAB files) ready to use with clustering algorithms.
Stars: ✭ 189 (+350%)
multi-task-defocus-deblurring-dual-pixel-nimat
Reference GitHub repository for the paper "Improving Single-Image Defocus Deblurring: How Dual-Pixel Images Help Through Multi-Task Learning". We propose a single-image deblurring network that incorporates the two sub-aperture views into a multitask framework. Specifically, we show that jointly learning to predict the two DP views from a single …
Stars: ✭ 29 (-30.95%)