Datasets For Recommender SystemsThis is a repository of a topic-centric public data sources in high quality for Recommender Systems (RS)
Stars: ✭ 564 (+1181.82%)
PaperrobotCode for PaperRobot: Incremental Draft Generation of Scientific Ideas
Stars: ✭ 372 (+745.45%)
panoptic partsThis repository contains code and tools for reading, processing, evaluating on, and visualizing Panoptic Parts datasets. Moreover, it contains code for reproducing our CVPR 2021 paper results.
Stars: ✭ 82 (+86.36%)
PydatasetInstant access to many datasets in Python.
Stars: ✭ 880 (+1900%)
awesome-sweden-datasetsA curated list of awesome datasets to use when coding for the Swedish market.
Stars: ✭ 17 (-61.36%)
Dr.sure🏫DeepLearning学习笔记以及Tensorflow、Pytorch的使用心得笔记。Dr. Sure会不定时往项目中添加他看到的最新的技术,欢迎批评指正。
Stars: ✭ 365 (+729.55%)
LoghubA large collection of system log datasets for AI-powered log analytics
Stars: ✭ 551 (+1152.27%)
systematic-review-datasetsA collection of fully labeled systematic review datasets (title-abstract screening)
Stars: ✭ 25 (-43.18%)
AkshareAKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Stars: ✭ 4,334 (+9750%)
allie🤖 A machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers).
Stars: ✭ 93 (+111.36%)
PharmacoDBSearch across publicly available datasets to find instances where a drug or cell line of interest has been profiled.
Stars: ✭ 38 (-13.64%)
Medical Datasetstracking medical datasets, with a focus on medical imaging
Stars: ✭ 296 (+572.73%)
napkinXCExtremely simple and fast extreme multi-class and multi-label classifiers.
Stars: ✭ 38 (-13.64%)
DatasetteAn open source multi-tool for exploring and publishing data
Stars: ✭ 5,640 (+12718.18%)
Text-Summarization-Repo텍스트 요약 분야의 주요 연구 주제, Must-read Papers, 이용 가능한 model 및 data 등을 추천 자료와 함께 정리한 저장소입니다.
Stars: ✭ 213 (+384.09%)
Open3d MlAn extension of Open3D to address 3D Machine Learning tasks
Stars: ✭ 284 (+545.45%)
Three-Filters-to-NormalThree-Filters-to-Normal: An Accurate and Ultrafast Surface Normal Estimator (RAL+ICRA'21)
Stars: ✭ 41 (-6.82%)
masaderThe largest public catalogue for Arabic NLP and speech datasets. There are +250 datasets annotated with more than 25 attributes.
Stars: ✭ 66 (+50%)
Cluecorpus2020Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料
Stars: ✭ 278 (+531.82%)
Clustering-DatasetsThis repository contains the collection of UCI (real-life) datasets and Synthetic (artificial) datasets (with cluster labels and MATLAB files) ready to use with clustering algorithms.
Stars: ✭ 189 (+329.55%)
Voice datasets🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).
Stars: ✭ 494 (+1022.73%)
akshareAKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Stars: ✭ 5,155 (+11615.91%)
awesome-forests🌳 A curated list of ground-truth forest datasets for the machine learning and forestry community.
Stars: ✭ 111 (+152.27%)
datasetdataset is a command line tool, Go package, shared library and Python package for working with JSON objects as collections
Stars: ✭ 21 (-52.27%)
RoapiCreate full-fledged APIs for static datasets without writing a single line of code.
Stars: ✭ 253 (+475%)
parlitoolsA collection of useful tools for UK politics
Stars: ✭ 22 (-50%)
DoccanoOpen source annotation tool for machine learning practitioners.
Stars: ✭ 5,600 (+12627.27%)
newtNatural World Tasks
Stars: ✭ 24 (-45.45%)
newsletter-archiveMarkdown archive & RSS/Atom feeds for Data Is Plural.
Stars: ✭ 65 (+47.73%)
ake-datasetsLarge, curated set of benchmark datasets for evaluating automatic keyphrase extraction algorithms.
Stars: ✭ 125 (+184.09%)
AudinoOpen source audio annotation tool for humans™
Stars: ✭ 740 (+1581.82%)
spectrochempySpectroChemPy is a framework for processing, analyzing and modeling spectroscopic data for chemistry with Python
Stars: ✭ 34 (-22.73%)
multi-task-defocus-deblurring-dual-pixel-nimatReference github repository for the paper "Improving Single-Image Defocus Deblurring: How Dual-Pixel Images Help Through Multi-Task Learning". We propose a single-image deblurring network that incorporates the two sub-aperture views into a multitask framework. Specifically, we show that jointly learning to predict the two DP views from a single …
Stars: ✭ 29 (-34.09%)
Awesome RoboticsA curated list of awesome links and software libraries that are useful for robots.
Stars: ✭ 478 (+986.36%)
datumaroDataset Management Framework, a Python library and a CLI tool to build, analyze and manage Computer Vision datasets.
Stars: ✭ 274 (+522.73%)
NetEmb-DatasetsA collection of real-world networks/graphs for Network Embedding
Stars: ✭ 18 (-59.09%)
NLP PEMDCNLP Predtrained Embeddings, Models and Datasets Collections(NLP_PEMDC). The collection will keep updating.
Stars: ✭ 58 (+31.82%)
Commons⛲️ Commons Marketplace client & server to explore, download, and publish open data sets in the Ocean Protocol Network.
Stars: ✭ 34 (-22.73%)
Spatio-Temporal-papersThis project is a collection of recent research in areas such as new infrastructure and urban computing, including white papers, academic papers, AI lab and dataset etc.
Stars: ✭ 180 (+309.09%)
recurrent-defocus-deblurring-synth-dual-pixelReference github repository for the paper "Learning to Reduce Defocus Blur by Realistically Modeling Dual-Pixel Data". We propose a procedure to generate realistic DP data synthetically. Our synthesis approach mimics the optical image formation found on DP sensors and can be applied to virtual scenes rendered with standard computer software. Lev…
Stars: ✭ 30 (-31.82%)
cifairA duplicate-free variant of the CIFAR test set.
Stars: ✭ 13 (-70.45%)
GeobrEasy access to official spatial data sets of Brazil in R and Python
Stars: ✭ 411 (+834.09%)
disent🧶 Modular VAE disentanglement framework for python built with PyTorch Lightning ▸ Including metrics and datasets ▸ With strongly supervised, weakly supervised and unsupervised methods ▸ Easily configured and run with Hydra config ▸ Inspired by disentanglement_lib
Stars: ✭ 41 (-6.82%)
bugrepoA collection of publicly available bug reports
Stars: ✭ 93 (+111.36%)
EasyprAn easy, flexible, and accurate plate recognition project for Chinese licenses in unconstrained situations.
Stars: ✭ 6,046 (+13640.91%)
dplace-dataThe data repository for the D-PLACE Project (Database of Places, Language, Culture and Environment)
Stars: ✭ 49 (+11.36%)
Pytorch CppC++ Implementation of PyTorch Tutorials for Everyone
Stars: ✭ 1,014 (+2204.55%)
Dataframes.jlIn-memory tabular data in Julia
Stars: ✭ 951 (+2061.36%)
Label StudioLabel Studio is a multi-type data labeling and annotation tool with standardized output format
Stars: ✭ 7,264 (+16409.09%)
Awesome Holistic 3dA list of papers and resources (data,code,etc) for holistic 3D reconstruction in computer vision
Stars: ✭ 387 (+779.55%)
opendatasetsA Python library for downloading datasets from Kaggle, Google Drive, and other online sources.
Stars: ✭ 161 (+265.91%)