Darel13712 / rs_datasets

Licence: MIT license
Tool for autodownloading recommendation systems datasets

Programming Languages

python

Projects that are alternatives to or similar to rs_datasets

git-rdm
A research data management plugin for the Git version control system.
Stars: ✭ 34 (+54.55%)
Mutual labels:  datasets, research-data-management
mildnet
Visual Similarity research at Fynd. Contains code to reproduce 2 of our research papers.
Stars: ✭ 76 (+245.45%)
Mutual labels:  ml, recommender-system
mindsdb-examples
Examples for usage of Mindsdb https://www.mindsdb.com/
Stars: ✭ 25 (+13.64%)
Mutual labels:  ml, datasets
Cleora
Cleora AI is a general-purpose model for efficient, scalable learning of stable and inductive entity embeddings for heterogeneous relational data.
Stars: ✭ 303 (+1277.27%)
Mutual labels:  ml, datasets
Codesearchnet
Datasets, tools, and benchmarks for representation learning of code.
Stars: ✭ 1,378 (+6163.64%)
Mutual labels:  ml, datasets
Nottingham Dataset
Cleaned version of the Nottingham dataset
Stars: ✭ 94 (+327.27%)
Mutual labels:  ml, datasets
Hub
Dataset format for AI. Build, manage, & visualize datasets for deep learning. Stream data real-time to PyTorch/TensorFlow & version-control it. https://activeloop.ai
Stars: ✭ 4,003 (+18095.45%)
Mutual labels:  ml, datasets
COVID-Net
Launched in March 2020 in response to the coronavirus disease 2019 (COVID-19) pandemic, COVID-Net is a global open source, open access initiative dedicated to accelerating advancement in machine learning to aid front-line healthcare workers and clinical institutions around the world fighting the continuing pandemic. Towards this goal, our global…
Stars: ✭ 41 (+86.36%)
Mutual labels:  ml, datasets
industrial-ml-datasets
A curated list of datasets, publicly available for machine learning research in the area of manufacturing
Stars: ✭ 45 (+104.55%)
Mutual labels:  ml, datasets
neptune-client
📒 Experiment tracking tool and model registry
Stars: ✭ 348 (+1481.82%)
Mutual labels:  ml
Thirukkural-Tamil-Dataset
திருக்குறள் by திருவள்ளுவர்.
Stars: ✭ 44 (+100%)
Mutual labels:  datasets
scRNAseq cell cluster labeling
Scripts to run and benchmark scRNA-seq cell cluster labeling methods
Stars: ✭ 41 (+86.36%)
Mutual labels:  datasets
RecSys Course 2017
DEPRECATED This is the official repository for the 2017 Recommender Systems course at Polimi.
Stars: ✭ 23 (+4.55%)
Mutual labels:  recommender-system
responsible-ai-toolbox
This project provides responsible AI user interfaces for Fairlearn, interpret-community, and Error Analysis, as well as foundational building blocks that they rely on.
Stars: ✭ 615 (+2695.45%)
Mutual labels:  ml
neural inverse knitting
Code for Neural Inverse Knitting: From Images to Manufacturing Instructions
Stars: ✭ 30 (+36.36%)
Mutual labels:  ml
DiscEval
Discourse Based Evaluation of Language Understanding
Stars: ✭ 18 (-18.18%)
Mutual labels:  datasets
GatedPixelCNNPyTorch
PyTorch implementation of "Conditional Image Generation with PixelCNN Decoders" by van den Oord et al. 2016
Stars: ✭ 68 (+209.09%)
Mutual labels:  ml
cli
Polyaxon Core Client & CLI to streamline MLOps
Stars: ✭ 18 (-18.18%)
Mutual labels:  ml
FashionShopApp
Fashion Shop App : Flask, ChatterBot, ElasticSearch, Recommender-System
Stars: ✭ 28 (+27.27%)
Mutual labels:  recommender-system
aws-experiments-comprehend-custom-classifier
How to train a custom NLP classifier with AWS Comprehend?
Stars: ✭ 25 (+13.64%)
Mutual labels:  ml

Welcome to rs_datasets

This tool lets you download, unpack and read recommender systems datasets into a pandas.DataFrame as easily as data = Dataset().

Installation

pip install rs_datasets

Documentation

Please see the documentation for this project for the list of available datasets and usage examples.

Example of use

from rs_datasets import MovieLens
ml = MovieLens()
ml.info()
ratings
   user_id  item_id  rating  timestamp
0        1        1     4.0  964982703
1        1        3     4.0  964981247
2        1        6     4.0  964982224
items
   item_id  ...                                       genres
0        1  ...  Adventure|Animation|Children|Comedy|Fantasy
1        2  ...                   Adventure|Children|Fantasy
2        3  ...                               Comedy|Romance
[3 rows x 3 columns]
tags
   user_id  item_id              tag   timestamp
0        2    60756            funny  1445714994
1        2    60756  Highly quotable  1445714996
2        2    60756     will ferrell  1445714992
links
   item_id  imdb_id  tmdb_id
0        1   114709    862.0
1        2   113497   8844.0
2        3   113228  15602.0

Loaded DataFrames are available as class attributes.
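
To see which tables a loaded dataset exposes, you can inspect its attributes. The snippet below is a minimal sketch, not part of rs_datasets: list_tables is a hypothetical helper, and it assumes the DataFrames are stored as plain instance attributes (so vars() can see them). A stand-in object is used so the snippet runs without downloading anything.

```python
from types import SimpleNamespace


def list_tables(dataset):
    """Return {attribute name: shape} for every public DataFrame-like attribute.

    Hypothetical helper, not part of rs_datasets. Assumes the tables are
    stored as instance attributes on the dataset object.
    """
    tables = {}
    for name, value in vars(dataset).items():
        if name.startswith("_"):
            continue
        # Duck-typing check: pandas DataFrames expose both .shape and .columns
        if hasattr(value, "shape") and hasattr(value, "columns"):
            tables[name] = value.shape
    return tables


# Stand-in objects that mimic a dataset with one table and one non-table attribute
fake_frame = SimpleNamespace(
    shape=(3, 4),
    columns=["user_id", "item_id", "rating", "timestamp"],
)
fake_dataset = SimpleNamespace(ratings=fake_frame, name="demo")

print(list_tables(fake_dataset))  # {'ratings': (3, 4)}
```

With a real dataset, the same call would be list_tables(ml) after ml = MovieLens(), and individual tables are read directly, for example ml.ratings or ml.items.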

Note

This package relies on datatable to read files. There are known issues with reading some of the datasets. They should be fixed in datatable==1.1.0, but datatable releases are infrequent. If you have problems reading a dataset, try downgrading datatable to 0.11 or 0.9, or install a dev build (1.1.0a2102 or newer) from s3: find the build for your Python version, copy the link to the .whl file and run pip install <link>. Sorry for the inconvenience.
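
If you are unsure which datatable version you have, a quick check like the one below may help. It is a rough sketch using only the standard library (Python 3.8+). The helper names are hypothetical, and the comparison looks only at the numeric release part of the version string, so a dev build such as 1.1.0a2102 is treated as 1.1.0.

```python
import re
from importlib import metadata


def release_tuple(version):
    """Extract the leading numeric release, e.g. '1.1.0a2102' -> (1, 1, 0)."""
    match = re.match(r"\d+(?:\.\d+)*", version)
    if match is None:
        raise ValueError(f"unrecognised version string: {version!r}")
    return tuple(int(part) for part in match.group().split("."))


def predates_fix(version, fixed_in=(1, 1, 0)):
    """True if the release part of `version` is older than `fixed_in`."""
    release = release_tuple(version)
    # Pad short versions such as '0.11' to the same length before comparing
    release += (0,) * (len(fixed_in) - len(release))
    return release < fixed_in


def installed_datatable_version():
    """Return the installed datatable version string, or None if absent."""
    try:
        return metadata.version("datatable")
    except metadata.PackageNotFoundError:
        return None


version = installed_datatable_version()
if version is None:
    print("datatable is not installed")
elif predates_fix(version):
    print(f"datatable {version} predates 1.1.0; some datasets may fail to read")
else:
    print(f"datatable {version} is 1.1.0 (or a 1.1.0 dev build) or newer")
```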
