Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → zygmuntz → Goodbooks 10k

zygmuntz / Goodbooks 10k

Licence: other

Ten thousand books, six million ratings

Labels

jupyter-notebook books

Projects that are alternatives of or similar to Goodbooks 10k

Meteorological Books

气象相关书籍合集（持续更新）

Stars: ✭ 125 (-80.65%)

Mutual labels: jupyter-notebook, books

Awesome Ai Books

Some awesome AI related books and pdfs for learning and downloading, also apply some playground models for learning

Stars: ✭ 855 (+32.35%)

Mutual labels: jupyter-notebook, books

Papers Books Reading

🍬Some papers & books I’ve read.

Stars: ✭ 22 (-96.59%)

Mutual labels: jupyter-notebook, books

Python For Probability Statistics And Machine Learning

Jupyter Notebooks for Springer book "Python for Probability, Statistics, and Machine Learning"

Stars: ✭ 481 (-25.54%)

Mutual labels: jupyter-notebook, books

Practical Deep Learning For Coders 2.0

Notebooks for the "A walk with fastai2" Study Group and Lecture Series

Stars: ✭ 638 (-1.24%)

Mutual labels: jupyter-notebook

EPFL Machine Learning Course, Fall 2019

Stars: ✭ 634 (-1.86%)

Mutual labels: jupyter-notebook

Mining The Social Web 3rd Edition

The official online compendium for Mining the Social Web, 3rd Edition (O'Reilly, 2018)

Stars: ✭ 633 (-2.01%)

Mutual labels: jupyter-notebook

Bokeh Notebooks

Interactive Web Plotting with Bokeh in IPython notebook

Stars: ✭ 629 (-2.63%)

Mutual labels: jupyter-notebook

TensorFlow 101: Introduction to Deep Learning for Python Within TensorFlow

Stars: ✭ 642 (-0.62%)

Mutual labels: jupyter-notebook

📘 The interactive computing suite for you! ✨

Stars: ✭ 5,713 (+784.37%)

Mutual labels: jupyter-notebook

Hands On Reinforcement Learning With Python

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

Stars: ✭ 640 (-0.93%)

Mutual labels: jupyter-notebook

Machine Learning toolbox for Humans

Stars: ✭ 634 (-1.86%)

Mutual labels: jupyter-notebook

Build a bot that speaks like you!

Stars: ✭ 641 (-0.77%)

Mutual labels: jupyter-notebook

Use evolutionary algorithms instead of gridsearch in scikit-learn

Stars: ✭ 633 (-2.01%)

Mutual labels: jupyter-notebook

Python implementation of algorithms from Russell And Norvig's "Artificial Intelligence - A Modern Approach"

Stars: ✭ 6,129 (+848.76%)

Mutual labels: jupyter-notebook

Ai Fundamentals

Code samples for AI fundamentals

Stars: ✭ 631 (-2.32%)

Mutual labels: jupyter-notebook

Zero To Mastery Ml

All course materials for the Zero to Mastery Machine Learning and Data Science course.

Stars: ✭ 631 (-2.32%)

Mutual labels: jupyter-notebook

Jupyter Notebooks and code for Python for Finance (2nd ed., O'Reilly) by Yves Hilpisch.

Stars: ✭ 640 (-0.93%)

Mutual labels: jupyter-notebook

Data Visualization

Misc data visualization projects, examples, and demos: mostly Python (pandas + matplotlib) and JavaScript (leaflet).

Stars: ✭ 639 (-1.08%)

Mutual labels: jupyter-notebook

Speech Emotion Analyzer

The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)

Stars: ✭ 633 (-2.01%)

Mutual labels: jupyter-notebook

View All Similar Projects ➔

goodbooks-10k

This dataset contains six million ratings for ten thousand most popular (with most ratings) books. There are also:

books marked to read by the users
book metadata (author, year, etc.)
tags/shelves/genres

Access

Some of these files are quite large, so GitHub won't show their contents online. See samples/ for smaller CSV snippets.

Open the notebook for a quick look at the data. Download individual zipped files from releases.

The dataset is accessible from Spotlight, recommender software based on PyTorch.

Contents

ratings.csv contains ratings sorted by time. It is 69MB and looks like that:

user_id,book_id,rating
1,258,5
2,4081,4
2,260,5
2,9296,5
2,2318,3

Ratings go from one to five. Both book IDs and user IDs are contiguous. For books, they are 1-10000, for users, 1-53424.

to_read.csv provides IDs of the books marked "to read" by each user, as user_id,book_id pairs, sorted by time. There are close to a million pairs.

books.csv has metadata for each book (goodreads IDs, authors, title, average rating, etc.). The metadata have been extracted from goodreads XML files, available in books_xml.

Tags

book_tags.csv contains tags/shelves/genres assigned by users to books. Tags in this file are represented by their IDs. They are sorted by goodreads_book_id ascending and count descending.

In raw XML files, tags look like this:

<popular_shelves>
	<shelf name="science-fiction" count="833"/>
	<shelf name="fantasy" count="543"/>
	<shelf name="sci-fi" count="542"/>
	...
	<shelf name="for-fun" count="8"/>
	<shelf name="all-time-favorites" count="8"/>
	<shelf name="science-fiction-and-fantasy" count="7"/>	
</popular_shelves>

Here, each tag/shelf is given an ID. tags.csv translates tag IDs to names.

goodreads IDs

Each book may have many editions. goodreads_book_id and best_book_id generally point to the most popular edition of a given book, while goodreads work_id refers to the book in the abstract sense.

You can use the goodreads book and work IDs to create URLs as follows:

https://www.goodreads.com/book/show/2767052
https://www.goodreads.com/work/editions/2792775

Note that book_id in ratings.csv and to_read.csv maps to work_id, not to goodreads_book_id, meaning that ratings for different editions are aggregated.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 646

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (2) 🔗