All Projects → vaslnk → Spotify-Song-Recommendation-ML

vaslnk / Spotify-Song-Recommendation-ML

Licence: other
UC Berkeley team's submission for RecSys Challenge 2018

Programming Languages

Jupyter Notebook
11667 projects
python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Spotify-Song-Recommendation-ML

Gorse
An open source recommender system service written in Go
Stars: ✭ 1,148 (+1540%)
Mutual labels:  data-mining, collaborative-filtering, recommender-system
Recommender-System
In this code we implement and compared Collaborative Filtering algorithm, prediction algorithms such as neighborhood methods, matrix factorization-based ( SVD, PMF, SVD++, NMF), and many others.
Stars: ✭ 30 (-57.14%)
Mutual labels:  collaborative-filtering, recommender-system
BPR MPR
BPR, Bayesian Personalized Ranking (BPR), extremely convenient BPR & Multiple Pairwise Ranking
Stars: ✭ 77 (+10%)
Mutual labels:  collaborative-filtering, recommender-system
Recommender-Systems-with-Collaborative-Filtering-and-Deep-Learning-Techniques
Implemented User Based and Item based Recommendation System along with state of the art Deep Learning Techniques
Stars: ✭ 41 (-41.43%)
Mutual labels:  collaborative-filtering, recommender-system
TIFUKNN
kNN-based next-basket recommendation
Stars: ✭ 38 (-45.71%)
Mutual labels:  collaborative-filtering, recommender-system
Awesome-Machine-Learning-Papers
📖Notes and remarks on Machine Learning related papers
Stars: ✭ 35 (-50%)
Mutual labels:  collaborative-filtering, recommender-system
genie
Genie: A Fast and Robust Hierarchical Clustering Algorithm (this R package has now been superseded by genieclust)
Stars: ✭ 21 (-70%)
Mutual labels:  data-mining, data-analysis
python-notebooks
A collection of Jupyter Notebooks used in conferences or just to have some snippets.
Stars: ✭ 14 (-80%)
Mutual labels:  data-mining, data-analysis
genieclust
Genie++ Fast and Robust Hierarchical Clustering with Noise Point Detection - for Python and R
Stars: ✭ 34 (-51.43%)
Mutual labels:  data-mining, data-analysis
recsys spark
Spark SQL 实现 ItemCF,UserCF,Swing,推荐系统,推荐算法,协同过滤
Stars: ✭ 76 (+8.57%)
Mutual labels:  collaborative-filtering, recommender-system
leaflet heatmap
简单的可视化湖州通话数据 假设数据量很大,没法用浏览器直接绘制热力图,把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后,再使用Apache Spark绘制热力图,然后用leafletjs加载OpenStreetMap图层和热力图图层,以达到良好的交互效果。现在使用Apache Spark实现绘制,可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法,并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .
Stars: ✭ 13 (-81.43%)
Mutual labels:  spark, data-analysis
online-course-recommendation-system
Built on data from Pluralsight's course API fetched results. Works with model trained with K-means unsupervised clustering algorithm.
Stars: ✭ 31 (-55.71%)
Mutual labels:  data-analysis, recommender-system
chainRec
Mengting Wan, Julian McAuley, "Item Recommendation on Monotonic Behavior Chains", in Proc. of 2018 ACM Conference on Recommender Systems (RecSys'18), Vancouver, Canada, Oct. 2018.
Stars: ✭ 52 (-25.71%)
Mutual labels:  data-mining, recommender-system
taller SparkR
Taller SparkR para las Jornadas de Usuarios de R
Stars: ✭ 12 (-82.86%)
Mutual labels:  data-mining, data-analysis
Loan-Approval-Prediction
Loan Application Data Analysis
Stars: ✭ 61 (-12.86%)
Mutual labels:  data-mining, data-analysis
BARS
Towards open benchmarking for recommender systems https://openbenchmark.github.io/BARS
Stars: ✭ 157 (+124.29%)
Mutual labels:  collaborative-filtering, recommender-system
twitter-analytics-wrapper
A simple Python wrapper to download tweets data from the Twitter Analytics platform. Particularly interesting for the impressions metrics that are unavailable on current Twitter API. Also works for the videos data.
Stars: ✭ 44 (-37.14%)
Mutual labels:  data-mining, data-analysis
PracticalMachineLearning
A collection of ML related stuff including notebooks, codes and a curated list of various useful resources such as books and softwares. Almost everything mentioned here is free (as speech not free food) or open-source.
Stars: ✭ 60 (-14.29%)
Mutual labels:  data-mining, data-analysis
heidi
heidi : tidy data in Haskell
Stars: ✭ 24 (-65.71%)
Mutual labels:  data-mining, data-analysis
popular restaurants from officials
서울시 공무원의 업무추진비를 분석하여 진짜 맛집 찾기 프로젝트
Stars: ✭ 22 (-68.57%)
Mutual labels:  data-mining, data-analysis

Spotify Million Playlists (RecSys 2018) Challenge Submission

UC Berkeley Team: Jack Vasylenko, Chitwan Kaudan, Anith Patel, Tyler Larsen and William Wang.

Databricks Spark Demo video

This project is a song recommendation system implemented using Spark MLib Alternating Squares Collaborative Filtering Algorithm trained on 1 million playlists open-sourced by Spotify.

About the dataset:

The MPD contains a million user-generated playlists. These playlists were created during the period of January 2010 through October 2017. Each playlist in the MPD contains a playlist title, the track list (including track metadata) editing information (last edit time, number of playlist edits) and other miscellaneous information about the playlist.

Obtaining The Data

Proceed with these steps to download Spotify's dataset (33 Gb) and convert the data into a memory-efficient format (~ 5 Gb) for use on the Databricks platform:

  1. Download Spotify's official dataset and place the 'data' folder into the root folder of the project.
  2. Run the following command:
python restructureData.py 

This script populates the \data_csv folder with the data that can be used to create a Databricks table.

1. Exploratory data analysis:

EDA.ipynb

2. Using Neural Collaborative Filtering approach on a subset of data:

Neural-Collaborative-Filtering.ipynb

3. Training & Using Spark MLib Alternative Least Squares algorithm on all of data:

Spark-MLib-ALS.ipynb

License

Usage of the Million Playlist Dataset is subject to these license terms

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].