All Projects → Tharun-tharun → Movie-Recommendation-System-with-Sentiment-Analysis

Tharun-tharun / Movie-Recommendation-System-with-Sentiment-Analysis

Licence: other
This is a Machine Learning project to create a "Movie Recommender System" and predict user ratings for movies using cosine similarity.

Programming Languages

Jupyter Notebook
11667 projects
HTML
75241 projects
javascript
184084 projects - #8 most used programming language
python
139335 projects - #7 most used programming language
CSS
56736 projects

Projects that are alternatives of or similar to Movie-Recommendation-System-with-Sentiment-Analysis

Movie-Recommendation-System-with-Sentiment-Analysis
Content based movie recommendation system with sentiment analysis
Stars: ✭ 44 (+109.52%)
Mutual labels:  sentiment-analysis, movie-recommendation
applytics
Perform Sentiment Analysis on reviews of your apps
Stars: ✭ 21 (+0%)
Mutual labels:  sentiment-analysis
RcppML
Rcpp Machine Learning: Fast robust NMF, divisive clustering, and more
Stars: ✭ 52 (+147.62%)
Mutual labels:  matrix-factorization
spark-twitter-sentiment-analysis
Sentiment Analysis of a Twitter Topic with Spark Structured Streaming
Stars: ✭ 55 (+161.9%)
Mutual labels:  sentiment-analysis
sentiment-analysis
🎈 A Node.js AFINN-111 based sentiment analysis module
Stars: ✭ 26 (+23.81%)
Mutual labels:  sentiment-analysis
OLSTEC
OnLine Low-rank Subspace tracking by TEnsor CP Decomposition in Matlab: Version 1.0.1
Stars: ✭ 30 (+42.86%)
Mutual labels:  matrix-factorization
visualization
Text visualization tools
Stars: ✭ 18 (-14.29%)
Mutual labels:  sentiment-analysis
lsh-rs
Locality Sensitive Hashing in Rust with Python bindings
Stars: ✭ 64 (+204.76%)
Mutual labels:  cosine-similarity
sentibol
⚽ Notebook feito para analisar o case do Sentibol
Stars: ✭ 18 (-14.29%)
Mutual labels:  sentiment-analysis
chronist
Long-term analysis of emotion, age, and sentiment using Lifeslice and text records.
Stars: ✭ 23 (+9.52%)
Mutual labels:  sentiment-analysis
cisip-FIRe
Fast Image Retrieval (FIRe) is an open source project to promote image retrieval research. It implements most of the major binary hashing methods to date, together with different popular backbone networks and public datasets.
Stars: ✭ 40 (+90.48%)
Mutual labels:  cosine-similarity
sentiment-analysis-torchtext
Seniment Analysis in Torchtext
Stars: ✭ 19 (-9.52%)
Mutual labels:  sentiment-analysis
keras-aquarium
a small collection of models implemented in keras, including matrix factorization(recommendation system), topic modeling, text classification, etc. Runs on tensorflow.
Stars: ✭ 14 (-33.33%)
Mutual labels:  matrix-factorization
semeval22 structured sentiment
SemEval-2022 Shared Task 10: Structured Sentiment Analysis
Stars: ✭ 67 (+219.05%)
Mutual labels:  sentiment-analysis
sentiment analysis dict
sentiment analysis、情感分析、文本分类、基于字典、python、classification
Stars: ✭ 111 (+428.57%)
Mutual labels:  sentiment-analysis
Content-based-Recommender-System
It is a content based recommender system that uses tf-idf and cosine similarity for N Most SImilar Items from a dataset
Stars: ✭ 64 (+204.76%)
Mutual labels:  cosine-similarity
text analysis tools
中文文本分析工具包(包括- 文本分类 - 文本聚类 - 文本相似性 - 关键词抽取 - 关键短语抽取 - 情感分析 - 文本纠错 - 文本摘要 - 主题关键词-同义词、近义词-事件三元组抽取)
Stars: ✭ 410 (+1852.38%)
Mutual labels:  sentiment-analysis
Senti4SD
An emotion-polarity classifier specifically trained on developers' communication channels
Stars: ✭ 41 (+95.24%)
Mutual labels:  sentiment-analysis
AI-Sentiment-Analysis-on-IMDB-Dataset
Sentiment Analysis using Stochastic Gradient Descent on 50,000 Movie Reviews Compiled from the IMDB Dataset
Stars: ✭ 55 (+161.9%)
Mutual labels:  sentiment-analysis
analyzing-reddit-sentiment-with-aws
Learn how to use Kinesis Firehose, AWS Glue, S3, and Amazon Athena by streaming and analyzing reddit comments in realtime. 100-200 level tutorial.
Stars: ✭ 40 (+90.48%)
Mutual labels:  sentiment-analysis

Movie-Recommendation-System

This application provides all the details of the requested movie such as overview, genre, release date, rating, runtime, top cast, reviews, recommended movies, etc.

The details of the movies(title, genre, runtime, ratings, posters, etc) are fetched using an API by TMDB, https://www.themoviedb.org, and using the IMDB id of the movie in the API, I did web scraping to get the reviews given by the user in the IMDB site using beautifulsoup4 and performed sentiment analysis on those reviews.

Link to the application

Check out the live demo: https://lookupforcinema.herokuapp.com

Medium Article About My Project

https://medium.com/analytics-vidhya/build-a-movie-recommendation-flask-based-deployment-8e2970f1f5f1

Finding similar movies

Without taking content into account (Just based on ratings)

Here just based on the ratings of the users for different movies, we use K nearest neighbours algorithm to find the movies which are similar.

With taking Content into account

Here we just information about the movies, in this case the information of genres to predict the most similar movies.

Matrix Factorisation(Collabarative Filtering)

Two approaches were tried to do matrix factorisation, the low rank method is very slow, so used scipy's SVD for sparse matrix.

Architecture

110212434-597bb700-7ec1-11eb-9ffa-7ac319e33123

Deep Learning Methods

One popular recommender systems approach is called Matrix Factorisation. It works on the principle that we can learn a low-dimensional representation (embedding) of user and movie. For example, for each movie, we can have how much action it has, how long it is, and so on. For each user, we can encode how much they like action, or how much they like long movies, etc. Thus, we can combine the user and the movie embeddings to estimate the ratings on unseen movies. This approach can also be viewed as: given a matrix (A [M X N]) containing users and movies, we want to estimate low dimensional matrices (W [M X k] and H [M X k]), such that: A≈W.HT

1.Matrix Factorisation based on Deep learning

2. Matrix Factorisation based on Deep learning with non negative embeddings.

3. Advanced neural network with different number of embeddings for both and movies.

Required Tools

  1. Keras
  2. Scipy
  3. Numpy
  4. Pandas
  5. python 3

Sources of the datasets

  1. IMDB 5000 Movie Dataset
  2. The Movies Dataset
  3. List of movies in 2018
  4. List of movies in 2019
  5. List of movies in 2020

Please do the repository, if it helped you in anyways

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].