Movie Recommender - Personal Project

This is a personal project I worked on after following a YouTube tutorial. The goal of this project was to create a movie recommender app using machine learning. I used various techniques to vectorize the movie data and apply similarity measures to recommend movies based on user preferences.

Technologies Used:

Machine Learning: For building the recommendation system. Used scikit-learn.
Streamlit: For creating the web app interface.
Render: For deployment, instead of Heroku as suggested in the YouTube tutorial.

Techniques Applied:

Text Vectorization using Bag of Words:
- In the Bag of Words technique, I combined all the tags associated with a movie into a single large text string. From this large text, I calculated the frequency of all the words.
- The top 5000 words with the highest frequency were extracted to form a feature set.
Stemming:
- I applied stemming to remove different forms of the same word. For example, "action," "actions," and "acting" were all treated as "action."
Frequency Calculation:
- After stemming, I checked the frequency of these top 5000 words in each movie's tags.
- A DataFrame was created where the shape was 5000 x 5000 (movies vs. top words), representing the frequency of the top words in each movie's tag.
Stop Words Removal:
- Stop words like "and," "to," "the," "from," etc., were removed, as they don't add significant meaning to the text.
Cosine Distance for Similarity:
- Once each movie was represented in vector format, I used Cosine Distance to measure the similarity between movies.
- Cosine Distance is preferred over Euclidean distance in higher-dimensional spaces, as Euclidean distance is not a reliable measure of similarity in such contexts.

Link to the website: https://movie-recommender-1exi.onrender.com

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.DS_Store		.DS_Store
.gitattributes		.gitattributes
.gitignore		.gitignore
Procfile		Procfile
README.md		README.md
app.py		app.py
movie_recom.ipynb		movie_recom.ipynb
movies.pkl		movies.pkl
requirements.txt		requirements.txt
rough.py		rough.py
setup.sh		setup.sh
similarity.pkl		similarity.pkl
tmdb_5000_credits.csv		tmdb_5000_credits.csv
tmdb_5000_movies.csv		tmdb_5000_movies.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Movie Recommender - Personal Project

Technologies Used:

Techniques Applied:

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Movie Recommender - Personal Project

Technologies Used:

Techniques Applied:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages