Netflix Movie Recommendation Project

Introduction

The aim of this project is to scrape movie data from IMDb and combine it with the team members' Netflix viewing history to predict which movies we would enjoy watching as a team. The project is part of our data science practice, in which we apply exploratory data analysis (EDA), linear regression, web scraping with BeautifulSoup and Selenium, and feature engineering.

Data Sources

IMDb: We will scrape movie data from the IMDb website, which is one of the most comprehensive sources for movie information.
Netflix: We will use the team members' Netflix viewing history to understand which movies we enjoy watching as a team.
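
As a rough illustration of how the viewing histories could be combined, the sketch below reads one member's history from CSV text and counts how many team members watched each title. It assumes a two-column "Title,Date" layout for the export; the sample rows, the second member's list, and the helper names `load_titles` and `team_watch_counts` are hypothetical and not taken from the project notebooks.

```python
import csv
import io
from collections import Counter

# Hypothetical sample in an assumed "Title,Date" layout; a real export may differ.
SAMPLE_HISTORY = """Title,Date
Inception,1/15/23
The Irishman,1/10/23
Stranger Things: Season 1: Chapter One,1/8/23
Inception,12/30/22
"""


def load_titles(csv_file):
    """Return the watched titles from a file-like object, in file order."""
    reader = csv.DictReader(csv_file)
    return [row["Title"].strip() for row in reader if row.get("Title")]


def team_watch_counts(histories):
    """Count how many team members watched each title (one vote per member)."""
    counts = Counter()
    for titles in histories:
        counts.update(set(titles))  # ignore rewatches by the same member
    return counts


member_a = load_titles(io.StringIO(SAMPLE_HISTORY))
member_b = ["Inception", "Roma"]  # second hypothetical member
counts = team_watch_counts([member_a, member_b])
print(counts.most_common(1))
```

Counting each title once per member keeps a single heavy rewatcher from dominating the team-level picture; whether that is the right weighting depends on how the team wants to define a shared preference.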

Tools and Techniques

Web scraping with BeautifulSoup: We will use this Python library to parse IMDb pages and extract movie data from them.
Selenium: We will use this browser automation tool, through its Python bindings, to automate the scraping process.
EDA: We will perform exploratory data analysis to gain insights into the data and identify patterns.
Linear regression: We will use linear regression to build a predictive model that can recommend movies based on our viewing history.
Feature engineering: We will create new features from the existing data to improve the accuracy of our predictive model.
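
The parsing step can be sketched with BeautifulSoup on a small, static HTML string. The markup, class names, and movie entries below are invented for illustration and do not mirror IMDb's actual page structure; `parse_movies` is a hypothetical helper, not a function from the project notebooks.

```python
from bs4 import BeautifulSoup

# Invented markup for illustration only; it does NOT mirror IMDb's real HTML.
SAMPLE_HTML = """
<ul>
  <li class="movie">
    <span class="title">Example Movie A</span>
    <span class="rating">8.1</span>
    <span class="genre">Drama, Crime</span>
  </li>
  <li class="movie">
    <span class="title">Example Movie B</span>
    <span class="rating">6.4</span>
    <span class="genre">Comedy</span>
  </li>
</ul>
"""


def parse_movies(html):
    """Extract title, rating, and genre list from each movie entry."""
    soup = BeautifulSoup(html, "html.parser")
    movies = []
    for item in soup.find_all("li", class_="movie"):
        title = item.find("span", class_="title").get_text(strip=True)
        rating = float(item.find("span", class_="rating").get_text(strip=True))
        genre_text = item.find("span", class_="genre").get_text()
        genres = [g.strip() for g in genre_text.split(",")]
        movies.append({"title": title, "rating": rating, "genres": genres})
    return movies


scraped = parse_movies(SAMPLE_HTML)
print(scraped[0])
```

On a live site the HTML would come from an HTTP response or from a Selenium-driven browser rather than a string, and the selectors would need to match whatever markup the site serves at the time.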

Deliverables

Presentation File: We will create a visual and oral presentation to showcase our project and findings.
Project Repository: We will create a GitHub repository to share our code and project details.
Blog Post: We will publish a blog post online (e.g., on Medium) to share our project and findings with the broader data science community.

Conclusion

This project aims to show which kinds of movies we would enjoy watching as a team, based on our Netflix viewing history and IMDb data. We will combine web scraping, exploratory data analysis, feature engineering, and linear regression to build a predictive model. The project will showcase our data science skills and give us valuable experience in working with real-world datasets.
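
To make the modeling step concrete, here is a minimal sketch of feature engineering followed by an ordinary least squares fit with NumPy. Everything in it is an assumption made for illustration: the rows, the "team score" target, the genre vocabulary, and the helpers `make_features` and `predict` are hypothetical, and the project notebooks may use a different library and a different target.

```python
import numpy as np

# Hypothetical rows: (genres, IMDb rating, team score). All values are made up.
TRAINING_ROWS = [
    (["Drama"], 8.0, 9.0),
    (["Comedy"], 6.0, 5.0),
    (["Drama", "Crime"], 7.0, 8.5),
    (["Comedy", "Crime"], 7.5, 7.0),
    (["Crime"], 6.5, 6.5),
    (["Drama", "Comedy"], 7.2, 7.0),
    (["Drama"], 6.8, 7.5),
]
GENRES = ["Comedy", "Crime", "Drama"]  # assumed genre vocabulary


def make_features(genres, rating):
    """One-hot encode the genres, then append the rating and a bias term."""
    one_hot = [1.0 if g in genres else 0.0 for g in GENRES]
    return one_hot + [rating, 1.0]


X = np.array([make_features(g, r) for g, r, _ in TRAINING_ROWS])
y = np.array([score for _, _, score in TRAINING_ROWS])

# Ordinary least squares: find weights that minimize ||X @ w - y||^2.
weights, *_ = np.linalg.lstsq(X, y, rcond=None)


def predict(genres, rating):
    """Predicted team score for an unseen movie."""
    return float(np.array(make_features(genres, rating)) @ weights)


fit_mse = float(np.mean((X @ weights - y) ** 2))
baseline_mse = float(np.mean((y - y.mean()) ** 2))
print(f"fit MSE: {fit_mse:.3f}, mean-baseline MSE: {baseline_mse:.3f}")
print(f"predicted score: {predict(['Drama', 'Crime'], 7.8):.2f}")
```

Comparing the fit against a predict-the-mean baseline is a quick sanity check; with so few rows the numbers say nothing about real predictive quality, which would need a held-out test set.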

Folders and files

.ipynb_checkpoints
Scraped_datasets
archive
IMDb_Movie_EDA_Preprocessing.ipynb
IMDb_Regression.ipynb
IMDb_Scraping.ipynb
IMDb_Scraping_with_Selenium.ipynb
LICENSE
Python.gitignore
README.md
Selenium_testing.ipynb
imdb_scraped_data_all.csv
imdb_scraped_data_all_clear_genre.csv
imdb_scraped_data_all_test.csv

MamiMrl/IMDb-Movie-Scraping-for-Team-Netflix-Recommendation
