A machine learning-based Movie Recommendation System that uses K-Nearest Neighbors (KNN) and clustering techniques to suggest similar movies to users based on their preferences or selected titles.
- π Content-based Filtering using KNN
- π§ Clustering of similar movies using KMeans
- π― Personalized movie recommendations
- π Based on genres, ratings, popularity, and other metadata
- π§Ή Clean and scalable code structure for further improvements
- Python
- Pandas, NumPy
- Scikit-learn (KNeighborsClassifier, KMeans)
- Matplotlib / Seaborn (optional: for visualizations)
- Jupyter Notebook / Streamlit (if there's a UI)
You can use any of the following:
Make sure the dataset includes features like:
- Movie title
- Genre
- Keywords
- Ratings
- Popularity / Vote Count
- Preprocessing: Clean the dataset, handle missing values, and extract relevant features.
- Feature Vectorization: Use TF-IDF or CountVectorizer on text data (like genres and keywords).
- Clustering: Group similar movies using KMeans (optional).
- KNN Algorithm: For a given movie, find its top-N nearest neighbors in the feature space.
- Recommendation: Return the most similar movies.
- Clone the repository:
git clone https://github.com/YourUsername/movie-recommender.git cd movie-recommender