flamekaiser007/netflix_sql_project

Netflix Movies and TV Shows Data Analysis using SQL

Objective

The objective of this project is to perform exploratory data analysis (EDA) on the Netflix titles dataset using SQL. It extracts insights about global content trends, genre distribution, release patterns, and actor appearances. The analysis covers data cleaning, content categorization, and aggregation, using SQL features such as window functions, CASE expressions, and string operations. The project serves as a portfolio demonstration of SQL proficiency for real-world data analytics.

🧰 Tools & Technologies

MySQL (Workbench or CLI)

SQL (Window functions, CASE, GROUP BY, string operations)

CSV Dataset (Cleaned from original Netflix titles dataset)

Data Source: netflix_titles.csv (cleaned)

🗂 Dataset Overview

The dataset contains metadata about TV Shows and Movies available on Netflix as of 2021. It includes:

Show ID, Type (Movie/TV Show), Title

Cast and Director

Country, Date Added, Release Year

Rating, Duration

Genres (listed_in), Description
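
The fields above could map to a single table along these lines. This is only a sketch for MySQL 8: the table name `netflix`, the column widths, and the choice of text types are assumptions, not taken from the project, and the column names follow the usual `netflix_titles.csv` header, which the cleaned file may have changed.

```sql
-- Hypothetical table for the cleaned CSV; name and column sizes are assumptions.
CREATE TABLE netflix (
    show_id      VARCHAR(10) PRIMARY KEY,  -- Show ID
    type         VARCHAR(10),              -- 'Movie' or 'TV Show'
    title        VARCHAR(255),
    director     TEXT,
    `cast`       TEXT,                     -- comma-separated actor names; backticks avoid clashing with CAST()
    country      TEXT,                     -- may hold several comma-separated countries
    date_added   VARCHAR(50),              -- kept as text here; convert with STR_TO_DATE() if needed
    release_year INT,
    rating       VARCHAR(15),
    duration     VARCHAR(15),              -- minutes for movies, seasons for TV shows
    listed_in    VARCHAR(255),             -- comma-separated genres
    description  TEXT
);
```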

📊 Key SQL Analyses Performed

✅ Total number of titles on Netflix

✅ Count of Movies vs. TV Shows

✅ Most common content ratings by type

✅ Most active countries producing Netflix content

✅ Top genres using string splitting and aggregation

✅ Year-wise release trend analysis (last 5 years)

✅ Most frequent actors in Indian content

✅ Categorization of content based on violent descriptions

✅ Use of RANK(), CASE, UNNEST (PostgreSQL) / SUBSTRING_INDEX (MySQL workaround)
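
A few of the analyses listed above might look roughly like this in MySQL 8. These are illustrative sketches, not the project's actual queries. They assume a table named `netflix` with the standard `netflix_titles.csv` column names. The keywords `kill` and `violence` and the cap of 10 genres per title are also assumptions.

```sql
-- Count of Movies vs. TV Shows
SELECT type, COUNT(*) AS total_titles
FROM netflix
GROUP BY type;

-- Most common rating per type: rank ratings within each type by frequency
SELECT type, rating, rating_count
FROM (
    SELECT type,
           rating,
           COUNT(*) AS rating_count,
           RANK() OVER (PARTITION BY type ORDER BY COUNT(*) DESC) AS rnk
    FROM netflix
    GROUP BY type, rating
) AS ranked
WHERE rnk = 1;

-- Categorize titles by description keywords (keyword list is an assumption)
SELECT category, COUNT(*) AS total_titles
FROM (
    SELECT CASE
               WHEN LOWER(description) LIKE '%kill%'
                 OR LOWER(description) LIKE '%violence%' THEN 'Violent'
               ELSE 'Non-violent'
           END AS category
    FROM netflix
) AS labelled
GROUP BY category;

-- Top genres: split the comma-separated listed_in column with SUBSTRING_INDEX.
-- The recursive CTE generates positions 1..10 (assumed maximum genres per title).
WITH RECURSIVE seq (n) AS (
    SELECT 1
    UNION ALL
    SELECT n + 1 FROM seq WHERE n < 10
)
SELECT TRIM(SUBSTRING_INDEX(SUBSTRING_INDEX(t.listed_in, ',', seq.n), ',', -1)) AS genre,
       COUNT(*) AS total_titles
FROM netflix AS t
JOIN seq
  -- number of genres in a row = number of commas + 1
  ON seq.n <= 1 + CHAR_LENGTH(t.listed_in) - CHAR_LENGTH(REPLACE(t.listed_in, ',', ''))
GROUP BY genre
ORDER BY total_titles DESC
LIMIT 10;
```

The nested `SUBSTRING_INDEX` calls take the first `n` comma-separated items and then keep only the last of them, which is how MySQL can approximate PostgreSQL's `UNNEST(STRING_TO_ARRAY(...))`. Window functions and recursive CTEs need MySQL 8.0 or later.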
