GitHub - codedbyasim/Email-Spam-Detection-system: A Flask-based web app that detects spam emails/SMS using Multinomial Naive Bayes and TF-IDF. Built with NLP, Scikit-learn, and NLTK for high-accuracy classification.

Email Spam Detection System using Naive Bayes

An intelligent web-based email spam classifier that uses natural language processing and a Multinomial Naive Bayes model to detect spam messages with high accuracy. Built with clean text preprocessing, TF-IDF vectorization, and trained on the well-known spam.csv dataset.

Algorithm Overview

Multinomial Naive Bayes is chosen for its effectiveness on textual data with discrete features like word frequencies and TF-IDF scores. It’s simple, fast, and highly accurate for spam filtering tasks.

Dataset

Source: SMS Spam Collection Dataset (Kaggle)
Features:
- v1: Label (spam or ham)
- v2: Email/SMS text content

Key Features

Text preprocessing pipeline: • Lowercasing, punctuation removal, stopwords removal, and stemming with PorterStemmer

Feature engineering: • TF-IDF vectorisation

Model training: • Multinomial Naive Bayes (via Scikit-learn)

Evaluation metrics: • Accuracy, confusion matrix, classification report

Frontend: • Clean and professional Flask-based web interface to analyze email text

Technologies Used

Python
Flask
Scikit-learn
NLTK
Pandas, NumPy
HTML, CSS, Jinja2

Model Performance

Metric	Value
Accuracy	97%+
Precision	High
Recall	High
F1-Score	Robust

Getting Started

1. Clone the Repository

git clone https://github.com/codedbyasim/Email-Spam-Detection-system.git
cd Email-Spam-Detection-system

2. Install Dependencies

pip install -r requirements.txt

3. Run the Web App

python app.py

Then open your browser and visit: http://localhost:5000

Project Structure

Email-Spam-Detection-system/
│
├── app.py                         # Flask web app
├── vectorizer.pkl                 # TF-IDF Vectorizer
├── spam_classifier_model.pkl      # Trained Naive Bayes model
├── Email_Spam_Detection_system_using_Naive_Bayes.ipynb
├── static/
│   └── style.css                  # Custom frontend styling
├── templates/
│   └── index.html                 # Frontend HTML with Jinja2
├── spam.csv                       # Raw dataset
├── README.md
└── requirements.txt

Author

Muhammad Asim Hanif Software Engineering Student | ML & AI Enthusiast GitHub | LinkedIn

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Email Spam Detection System using Naive Bayes

Algorithm Overview

Dataset

Key Features

Technologies Used

Model Performance

Getting Started

1. Clone the Repository

2. Install Dependencies

3. Run the Web App

Project Structure

Author

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
static		static
templates		templates
Email_Spam_Detection_system_using_Naive_Bayes.ipynb		Email_Spam_Detection_system_using_Naive_Bayes.ipynb
LICENSE		LICENSE
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt
spam.csv		spam.csv
spam_classifier_model.pkl		spam_classifier_model.pkl
vectorizer.pkl		vectorizer.pkl

License

codedbyasim/Email-Spam-Detection-system

Folders and files

Latest commit

History

Repository files navigation

Email Spam Detection System using Naive Bayes

Algorithm Overview

Dataset

Key Features

Technologies Used

Model Performance

Getting Started

1. Clone the Repository

2. Install Dependencies

3. Run the Web App

Project Structure

Author

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages