BMC Product Review Scrapping & Sentiment Analysis

This project performs Web Scrapping & Sentiment Analysis on verified Gartner reviews of popular BMC Software Products, using Python NLP Techniques and Data Visualization.

BMC Product Review Scrapping & Sentiment Analysis is an open source project designed for performing sentiment analysis on customer reviews of BMC Software products scraped from public platforms like Gartner. It leverages Natural Language Processing (NLP) techniques and visualization tools to extract actionable insights from product reviews.

This project is perfect for beginners and intermediate contributors who want hands-on experience with web scraping, NLP, data visualization, and open source collaboration.

It includes:

Web scraping from Gartner Peer Insights
Preprocessing text with NLP
VADER-based sentiment scoring
Charts, word clouds, and Excel exports

🌐 Products Covered

We scrape verified reviews from the following Gartner pages:

Product Name	Review Page
🧠 BMC Helix ITSM	Link
📈 BMC Helix Operations Management	Link
⚙️ TrueSight Server Automation	Link
📊 Control-M	Link

📁 Output Format

Your final analysis should look like this (in Excel or CSV):

Product Name	Review Title	Overall Rating	Industry	Function	Date	Other Vendors	Country	Pros	Cons	Overall Comment	Sentiment

Visuals like pie charts and word clouds should be stored in the outputs/ folder.

📦 Example Directory Structure

BMC-Product-Review-Scrapping-and-Sentiment-Analysis/
│
├── 📂 data/                   # Sample scraped data files (Excel/CSV)
├── 📂 notebooks/             # Jupyter notebooks for quick experimentation
├── 📂 scripts/
│   ├── scraper.py            # Scraper module
│   ├── nlp_preprocessing.py  # Text cleaning + POS + lemmatization
│   ├── sentiment.py          # VADER-based sentiment scoring
│   └── visualize.py          # Wordclouds, pie charts, bar graphs
│
├── 📂 outputs/               # Saved images, processed files
│
├── requirements.txt          # Install dependencies
├── README.md                 # Project overview
├── CONTRIBUTING.md           # Contribution guidelines
├── LICENSE                   # Open-source license
└── .gitignore

🧠 IMP Features

Robust product review scraper for BMC products
Clean text with:- Tokenization Lemmatization POS Tagging Stopword Removal
Sentiment classification using VADER
Generate sentiment reports and dashboards
Modularized structure for easy expansion and contributions
Export analysis to Excel and visual graphs

🚀 Tech Stack

Python 3.x
Selenium / Playwright (for scraping)
NLTK, VADER (for sentiment)
Pandas, Matplotlib, WordCloud
Excel output (xlsxwriter/openpyxl)
Any

🛠️ Getting Started

🔧 Installation

git clone https://github.com/Yash22222/BMC-Product-Review-Scrapping-and-Sentiment-Analysis.git
cd BMC-Product-Review-Scrapping-and-Sentiment-Analysis
pip install -r requirements.txt

📊 Run Sentiment Analysis

Scrape reviews using the scraper.py script.
Clean and preprocess with nlp_preprocessing.py.
Analyze sentiment using sentiment.py.
Visualize using visualize.py.

🤝 How to Contribute (for GSSoC'25)

We welcome contributions from GSSoC contributors and all open source enthusiasts!

🔁 Steps to Contribute

Fork the repository

Clone your fork

git clone https://github.com/YOUR_USERNAME/BMC-Product-Review-Scrapping-and-Sentiment-Analysis.git

Commit your changes

git commit -m "✨ Added sentiment model for XYZ"

Push to your fork

git push origin feature/your-feature-name

Open a Pull Request with a clear explanation.

🧠 Contribution Ideas

Type	Ideas
🔄 Add new BMC products	Expand the scraper
🎨 Streamlit UI	Upload reviews & analyze sentiment
🧾 PDF/Excel report generator	Auto reports for each product
🤖 Add BERT	Use HuggingFace transformer models
🌐 Multi-language support	Translate & analyze non-English reviews
🛠 Docker Support	Add Dockerfile for easy setup

📜 License

This project is licensed under the MIT License.

🙌 Credits

Proudly open for contributions under GSSoC 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BMC Product Review Scrapping & Sentiment Analysis

🌐 Products Covered

📁 Output Format

📦 Example Directory Structure

🧠 IMP Features

🚀 Tech Stack

🛠️ Getting Started

🔧 Installation

📊 Run Sentiment Analysis

🤝 How to Contribute (for GSSoC'25)

🔁 Steps to Contribute

🧠 Contribution Ideas

📜 License

🙌 Credits

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

BMC Product Review Scrapping & Sentiment Analysis

🌐 Products Covered

📁 Output Format

📦 Example Directory Structure

🧠 IMP Features

🚀 Tech Stack

🛠️ Getting Started

🔧 Installation

📊 Run Sentiment Analysis

🤝 How to Contribute (for GSSoC'25)

🔁 Steps to Contribute

🧠 Contribution Ideas

📜 License

🙌 Credits

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages