This project is designed to identify fake social media accounts by analyzing various attributes such as follower count, following count, profile images, background images, the network of followers and followings, posting frequency, and types of posts. The detection model is implemented using the Random Forest algorithm and trained on a large dataset.
The project consists of two main components:
-
Data Collection (Python): Utilizes Tweepy library to gather details of user accounts from Twitter, including follower and following information, profile and background images, and post insights.
-
Detection Model (Python): Implements a Random Forest algorithm to train and predict fake social media accounts based on the collected data. The trained model is saved using Pickle.
-
Web App (Node.js): A graphical user interface is created using Node.js to showcase the detection results. The web app allows users to input a Twitter username and receive a prediction regarding the authenticity of the account.
- Python 3.x
- Node.js
- Tweepy library (
pip install tweepy
) - Scikit-learn library (
pip install scikit-learn
)
-
Clone the repository:
git clone https://github.com/<your-username>/Fake-Account-Detection.git
-
Install Python dependencies:
pip install -r requirements.txt
-
Install Node.js dependencies:
npm install
-
Run the web app:
node index.js
The web app will be accessible at http://localhost:3000 in your web browser.
-
Run the data collection script to gather user details:
python Checker.py
-
Train the detection model:
python model.py
-
Open the web app and input a Twitter username to get the detection results.
- Sujeet Sharma