BPOM Data Scraper

A Python web scraper to collect product registration data from the Indonesian Food and Drug Authority (BPOM) website and store it in a PostgreSQL database.

Features

Scrapes 639,024+ product records from BPOM website
Handles pagination automatically
Stores data in PostgreSQL database
Progress tracking with progress bar
Error handling and retry mechanism
Batch processing for efficient database operations

Installation

Clone the repository
Install dependencies:

pip install -r requirements.txt

Create .env file from .env.example:

cp .env.example .env

Update .env with your database credentials if different

Usage

Run the scraper:

python main.py

The scraper will:

Connect to the PostgreSQL database
Create the necessary table if it doesn't exist
Fetch all product data from BPOM API
Store the data in the database with batch processing
Show progress with a progress bar

Database Schema

The scraper creates a table bpom_products with all fields from the BPOM API response.

Configuration

Edit .env file to configure:

DATABASE_URL: PostgreSQL connection string
BATCH_SIZE: Number of records to insert per batch (default: 100)
REQUEST_TIMEOUT: HTTP request timeout in seconds (default: 30)
MAX_RETRIES: Maximum number of retry attempts (default: 3)

Data Source

Data is scraped from: https://cekbpom.pom.go.id/produk-dt/all

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
config.py		config.py
database.py		database.py
main.py		main.py
requirements.txt		requirements.txt
scraper.py		scraper.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BPOM Data Scraper

Features

Installation

Usage

Database Schema

Configuration

Data Source

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

BPOM Data Scraper

Features

Installation

Usage

Database Schema

Configuration

Data Source

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages