This repository was archived by the owner on May 28, 2026. It is now read-only.

Filip-Podstavec/extract_url_parameters_from_logs

Extract URL parameters from access log

This Python script reads an access log file and extracts the URL parameters most frequently crawled by a specific bot.
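
The idea can be illustrated with a minimal sketch. It is not the code in logparser.py: it assumes the log uses the Apache/Nginx "combined" format, that the bot is identified by a substring of the User-Agent header, and the names `LOG_LINE` and `count_parameters` are hypothetical.

```python
import re
from collections import Counter
from urllib.parse import parse_qsl, urlsplit

# Assumption: "combined" log format, i.e.
# ip - - [time] "METHOD url PROTOCOL" status size "referer" "user-agent"
LOG_LINE = re.compile(
    r'"(?P<method>[A-Z]+) (?P<url>\S+) [^"]*" \d{3} \S+ "[^"]*" "(?P<agent>[^"]*)"'
)


def count_parameters(lines, bot="Googlebot"):
    """Count how often each query parameter name appears in URLs requested by `bot`."""
    counts = Counter()
    for line in lines:
        match = LOG_LINE.search(line)
        # Skip lines that do not parse or that come from another client.
        if match is None or bot.lower() not in match["agent"].lower():
            continue
        query = urlsplit(match["url"]).query
        # keep_blank_values=True also counts parameters such as "?debug=".
        for name, _value in parse_qsl(query, keep_blank_values=True):
            counts[name] += 1
    return counts


# Made-up sample lines, for illustration only.
sample = [
    '66.249.66.1 - - [10/Oct/2023:13:55:36 +0000] "GET /shop?color=red&page=2 HTTP/1.1" 200 512 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"',
    '66.249.66.1 - - [10/Oct/2023:13:55:40 +0000] "GET /shop?page=3 HTTP/1.1" 200 498 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"',
    '203.0.113.7 - - [10/Oct/2023:13:56:01 +0000] "GET /shop?sort=price HTTP/1.1" 200 530 "-" "Mozilla/5.0 (Windows NT 10.0)"',
]

print(count_parameters(sample).most_common())
# → [('page', 2), ('color', 1)]
```

The third sample line is ignored because its User-Agent does not contain the bot name, so `sort` never appears in the result.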

If you don't have an access log, you can find some on Google: https://www.google.com/search?q=filetype%3Alog+inurl%3Aaccess

Example output

Spreadsheet: https://docs.google.com/spreadsheets/d/1ExndYXQ54QJH0SrEw-RIPGHb7yEECBjLTHgxwRuP52k/edit?usp=sharing

Example of the output CSV

Setup

pip install -r requirements.txt

Example usage

Analyze an access log and export the output to the parameters.csv file:

python logparser.py C:/my_folder/access-log.log

Analyze an access log and export the output to a custom CSV file:

python logparser.py C:/my_folder/access-log.log --output=example.csv
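
For readers curious how such a command line could be wired up, here is a hedged sketch using argparse and csv from the standard library. It only mirrors the two invocations above (a positional log path and an optional `--output` that falls back to parameters.csv); the real logparser.py may be structured differently, and `build_parser`, `write_counts`, and the column names `parameter` and `hits` are hypothetical.

```python
import argparse
import csv


def build_parser():
    """Build a parser that mirrors the two example invocations."""
    parser = argparse.ArgumentParser(
        description="Extract crawled URL parameters from an access log."
    )
    parser.add_argument("logfile", help="path to the access log file")
    parser.add_argument(
        "--output",
        default="parameters.csv",  # used when --output is not given
        help="path of the CSV file to write",
    )
    return parser


def write_counts(counts, path):
    """Write {parameter: hits} to a CSV file, most crawled parameters first."""
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(["parameter", "hits"])  # hypothetical column names
        ranked = sorted(counts.items(), key=lambda item: item[1], reverse=True)
        for name, hits in ranked:
            writer.writerow([name, hits])


# Parse an explicit argument list instead of sys.argv so the sketch runs anywhere.
args = build_parser().parse_args(
    ["C:/my_folder/access-log.log", "--output=example.csv"]
)
print(args.logfile, args.output)
```

Calling `write_counts({"page": 2, "color": 1}, args.output)` would then produce a header row followed by one row per parameter.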

About

Python script that extracts URLs crawled by a specific bot and analyzes the crawled GET parameters.
