Implement ETL pipeline for OpenAlex data standardization by hadimobini00-ship-it · Pull Request #7 · PRAISELab-PicusLab/bibliometrix-python

hadimobini00-ship-it · 2026-05-30T15:04:00Z

Project Summary

This Pull Request introduces a modular ETL (Extract, Transform, Load) pipeline designed to retrieve academic data from the OpenAlex API and transform it into the standardized Web of Science format.

Key Components

api_retriever.py: Handles the extraction of raw data from the OpenAlex API.
mapping.py: Contains the core logic for schema conversion and field mapping.
dispatcher.py: Manages the flow and routing of data throughout the pipeline.
validator.py: Ensures data integrity and format compliance before final output.
main.py: The entry point for executing the complete data pipeline.

How to Run the Code

Ensure all dependencies are installed in your Python environment.
Navigate to the project directory:
cd Hardware & Software_2nd_semester
Execute the pipeline using:
python main.py

Expected Results

The pipeline processes queries ("machine learning"), validates the structure against Web of Science requirements, and outputs the formatted research data.

Additional Notes

This implementation ensures modularity, allowing for easier maintenance and testing of individual pipeline stages.
Please review the mapping logic in mapping.py to ensure it meets the latest requirements for the target schema.

Add ETL pipeline for OpenAlex -> WoS standardization

hadimobini00-ship-it added 3 commits May 30, 2026 16:53

Add files via upload

75b9bce

Add ETL pipeline for OpenAlex -> WoS standardization

Add files via upload

47c2843

Add files via upload

d93c591

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement ETL pipeline for OpenAlex data standardization#7

Implement ETL pipeline for OpenAlex data standardization#7
hadimobini00-ship-it wants to merge 3 commits into
PRAISELab-PicusLab:mainfrom
hadimobini00-ship-it:main

hadimobini00-ship-it commented May 30, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

hadimobini00-ship-it commented May 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Project Summary

Key Components

How to Run the Code

Expected Results

Additional Notes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

hadimobini00-ship-it commented May 30, 2026 •

edited

Loading