Webscraping from Local HTML File

Some computation servers block users from pulling a URL directly, so we have to download the HTML page first and then scrape the saved file locally to retrieve the data.
Downloading a static copy of a webpage also lets us retrieve the data even if the page is later removed. This makes the approach a good fit for a "stand still in time" analysis, where the data is fixed as of the download date.
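
As a rough illustration of scraping a saved page, the sketch below reads a local HTML file and collects the text of every table cell, row by row. It uses only Python's standard-library `html.parser`, so it needs no network access. The names `TableTextParser` and `scrape_local_tables` are hypothetical helpers made up for this example, and the assumption that the data of interest sits in `<table>` rows may not hold for every page.

```python
from html.parser import HTMLParser
from pathlib import Path


class TableTextParser(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []     # finished rows, each a list of cell strings
        self._row = None   # cells of the row currently being read
        self._cell = None  # text fragments of the cell currently being read

    def _close_cell(self):
        # Join the collected fragments and store the cell in the open row.
        if self._cell is not None and self._row is not None:
            self._row.append("".join(self._cell).strip())
        self._cell = None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._close_cell()  # tolerate a missing closing tag
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th"):
            self._close_cell()
        elif tag == "tr" and self._row is not None:
            self._close_cell()
            self.rows.append(self._row)
            self._row = None


def scrape_local_tables(path):
    """Parse a saved HTML file from disk and return its table rows."""
    html = Path(path).read_text(encoding="utf-8", errors="replace")
    parser = TableTextParser()
    parser.feed(html)
    parser.close()
    return parser.rows
```

Calling `scrape_local_tables("NRP_1040_Study_Webpage.html")` on a page saved next to the script would return a list of rows, each a list of strings. Pages with nested tables or data outside tables would need a different extraction rule.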
If we want a web dashboard to update automatically, an external API such as the Bureau of Labor Statistics API (link) is a better fit. In that case, we have to request access to the API.
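
For comparison with the local-file workflow, here is a minimal sketch of what an API request might look like. It assumes the BLS Public Data API v2 endpoint and its JSON fields (`seriesid`, `startyear`, `endyear`, `registrationkey`); check the current BLS documentation before relying on them. The helper names `build_bls_request` and `fetch_bls` and the placeholder key are made up for this example.

```python
import json
import urllib.request

# Assumed endpoint of the BLS Public Data API v2; verify against BLS docs.
BLS_URL = "https://api.bls.gov/publicAPI/v2/timeseries/data/"


def build_bls_request(series_ids, start_year, end_year, api_key=None):
    """Build (but do not send) a POST request for one or more BLS series."""
    payload = {
        "seriesid": list(series_ids),
        "startyear": str(start_year),
        "endyear": str(end_year),
    }
    if api_key:
        # The key is what you receive after requesting access.
        payload["registrationkey"] = api_key
    return urllib.request.Request(
        BLS_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def fetch_bls(series_ids, start_year, end_year, api_key=None):
    """Send the request and return the decoded JSON response."""
    req = build_bls_request(series_ids, start_year, end_year, api_key)
    with urllib.request.urlopen(req, timeout=30) as resp:
        return json.load(resp)
```

A dashboard job could call `fetch_bls(["CUUR0000SA0"], 2020, 2021, api_key="YOUR_KEY")` on a schedule, where the series ID and key are placeholders. This only works on a machine that is allowed to make outbound requests, which is exactly the restriction the local-file approach works around.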

