Open
Description
See an example of the saved HTML in S3. We should ask them to whitelist our IP? If they won't answer, and this is an automated challenge, we should have a subset of scrapers that makes requests every 24 hours, instead of every single hour
This is making our coverage spotty. For example, from the 5 most recent opinions we only have 2
I was trying to run ./manage.py cl_back_scrape_citations --courts juriscraper.opinions.united_states.state.minn --backscrape-start=2024/10/01 --backscrape-end=2024/10/20 --verbosity 3
for #858 (comment) and noticed suspicious 0 results
Metadata
Metadata
Assignees
Type
Projects
Status