This repository was archived by the owner on Mar 1, 2023. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 11
Spider error when refreshing download numbers #234
Copy link
Copy link
Open
Labels
bugSomething isn't workingSomething isn't workingspiderIssue with the web crawlerIssue with the web crawler
Description
Refreshing article 19143
Error requesting article metrics. Retrying: HTTPSConnectionPool(host='www.biorxiv.org', port=443): Max retries exceeded with url: /content/early/2016/11/30/061689.article-metrics (Caused by NewConnectionError('<urllib3.connection.VerifiedHTTPSConnection object at 0x7f942ac57e48>: Failed to establish a new connection: [Errno -2] Name or service not known',))
Error requesting article metrics. Retrying: HTTPSConnectionPool(host='www.biorxiv.org', port=443): Max retries exceeded with url: /content/early/2016/11/30/061689.article-metrics (Caused by NewConnectionError('<urllib3.connection.VerifiedHTTPSConnection object at 0x7f942ac05c18>: Failed to establish a new connection: [Errno -2] Name or service not known',))
Error requesting article metrics. Retrying: HTTPSConnectionPool(host='www.biorxiv.org', port=443): Max retries exceeded with url: /content/early/2016/11/30/061689.article-metrics (Caused by NewConnectionError('<urllib3.connection.VerifiedHTTPSConnection object at 0x7f942ac05860>: Failed to establish a new connection: [Errno -2] Name or service not known',))
Error AGAIN requesting article metrics. Bailing: HTTPSConnectionPool(host='www.biorxiv.org', port=443): Max retries exceeded with url: /content/early/2016/11/30/061689.article-metrics (Caused by NewConnectionError('<urllib3.connection.VerifiedHTTPSConnection object at 0x7f942ac05978>: Failed to establish a new connection: [Errno -2] Name or service not known',))
Traceback (most recent call last):
File "spider.py", line 1136, in <module>
full_run(spider)
File "spider.py", line 983, in full_run
spider.refresh_article_stats(collection, config.refresh_category_cap)
File "spider.py", line 330, in refresh_article_stats
self.save_article_stats(article_id, stat_table)
File "spider.py", line 493, in save_article_stats
for i, record in enumerate(stats):
TypeError: 'NoneType' object is not iterableMetadata
Metadata
Assignees
Labels
bugSomething isn't workingSomething isn't workingspiderIssue with the web crawlerIssue with the web crawler