Skip to content

Releases: MITLibraries/browsertrix-harvester

v2.0 - Pivot to full HTML records versus metadata records

11 Dec 18:46
284cafe

Choose a tag to compare

What's Changed

  • USE 240 - prep work for staff directory in mitlibwebsite source by @ghukill in #52
  • USE 258 - Rework harvester to return "records" vs "metadata records" by @ghukill in #53
  • USE 272 - Add response headers to output records by @ghukill in #54

Full Changelog: v1.4...v2.0

v1.4 Handle empty crawls

31 Oct 18:36
1e61e73

Choose a tag to compare

What's Changed

Full Changelog: v1.3...v1.4

v1.3 - Initial Production Release

23 Oct 14:38
d186380

Choose a tag to compare

NOTE: it is known that crawls resulting in zero seed URLs will throw an error. This release will allow for a "full" harvest in production, with a fix for this coming soon, at which time we'll enable daily harvests.

What's Changed

  • TIMX 557 and misc updates by @ghukill in #44
  • TIMX 562 - Handle crawls with different pages and CDX data by @ghukill in #45
  • USE-93 - Support pre-crawl, sitemap parsing by @ghukill in #46
  • USE 97 - Generate delete metadata records by @ghukill in #47
  • USE 93 (contd) - Streamline sitemap CLI arg by @ghukill in #48
  • USE 86 - Remove crawler workers defaults by @ghukill in #49
  • In 1524 - 2025-10 Maintenance by @jonavellecuerdo in #50

New Contributors

Full Changelog: v1.2.1...v1.3

v1.2.1 - Update Deployment Workflows

06 Oct 13:13
17eaef1

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v1.2...v1.2.1

v1.2 - Support JSONLines output

19 Aug 15:00
3b79a44

Choose a tag to compare

What's Changed

  • IN-1240 - Replace pipenv check with pip-audit by @ghukill in #41
  • TIMX 542 - support JSONLines output by @ghukill in #42

Full Changelog: v1.1.1...v1.2

Maintenance updates

20 Sep 15:58
38f6316

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v1.1.0...v1.1.1

v1.1.0 Align with Browsertrix-Crawler 12.x

06 Nov 15:20
50b8009

Choose a tag to compare

What's Changed

  • Align btrix CLI arguments for v0.12.0 release by @ghukill in #22

Full Changelog: v1.0.0...v1.1.0

Initial Release

16 Oct 20:10
7e4a5e1

Choose a tag to compare

Initial production release.

What's Changed

Full Changelog: https://github.com/MITLibraries/browsertrix-harvester/commits/v1.0.0