IN 1438 - v1 of tag exporting notebook #2
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Purpose and background context
This PR turns the stubbed
tag_export.pynotebook into a functional notebook. This is 90% a port from the POC notebook intimdex-notebooksand 10% improvement on that. This first pass is establishing a baseline similar to the POC to continue testing in a deployed context.There are currently no meaningful tests. I'm feeling a little uncertain what kind of tests would be valuable for such a notebook, and would like to return to that in a future PR. Having this notebook functional will unblock us to test the notebook in a deployed fashion: configurations, permissions, memory/CPU resources tuning, etc.
How can a reviewer manually see the effects of these changes?
With those caveats above, it is functional as-is!
1- Set Dev1 AWS
TimdexManagerscredentials in terminal and set env vars:2- Start notebook with Makefile, that opens it in edit mode:
3- View as "app" mode which is consistent with how users will see it. Click this button in the lower-right:
4- Experiment with MARC tags (e.g. try
650,655or918,985,900), and limits, etc! Just be aware that omitting the limit, or setting it large, can take quite awhile. I think the full Alma "current records" is about 3.9m, and takes about 10 minutes to download and parse tags.As I type this... realizing we may have an opportunity for caching records we've downloaded; maybe For a future ticket 😎.
5- Perform an export. Try a small set like:
Then try to export it using the "Download" UI in the lower-right:
Includes new or updated dependencies?
YES
Changes expectations for external applications?
NO
What are the relevant tickets?