-
Notifications
You must be signed in to change notification settings - Fork 17
Update phytoplankton dataset (chl_pp) #271
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
BBeltz1
commented
Jan 2, 2026
- Refactor get function to accommodate new datafile
- Update indicator and build package
- Re-knit comparison document
|
It looks like |
|
No, it exceeds Github's file size limit. |
jcaracappa1
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This looks good. Will need to update for the final bit of 2025 before IR
If we subset just the columns that are used in ecodata its only 11MB vs 168MB |
|
We could but then the raw input would essentially be a duplicate of the dataset. We would have to either manipulate the input file manually before uploading to Github or write new code elsewhere in the package that subsets the input file. This is what the get function is doing already. The only difference would be that the "input file" would be wide and |
Actually, they would be identical except the manipulated input file would have the original column names rather than the standardized column names. Not wide vs long. Otherwise, they would be identical files. |
|
Ok then it's probably fine to keep it out in that case |
|
I am running the remainder 2025 data now and will let you know when it is complete. It will still be considered preliminary because the final data may not be available until February. I don't think the final data will change the narrative, but there will be another update. |
|
@BBeltz1 |
|
@khyde just noting that i've seen your message. i will process the update as soon as i can. thanks kim! |