We're getting some new formats in DataONE: https://gist.github.com/amoeba/d4771fc01d4f8f66c44202856d078e8e

- Update the extension <-> formatId map in `guess_format_id`
- Make sure ipynb -> json is in there, since it's non-obvious
- Can we add MATLAB version detection to the algorithm? I already did this for NetCDF
- What can we do about RAW files? Are there a few common RAW extensions?
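
For discussion, here is a rough sketch of how the map lookup and the MATLAB check could fit together. It is written in Python purely for illustration and is not the existing `guess_format_id` code. The names `EXTENSION_FORMAT_IDS`, `DEFAULT_FORMAT_ID`, `mat_file_version`, and `guess_format_id_sketch` are hypothetical, and the formatId strings are placeholders that would need to be checked against the gist above. The version check assumes that Level 5 and 7.3 MAT-files begin with a text header of the form `MATLAB <version> MAT-file`; older Level 4 files have no such header, so it returns `None` for them.

```python
import os

# Hypothetical extension -> formatId entries (placeholders, not the real map).
EXTENSION_FORMAT_IDS = {
    "ipynb": "application/json",  # Jupyter notebooks are JSON documents
    "json": "application/json",
}

# Assumed fallback for extensions that are not in the map.
DEFAULT_FORMAT_ID = "application/octet-stream"


def mat_file_version(path):
    """Return the version string from a MAT-file text header, or None.

    Assumes the file starts with text like b"MATLAB 5.0 MAT-file, ...".
    Files without that header (e.g. Level 4 MAT-files) yield None.
    """
    with open(path, "rb") as f:
        header = f.read(116)  # length of the descriptive text field

    prefix = b"MATLAB "
    suffix = b" MAT-file"
    if not header.startswith(prefix):
        return None
    end = header.find(suffix)
    if end == -1:
        return None
    return header[len(prefix):end].decode("ascii", errors="replace")


def guess_format_id_sketch(path):
    """Look up a formatId by file extension, case-insensitively."""
    ext = os.path.splitext(path)[1].lstrip(".").lower()
    return EXTENSION_FORMAT_IDS.get(ext, DEFAULT_FORMAT_ID)
```

If something like this is adopted, the result of `mat_file_version` could be used to pick between version-specific MATLAB formatIds, the same way the NetCDF check works today. RAW files are left out of the sketch because the set of extensions to support is still an open question.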