It would make it possible to filter them out when computing metrics (files, pages) or when browsing the catalog.
However, I'm wondering if we should have 2 flags, one for "suitable as GT for layout recognition" and one for "suitable as GT for text recognition".
Some datasets are suitable for training text recognition models but not layout recognition models.
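
To make the two-flag idea a bit more concrete, here is a rough sketch of how it could look. Everything in it is hypothetical: the `Dataset` class, the field names `gt_layout` and `gt_text`, the `totals` helper and the sample numbers are only there for illustration and do not reflect the actual catalog schema.

```python
from dataclasses import dataclass


@dataclass
class Dataset:
    # Hypothetical catalog entry; field names are assumptions, not the real schema.
    name: str
    files: int
    pages: int
    gt_layout: bool  # suitable as GT for layout recognition
    gt_text: bool    # suitable as GT for text recognition


def totals(datasets, *, require_layout=False, require_text=False):
    """Sum files and pages over the datasets that carry the required flags."""
    selected = [
        d for d in datasets
        if (d.gt_layout or not require_layout)
        and (d.gt_text or not require_text)
    ]
    return {
        "datasets": len(selected),
        "files": sum(d.files for d in selected),
        "pages": sum(d.pages for d in selected),
    }


# Made-up sample entries, only to exercise the filter.
catalog = [
    Dataset("a", files=10, pages=100, gt_layout=True, gt_text=True),
    Dataset("b", files=5, pages=40, gt_layout=False, gt_text=True),
    Dataset("c", files=2, pages=8, gt_layout=False, gt_text=False),
]

text_totals = totals(catalog, require_text=True)
layout_totals = totals(catalog, require_layout=True)
```

With two independent flags, dataset "b" still counts toward the text recognition metrics while being left out of the layout ones, which a single flag could not express.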