Add a flag for datasets designed only for layout recognition

It would make it possible to filter them out when computing metrics (files, pages) or when browsing the catalog

However, I'm wondering if we should have 2 flags, one for "suitable as GT for layout recognition" and one for "suitable as GT for text recognition".

Some datasets as suitable to train text recognition models but not layout recognition models.