The whole dataset is stored on Hugging Face and in Google Drive. Perhaps we can describe a testing process for a new model, or include a local set of about 10 bill texts that any summarization model can be tested against. That set should include the three bills described in the Wiki.
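As a starting point for discussion, here is a minimal sketch of what a local test over a small set of bill texts could look like. Everything in it is an assumption rather than existing project code: the function names `load_bills` and `smoke_test`, the idea that the sample bills live as `.txt` files in one directory, and the pass criterion (the summary is non-empty and shorter than the bill). A real testing process would probably also compare against reference summaries.

```python
from pathlib import Path
from typing import Callable, Dict


def load_bills(bill_dir: str) -> Dict[str, str]:
    """Read every .txt file in bill_dir into a {file stem: text} mapping.

    The directory layout is hypothetical; adjust to wherever the sample
    bills end up being stored.
    """
    return {
        path.stem: path.read_text(encoding="utf-8")
        for path in sorted(Path(bill_dir).glob("*.txt"))
    }


def smoke_test(
    summarize: Callable[[str], str], bills: Dict[str, str]
) -> Dict[str, bool]:
    """Run summarize on each bill and report a basic sanity check per bill.

    A bill passes if its summary is non-empty and strictly shorter than
    the bill text. This is only a placeholder criterion.
    """
    results = {}
    for bill_id, text in bills.items():
        summary = summarize(text).strip()
        results[bill_id] = bool(summary) and len(summary) < len(text)
    return results
```

Any model could then be plugged in by wrapping it in a function that takes the bill text and returns a summary string, for example `smoke_test(my_model_summarize, load_bills("sample_bills"))`, where both `my_model_summarize` and the `sample_bills` directory are placeholders.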