The corpora.json should be updated periodically running a GitHub action that would create a pull request when corpus data changed.