I am looking to perform statistical analysis on the Greek corpus' annotation of paragraphs (type and subtype tags of divs). I have found quit a large variety of tags that do not seem to be clearly defined, as well as some spelling mistakes.
Is there a glossary of terms?
I am attaching a JSON file that you might find useful in case you want to create one. It contains all type - subtype pairs as well as an example path in data/ for each.
div_hierarchy.json