How may we improve the encoding of the source(s) of a DraCor TEI file in the sourceDesc? #171
Replies: 4 comments
1. Scenarios of TEI file creation for DraCor
Scenario A
The DraCor TEI file is based on a digital text (such as provided by Project Gutenberg, TextGrid, or CENETON) based on a printed edition.
Note: Currently, this is the most common scenario throughout the DraCor corpora.
Scenario B
The DraCor TEI file is based on a digital scan of a printed edition. The book scan has been published and licensed online (e.g. by a national library or Google Books).
Note: This scenario seems to be one of the more frequent ones in DraCor.
Scenario C
The DraCor TEI file is based on a digital text, based on a book scan which has been published and licensed online and is based on a printed edition.
Note: This scenario pertains to more than 100 dramatic texts, at least from the TransLatin project.
Scenario D
The DraCor TEI file is based on a manuscript without an intermediary (digital) source.
Note: This scenario seems to have been rare in TEI file creation so far. However, starting from 2026, ca. 40 dramatic texts will be added to NeoLatDraCor via the Neo-Latin School Drama and Ljubljana Jesuit Drama projects. These texts are based on manuscripts without an intermediary (critical) text edition and without an intermediary digital source.
Scenario E
The DraCor TEI file is based on a born-digital text.
Scenario F
The DraCor TEI file is based on a printed edition without an intermediary (digital) source.
(Please let me know if you would like to revise a scenario or add another one.)
2. Current source type terminology
Terminology
<bibl type="digitalSource"> refers to a digital surrogate of a ‘physical’ print or manuscript source. A digital source may be a born-digital (critical) edition or a digital (critical) text edition based on a print or manuscript source. Digitisations of a print or manuscript published with an identifier and/or under a certain licence constitute a digital source.
<bibl type="originalSource"> refers to a print or manuscript source. The original source may be the basis of the digital source of the TEI file or the direct source of the TEI file if the TEI file was created from scratch.
(For more details, see the DraCorODD or the updated version for better clarification.)
Observations on current usage
Scenarios
Scenario A
Example(s): Gottsched’s “Der sterbende Cato” (GerDraCor)
Scenario B
The DraCor TEI file is based on a digital scan of a printed edition. The book scan has been published and licensed online (e.g. by a national library or Google Books).
Observation: Some DraCor TEI files, e.g. this one, link to a digital surrogate of a printed text edition within the <bibl type="originalSource"> which does not match the definitions of “digital source” and “original source”.
Scenario C
The DraCor TEI file is based on a digital text, based on a book scan which has been published and licensed online and is based on a printed edition.
Problem: The current schema does not allow more than one <bibl type="digitalSource"> (see this issue).
Scenario D
The DraCor TEI file is based on a manuscript without an intermediary (digital) source.
Scenario E
The DraCor TEI file is based on a born-digital text.
Scenario F
The DraCor TEI file is based on a printed edition without an intermediary (digital) source.
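To make the current terminology more concrete, here is a minimal sketch of how Scenario A might be encoded with the two existing types. All names, titles, and URLs are placeholders, and nesting the original source inside the digital source is an assumption based on how many existing DraCor files appear to be structured; the DraCor ODD remains the authoritative reference.

```xml
<sourceDesc>
  <!-- Digital text the TEI file was derived from (placeholder values) -->
  <bibl type="digitalSource">
    <name>[name of the digital provider]</name>
    <idno type="URL">https://example.org/digital-text</idno>
    <!-- Printed edition underlying the digital text (assumed nesting) -->
    <bibl type="originalSource">
      <title>[title of the printed edition]</title>
    </bibl>
  </bibl>
</sourceDesc>
```

Going by the definitions above, the URL of a published book scan (Scenario B) would also belong in the digitalSource entry, not in the originalSource entry.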
3. Alternative source type terminology A (under discussion)
Background
Based on the discussion with @lb42, we may consider using the terms “copy-text” and “origin”.
Considerations
Tentative definitions of the alternative source types
<bibl type="copy-text"> always refers to the direct source of the DraCor TEI file, be it digital or non-digital. If the DraCor TEI file refers to a digital source based on a ‘physical’ source, i.e. a printed text edition or manuscript, the ‘physical’ origin is given by <bibl type="origin">.
Scenarios
Scenario A
Example(s): Gottsched’s “Der sterbende Cato” (GerDraCor)
Scenario B
The DraCor TEI file is based on a digital scan of a printed edition. The book scan has been published and licensed online (e.g. by a national library or Google Books).
Scenario C
The DraCor TEI file is based on a digital text, based on a book scan which has been published and licensed online and is based on a printed edition.
Comment: The source type terminology of “copy-text” and “origin” would not cover the intermediary Google Books scan. But we may use <relatedItem> to encode it (see this comment by @lb42).
Scenario D
The DraCor TEI file is based on a manuscript without an intermediary (digital) source.
Comment: According to the tentative definitions, the manuscript would be classified as “copy-text”, and the type “origin” would not be relevant here as there is no intermediary digital source.
Scenario E
The DraCor TEI file is based on a born-digital text.
Comment: According to the tentative definitions, the born-digital text would be classified as “copy-text”, and the type “origin” would not be relevant here as there is no ‘physical’ source.
Scenario F
The DraCor TEI file is based on a printed edition without an intermediary (digital) source.
Comment: According to the tentative definitions, the printed text edition would be classified as “copy-text”, and the type “origin” would not be relevant here as there is no intermediary digital source.
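As a rough illustration of the tentative definitions, Scenario C might be sketched as follows. This is not an agreed encoding: the placement of the origin entry as a sibling of the copy-text entry, the type value on relatedItem, and all titles and URLs are hypothetical and only serve to show where each source could go.

```xml
<sourceDesc>
  <!-- Direct source of the TEI file: the digital text (placeholder values) -->
  <bibl type="copy-text">
    <title>[title of the digital text]</title>
    <idno type="URL">https://example.org/digital-text</idno>
    <!-- Intermediary book scan; the type value is a hypothetical label -->
    <relatedItem type="intermediary">
      <bibl>
        <title>[title of the book scan]</title>
        <idno type="URL">https://example.org/book-scan</idno>
      </bibl>
    </relatedItem>
  </bibl>
  <!-- 'Physical' origin: the printed edition (sibling placement is an assumption) -->
  <bibl type="origin">
    <title>[title of the printed edition]</title>
  </bibl>
</sourceDesc>
```

For Scenarios D to F, only the copy-text entry would remain under this sketch, since there is no separate origin to record.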
4. Aim and scope of the DraCor source type terminology
Against the backdrop of the current and the discussed alternative source type terminologies, I think we should consider the following questions before implementing a new source type terminology.
@lehkost, @peertrilcke, @cmil, what do you think?
Overview
As discussed during the DraCor Summit and in some GitHub issues (#107, #161), we may revise the encoding of the source(s) of a DraCor TEI file in the <sourceDesc> element. The aim is to define the different source types more clearly and to cover the various ways in which DraCor TEI files are created across the DraCor corpora. For instance, while most of the DraCor TEI files have been created from existing digital sources, some are now created from scratch.
We may exchange ideas in this GitHub discussion and later document the discussion results in the related issues rather than have lengthy conversations in different GitHub issues.
Generally, in our previous discussions, we have seen two areas that need improvement:
We may proceed as follows: