TEI by ExampleModule 6: Primary SourcesRon Van den BrandenEdward VanhoutteMelissa TerrasAssociation for Literary and Linguistic Computing (ALLC)Centre for Data, Culture and Society, University of Edinburgh, UKCentre for Digital Humanities (CDH), University College London, UKCentre for Computing in the Humanities (CCH), King’s College London, UKCentre for Scholarly Editing and Document Studies (CTB) , Royal Academy of Dutch Language and Literature, BelgiumCentre for Scholarly Editing and Document Studies (CTB)Royal Academy of Dutch Language and LiteratureKoningstraat 189000 GentBelgiumctb@kantl.beEdward VanhoutteMelissa TerrasCentre for Scholarly Editing and Document Studies (CTB) , Royal Academy of Dutch Language and Literature, BelgiumCentre for Scholarly Editing and Document Studies (CTB) , Royal Academy of Dutch Language and Literature, BelgiumGentCentre for Scholarly Editing and Document Studies (CTB)Royal Academy of Dutch Language and LiteratureKoningstraat 189000 GentBelgium
Licensed under a Creative Commons Attribution ShareAlike 3.0 License
9 July 2010TEI by Example.Edward VanhoutteeditorRon Van den BrandeneditorMelissa Terraseditor
TEI by Example offers a series of freely available online tutorials walking individuals through the different stages in marking up a document in TEI (Text Encoding Initiative). Besides a general introduction to text encoding, step-by-step tutorial modules provide example-based introductions to eight different aspects of electronic text markup for the humanities. Each tutorial module is accompanied with a dedicated examples section, illustrating actual TEI encoding practise with real-life examples. The theory of the tutorial modules can be tested in interactive tests and exercises.
en-GBintegrated examples in a single file
Jeremy Bentham: manuscript JB/116/010/001
This manuscript page was written by the philosopher and jurist Jeremy Bentham (1748–1832).
Since this is a prose text, the basic structural units are encoded as paragraphs (p), with line breaks encoded as lb where they occur. Marginal notes are encoded with the note element; the note occurring on the sixth line in this example contains a simple deletion (the final word disorder), which is marked with the del element. This manuscript contains many deletions and additions. Some are simple, such as the addition of the word still in the phrase the same barbarity is still shown on line 6. This is indicated in the encoding by wrapping the added content in a add element. More often, deletions and additions occur in combination, in which case the transcriber tries to reflect their order in the nesting of del and add elements. For example:
forth, turned adrift and thought no more of out of
This fragment is marked as an addition. Yet, Bentham had emended this addition by adding the phrase turned adrift (as a second-level addition). Later, he canceled this addition by deleting it again: that’s why this phrase is encoded inside add, with a nesting del, indicating that this added text had been deleted entirely. Further, at the end of this fragment, another addition is indicated with an add element. Again, this entire second-level addition had been deleted. Yet, since the encoder could not decipher the deleted text anymore, this is indicated by the empty gap element, which signals that text was present on the manuscript, but left out from the transcription. In order to record why the transcriber had decided to omit this text, a reason attribute could have been provided on gap (with a value such as illegible).
Where text could still be transcribed, but the encoder is not certain of the reading, this reading is recorded in an unclear element, as is the case with the word that, occurring in a deletion on the last but one line.
Finally, when the encoder spotted obvious mistakes, these have been identified with the sic element, as is the case with the word compleat. The encoder could equally have provided a correction by wrapping both the incorrect form (sic) and correction (corr) inside a choice element:
Bentham, Jeremy. 1802. Manuscript JB/116/010/001. Manuscript encoded and made available by the Transcribe Bentham project at .Whitman, Walt1890. After the Argument. Manuscript encoded and made available by the Walt Whitman Archive at .