<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader>
    <fileDesc>
      <titleStmt>
        <title type="main">TEI by Example</title>
        <title type="sub">Module 7: Critical Editing</title>
        <author xml:id="RvdB">Ron Van den Branden</author>
        <editor xml:id="EV">Edward Vanhoutte</editor>
        <editor xml:id="MT">Melissa Terras</editor>
        <sponsor>Association for Literary and Linguistic Computing (ALLC)</sponsor>
        <sponsor>Centre for Data, Culture and Society, University of Edinburgh, UK</sponsor> 
        <sponsor>Centre for Digital Humanities (CDH), University College London, UK</sponsor>
        <sponsor>Centre for Computing in the Humanities (CCH), King’s College London, UK</sponsor>
        <sponsor>Centre for Scholarly Editing and Document Studies (CTB) , Royal Academy of Dutch Language and Literature, Belgium</sponsor>
        <funder>
          <address>
            <addrLine>Centre for Scholarly Editing and Document Studies (CTB)</addrLine>
            <addrLine>Royal Academy of Dutch Language and Literature</addrLine>
            <addrLine>Koningstraat 18</addrLine>
            <addrLine>9000 Gent</addrLine>
            <addrLine>Belgium</addrLine>
          </address>
          <email>ctb@kantl.be</email>
        </funder>
        <principal>Edward Vanhoutte</principal>
        <principal>Melissa Terras</principal>
      </titleStmt>
      <publicationStmt>
        <publisher>Centre for Scholarly Editing and Document Studies (CTB) , Royal Academy of Dutch Language and Literature, Belgium</publisher>
        <distributor>Centre for Scholarly Editing and Document Studies (CTB) , Royal Academy of Dutch Language and Literature, Belgium</distributor>
        <pubPlace>Gent</pubPlace>
        <address>
          <addrLine>Centre for Scholarly Editing and Document Studies (CTB)</addrLine>
          <addrLine>Royal Academy of Dutch Language and Literature</addrLine>
          <addrLine>Koningstraat 18</addrLine>
          <addrLine>9000 Gent</addrLine>
          <addrLine>Belgium</addrLine>
        </address>
        <availability status="free">
          <p>Licensed under a <ref target="http://creativecommons.org/licenses/by-sa/3.0/">Creative Commons Attribution ShareAlike 3.0 License</ref>
                    </p>
        </availability>
        <date when="2010-07-09">9 July 2010</date>
      </publicationStmt>
      <seriesStmt>
        <title>TEI By Example.</title>
        <respStmt>
          <name>Edward Vanhoutte</name>
          <resp>editor</resp>
        </respStmt>
        <respStmt>
          <name>Ron Van den Branden</name>
          <resp>editor</resp>
        </respStmt>
        <respStmt>
          <name>Melissa Terras</name>
          <resp>editor</resp>
        </respStmt>
      </seriesStmt>
      <sourceDesc>
        <p>Digitally born</p>
      </sourceDesc>
    </fileDesc>
    <encodingDesc>
      <projectDesc>
        <p>TEI By Example offers a series of freely available online tutorials walking individuals through the different stages in marking up a document in TEI (Text Encoding Initiative). Besides a general introduction to text encoding, step-by-step tutorial modules provide example-based introductions to eight different aspects of electronic text markup for the humanities. Each tutorial module is accompanied with a dedicated examples section, illustrating actual TEI encoding practise with real-life examples. The theory of the tutorial modules can be tested in interactive tests and exercises.</p>
      </projectDesc>
    </encodingDesc>
    <profileDesc>
      <langUsage>
        <language ident="en-GB">en-GB</language>
      </langUsage>
    </profileDesc>
    <revisionDesc>
      <change when="2020-07-02" who="#RvdB">proofing corrections</change>
      <change when="2020-06-12" who="#RvdB">technical revision</change>
      <change when="2010-07-13" who="#RvdB">
                <list>
                    <item>added distinction <gi>gi</gi> — <tag>gi scheme="..."</tag> — <gi>tag</gi>
                    </item>
        <item>final spellcheck</item>
                </list>
            </change>
      <change when="2010-07-09" who="#RvdB">release</change>    
      <change when="2009-09-28" who="RvdB">authoring</change>
    </revisionDesc>
  </teiHeader>
  <text xml:id="TBED07v00" type="tutorials">
    <body>
            <div xml:id="variation">
        <head>Textual Variation</head>
        <p>Similar to all other TEI modules, the elements and attributes defined in the TEI <ident type="module">textcrit</ident> module can be used for the encoding of existing source materials (be they in print or digital form), or the encoding of electronic documents from scratch. However, the <emph>use</emph> of this module in the context of electronic critical editing, adds another perspective to this traditional authorial/editorial angle (see <ref type="bibl" target="#vanhoutte2009">Vanhoutte and Van den Branden 2009</ref>). Electronic or digital critical editions can be created from scratch either by encoding different primary source materials straight <emph>as</emph> a critical edition, or by generating the edition from previously encoded electronic transcriptions of those materials as independent texts in their own right. Therefore, the tags defined in the TEI <ident type="module">textcrit</ident> module can be used to: 
          <list type="gloss">
            <label>digitise</label>
            <item>an existing print edition</item>
            <label>create</label>
            <item>a digital edition, e.g., by recording some or all of the known variations among different witnesses to the text in a critical apparatus of variants</item>
            <label>generate</label>
            <item>a digital edition from encoded transcriptions of the documentary source material</item>
          </list>
        </p>
        <p>In the examples in this TBE module, <soCalled>critical editing with TEI</soCalled> will be understood as the act of encoding material sources in a TEI representation that allows for the creation or generation of a digital edition in some form (using any output format in the digital medium, e.g., HTML pages, PDF, flash movies,...), rather than digitising an existing critical edition. In this sense, the authorial/editorial angle of this TBE module differs from that of the other modules (focusing on the digitisation of a material source text in a certain genre). However, the strategies discussed in this tutorial for representing textual variation can equally be applied to the digitisation of existing critical editions. Where there are differences, these will be pointed out explicitly.</p>
        <p>For example, consider following texts:
          <table rows="2" cols="2" xml:id="table1">
            <row>
              <cell>
                <figure xml:id="figure1">
                  <graphic url="../../../images/tutorials/TBED07v00/SG_P2_small.jpg"/>
                  <head type="legend">A page of version P2 of the TEI Guidelines</head>
                </figure>
              </cell>
              <cell>
                <figure xml:id="figure2">
                  <graphic url="../../../images/tutorials/TBED07v00/SG_P3_small.jpg"/>
                  <head type="legend">A page of version P3 of the TEI Guidelines</head>
                </figure>
              </cell>
            </row>
            <row>
              <cell>
                <figure xml:id="figure3">
                  <graphic url="../../../images/tutorials/TBED07v00/SG_P4_small.jpg"/>
                  <head type="legend">A page of version P4 of the TEI Guidelines</head>
                </figure>
              </cell>
              <cell>
                <figure xml:id="figure4">
                  <graphic url="../../../images/tutorials/TBED07v00/SG_P5_small.jpg"/>
                  <head type="legend">A page of version P5 of the TEI Guidelines</head>
                </figure>
              </cell>
            </row>
          </table>
        </p>
        <p>Some of these images may look more or less familiar to you: they are facsimiles from the first page of chapter 2 of the printed TEI Guidelines throughout their different incarnations, from version P2 (1992) to the latest version, P5 (2009). As you can imagine, the technological evolutions of these 17 years have prompted considerable changes to this chapter that introduces the technological background of text encoding with TEI, ranging from rephrasing, addition or deletion of notes, changes in italicisation, restructuring of paragraphs, etc. One way of approaching this textual variation could consist of encoding these text versions as physically distinct TEI documents, in which corresponding text structures could be aligned by a common identification mechanism. For example, the first couple of paragraphs in these 4 text witnesses could be encoded in different TEI documents as follows:
          <table rows="1" cols="2" xml:id="table2">
            <row>
              <cell role="label">P2</cell>
              <cell role="label">P3</cell>
            </row>
            <row>
              <cell>
                <figure xml:id="example1">
                  <egXML xmlns="http://www.tei-c.org/ns/Examples">
                    <pb n="2"/>
                    <head>Chapter 2 <lb/>A GENTLE INTRODUCTION TO SGML</head>
                    <p xml:id="p1" corresp="P3.xml#p1 P4.xml#p1 P5.xml#p1">The encoding scheme defined by these Guidelines is formulated as an application of a system known as the Standard Generalized Markup Language (SGML).<note place="foot" xml:id="n1" corresp="P3.xml#n1 P4.xml#n2">
                                                <bibl>
                                                    <editor>International Organization for Standardization</editor>, <title>ISO 8879: Information processing--Text and office systems--Standard Generalized Mark-up Language (SGML)</title>, ([<pubPlace>Geneva</pubPlace>]: <publisher>ISO</publisher>, <date>1986</date>).</bibl> Although widely said to be short for the surnames of its progenitors, the official expansion of this abbreviation is "Standard Generalized Markup Language."</note> SGML is an international standard for the definition of device-independent, system-independent methods of representing texts in electronic form. This chapter presents a brief tutorial guide to its main features, for those readers who have not encountered it before. For a more technical account of TEI practice in using the SGML standard, see chapter 30, "TEI Conformance," [in separate fascicle]; for a more technical description of the subset of SGML used by the TEI encoding scheme, see chapter 39, "Formal Grammar for the TEI-Interchange-Format Subset of SGML," [in separate fascicle].</p>
                    <p xml:id="p2a" corresp="P3.xml#p2a P4.xml#p2 P5.xml#p2">SGML is an international standard for the description of marked-up electronic text. More exactly, SGML is a metalanguage, that is, a means of formally describing a language, in this case, a markup language. Before going any further we should define these terms.</p>
                    <p xml:id="p2b" corresp="P3.xml#p2b P4.xml#p2 P5.xml#p2">Historically, the word markup has been used to describe annotation or other marks within a text intended to instruct a compositor or typist how a particular passage should be printed or laid out. Examples include wavy underlining to indicate boldface, special symbols for passages to be omitted or printed in a particular font and so forth. As the formatting and printing of texts was automated, the term was extend-ed to cover all sorts of special markup codes inserted into electronic texts to govern formatting, printing, or other processing.</p>
                  </egXML>
                  <head type="legend">An encoded page of version P2 of the TEI Guidelines</head>
                </figure>
              </cell>
              <cell>
                <figure xml:id="example2">
                  <egXML xmlns="http://www.tei-c.org/ns/Examples">
                    <pb n="13"/>
                    <head>Chapter 2 <lb/>A Gentle Introduction to SGML</head>
                    <p xml:id="p1" corresp="P2.xml#p1 P4.xml#p1 P5.xml#p1">The encoding scheme defined by these Guidelines is formulated as an application of a system known as the Standard Generalized Markup Language (SGML). <note place="foot" xml:id="n1" corresp="P2.xml#n1 P4.xml#n2">
                                                <bibl>
                                                    <editor>International Organization for Standardization</editor>, <title>ISO 8879: Information processing - Text and office systems - Standard Generalized Markup Language (SGML)</title>, ([<pubPlace>Geneva</pubPlace>]: <publisher>ISO</publisher>, <date>1986</date>)</bibl>
                                            </note> SGML is an international standard for the definition of device-independent, system-independent methods of representing texts in electronic form. This chapter presents a brief tutorial guide to its main features, for those readers who have not encountered it before. For a more technical account of TEI practice in using the SGML standard, see chapter 28, "Conformance," on page 727. For a more technical description of the subset of SGML used by the TEI encoding scheme, see chapter 39, "Formal Grammar for the TEI-Interchange-Format Subset of SGML," on page 1247.</p>
                    <p xml:id="p2a" corresp="P2.xml#p2a P4.xml#p2 P5.xml#p2">SGML is an international standard for the description of marked-up electronic text. More exactly, SGML is a <hi>metalanguage</hi>, that is, a means of formally describing a language, in this case, a <hi>markup language</hi>. Before going any further we should define these terms.</p>
                    <p xml:id="p2b" corresp="P2.xml#p2b P4.xml#p2 P5.xml#p2">Historically, the word <hi>markup</hi> has been used to describe annotation or other marks within a text intended to instruct a compositor or typist how a particular passage should be printed or laid out. Examples include wavy underlining to indicate boldface, special symbols for passages to be omitted or printed in a particular font and so forth. As the formatting and printing of texts was automated, the term was extended to cover all sorts of special <hi>markup codes</hi> inserted into electronic texts to govern formatting, printing, or other processing.</p>
                  </egXML>
                  <head type="legend">An encoded page of version P3 of the TEI Guidelines</head>
                </figure>
              </cell>
            </row>
            <row>
              <cell role="label">P4</cell>
              <cell role="label">P5</cell>
            </row>
            <row>
              <cell>
                <figure xml:id="example3">
                  <egXML xmlns="http://www.tei-c.org/ns/Examples">
                    <pb n="13"/>
                    <head>2 A Gentle Introduction to XML</head>
                    <note type="disclaimer" xml:id="n1">As originally published in previous editions of the Guidelines, this chapter provided a gentle introduction to 'just enough' SGML for anyone to understand how the TEI used that standard. Since then, the Gentle Guide seems to have taken on a life of its own independent of the Guidelines, having been widely distributed (and flatteringly imitated) on the web. In revising it for the present draft, the editors have therefore felt free to reduce considerably its discussion of SGML-specific matters, in favour of a simple presentation of how the TEI uses XML.</note>
                    <p xml:id="p1" corresp="P2.xml#p1 P3.xml#p1 P5.xml#p1">The encoding scheme defined by these Guidelines may be formulated either as an application of the ISO Standard Generalized Markup Language (SGML)<note place="foot" corresp="P2.xml#n1 P3.xml#n1">
                                                <bibl>
                                                    <editor>International Organization for Standardization</editor>, <title>ISO 8879: Information processing - Text and office systems - Standard Generalized Markup Language (SGML)</title>, ([<pubPlace>Geneva</pubPlace>]: <publisher>ISO</publisher>, <date>1986</date>)</bibl>
                                            </note> or of the more recently developed W3C Extensible Markup Language (XML)<note place="foot" xml:id="n3">
                                                <bibl>
                                                    <editor>World Wide Web Consortium</editor>: <title>Extensible Markup Language (XML) 1.0</title>, available from <ref target="http://www.w3.org/TR/REC-xml">http://www.w3.org/TR/REC-xml</ref>
                                                </bibl>
                                            </note>. Both SGML and XML are widely-used for the definition of device-independent, system-independent methods of storing and processing texts in electronic form; XML being in fact a simplification or derivation of SGML. In the present chapter we introduce informally the basic concepts underlying such markup languages and attempt to explain to the reader encountering them for the first time how they are actually used in the TEI scheme. Except where the two are explicitly distinguished, references to XML in what follows may be understood to apply equally well to the TEI usage of SGML. For a more technical account of TEI practice see chapter 28 <hi>Conformance</hi>; for a more technical description of the subset of SGML used by the TEI encoding scheme, see chapter 39 <hi>Formal Grammar for the TEI-Interchange-Format Subset of SGML</hi>.</p>
                    <p xml:id="p2" corresp="P2.xml#p2a P2.xml#p2b P3.xml#p2a P3.xml#p2b P5.xml#p2">XML is an extensible markup language used for the description of marked-up electronic text. More exactly, XML is a <hi>metalanguage</hi>, that is, a means of formally describing a language, in this case, a <hi>markup language</hi>. Historically, the word <hi>markup</hi> has been used to describe annotation or other marks within a text intended to instruct a compositor or typist how a particular passage should be printed or laid out. Examples include wavy underlining to indicate boldface, special symbols for passages to be omitted or printed in a particular font and so forth. As the formatting and printing of texts was automated, the term was extended to cover all sorts of special codes inserted into electronic texts to govern formatting, printing, or other processing.</p>
                  </egXML>
                  <head type="legend">An encoded page of version P4 of the TEI Guidelines</head>
                </figure>
              </cell>
              <cell>
                <figure xml:id="example4">
                  <egXML xmlns="http://www.tei-c.org/ns/Examples">
                    <pb n="xxxi"/>
                    <head>v <lb/>A Gentle Introduction to XML</head>
                    <p xml:id="p1" corresp="P2.xml#p1 P3.xml#p1 P5.xml#p1">The encoding scheme defined by these Guidelines is formulated as an application of the Extensible Markup Language (XML) (Bray et al. (eds.) (2006)). XML is widely used for the definition of device-independent, system-independent methods of storing and processing texts in electronic form. It is now also the interchange and communication format used by many applications on the World Wide Web. In the present chapter we informally introduce some of its basic concepts and attempt to explain to the reader encountering them for the first time how and why they are used in the TEI scheme. More detailed technical accounts of TEI practice in this respect are provided in chapters <hi>23. Using the TEI</hi>, <hi>1. The TEI Infrastructure</hi>, and <hi>22. Documentation Elements</hi> of these Guidelines.</p>
                    <p xml:id="p2" corresp="P2.xml#p2a P2.xml#p2b P3.xml#p2a P3.xml#p2b P4.xml#p2">Strictly speaking, XML is a metalanguage, that is, a language used to describe other languages, in this case, markup languages. Historically, the word markup has been used to describe annotation or other marks within a text intended to instruct a compositor or typist how a particular passage should be printed or laid out. Examples include wavy underlining to indicate boldface, special symbols for passages to be omitted or printed in a particular font, and so forth. As the formatting and printing of texts was automated, the term was extended to cover all sorts of special codes inserted into electronic texts to govern formatting, printing, or other processing.</p>
                  </egXML>
                  <head type="legend">An encoded page of version P5 of the TEI Guidelines</head>
                </figure>
              </cell>
            </row>
          </table>
        </p>
        <p>This would allow for maximal representation of the distinct material sources, and leave the identification of the actual variation either to further processing or human inspection. A variant of this approach could integrate the transcriptions of the text in all material witnesses in a single TEI document, and make use of appropriate linking attributes to point out the alignment between the different text structures. In their naivety, such systems are both redundant and crude. While providing <emph>all</emph> text of all text witnesses, and aligning the corresponding text structures, they provide little <emph>insight</emph> in the places where the different witnesses actually differ.</p>
        <p>In order to encode the actual textual variation between the different text versions in a meaningful way, the TEI Guidelines provide a specialised module of elements and attributes that allow you to encode textual variation at word level. This TBE tutorial will first discuss how to describe the different text witnesses represented in the critical edition; next deal with the encoding of textual variants between these witnesses in isolation; then treat different ways of integrating such records of variation within the encoding of the critical edition; and finally point out potential problems and pitfalls when creating a critical edition with TEI.</p>
      </div>
        </body>
    <back>
      <div type="bibliography">
        <listBibl>
          <bibl xml:id="vanhoutte2009">
                        <author>Vanhoutte, Edward</author>, and <author>Ron Van den Branden</author>. <date>2009</date>. <title level="a">Describing, Transcribing, Encoding, and Editing Modern Correspondence Material: a Textbase Approach</title>. <title level="j">Literary and Linguistic Computing</title> <biblScope unit="volume">24</biblScope> (<biblScope unit="issue">1</biblScope>): <biblScope unit="page">77–98</biblScope>. <idno type="DOI">10.1093/llc/fqn035</idno>.</bibl>
        </listBibl>
      </div>
    </back>
  </text>
  <!--
    $Date$
    $Id$  -->
</TEI>