TEI by Example

Metaphorical Language

Just like any other aspect of the transcription and encoding process, analysing metaphorical language involves interpretation. Unlike the expression of this interpretation in structural markup, the analysis of metaphorical language addresses logical structures that probably will cross structural boundaries. Due to the primary focus of the TEI encoding scheme on representing structural characteristics of texts, and the syntactic requirement that all XML structures should nest properly, this kind of logical analysis will need a kind of workaround. One such workaround defines different analytic categories separately and links them to the relevant passages in the poems. A first requirement is that the smallest addressable structures be identified with an xml:id attribute: xml:id provides a unique identifier for the element bearing the attribute. Its value must start with a letter, or _ and may be followed by one or more Named Characters (letter, digit, ., -, _, Unicode Combining Character, or Extender). See the XML Specification for more guidance on the use of xml:id.

Poppadom Oatmeal Bubble gum Cut of veal Mince for pie Frozen peas Video for Guy Selection of teas Paper towels/garbage bags Pasta sauce and Parmesan Pumpkin seed and olive oil Cheesy crisps and favourite mags Kidney beans (1 large can) Cling film and kitchen foil Making structural units addressable via xml:id.

These xml:id values can later be used to identify spans of interpretive categories. Such interpretive spans are encoded as span, identifying the structural scope of this interpretation with the from attribute marking its beginning and the to attribute marking its end. Their values are values for an xml:id attribute elsewhere in the document, prefixed with a hash character (#), in order to indicate it as the identifier part of a formal URI reference. Preferably, the resp attribute is used to identify who is responsible for the interpretation.The value of the resp attribute must be a pointer to an element in the document header that is associated with a person asserted as responsible for some aspect of the text’s creation, transcription, editing, or encoding. See chapter 14 Certainty and Responsibility of the TEI Guidelines. Related span elements can be grouped in a spanGrp element. A type attribute can characterise the kind of analysis. In our poem, one way of identifying all images concerned with food and non-food could be:

Notice that the from attribute is mandatory on span. The to attribute is optional; when it is missing, the entire structure identified by the from attribute will be taken as the context for the interpretation.

As will be clear from this example, this system of linking analyses to textual structures is not very economic when discontinuous structures share the same analysis. Another approach for logical analysis works from the opposite angle, by hooking text structures to specific interpretations. In order to do so, interpretive categories should be formally defined somewhere else, either in the same document or externally. This is done inside the interp element, bearing a unique xml:id attribute. Related interpretive categories can be grouped inside an interpGrp element. Following example could identify the same semantic categories from the previous example:

food non-food Defining interpretive categories in interp elements.

Inside the transcription of the poem, reference can be made to these interpretations with an ana attribute. Its value should always point to an identifier, either locally or externally. Suppose these interpretive categories are stored in a separate document named analysis.xml. Then the poem could be analysed as follows: Of course, analyses can be much more complex. The TEI provides a generic mechanism for expressing complex hierarchies of interpretive relationships, called Feature Structures. See chapter 12 Feature Structures of the TEI Guidelines for a detailed treatment.

For the study of metaphorical language in poetry, logical structures should be identified and grouped into interpretive categories which can be related to other such categories. These interpretive categories with their logical relationships can be stored inside or outside the XML document containing the analysed text. In the poem, reference can be made to these internally or externally documented analyses.