Module 4: Poetry

2. Patterned Arrangement of Language

2.1. Document Type

A shopping list is commonly not considered a form of literature, not even when it features in a literary work, or when it shows some incidental metrical structure or rhyme, unless someone identifies the text as such. The following text can thus be considered as non-literary prose:

Poppadom Oatmeal Bubble gum Cut of veal Mince for pie Frozen peas Video for Guy Selection of teas Paper towels/garbage bags Pasta sauce and Parmesan Pumpkin seed and olive oil Cheesy crisps and favourite mags Kidney beans (1 large can) Cling film and kitchen foil
Example 1. A verse-like text.

What we see here is a functional block of fourteen lines of prose, typographically separated from each other by line breaks. The fundamental organisational unit for prose is the paragraph which is encoded in TEI as <p>. The start of a new typographical line is encoded with the milestone tag <lb>, which appears as <lb/> because it is a self-closing empty element. Applied to the shopping list, this produces the following encoding:

<p xmlns="http://www.tei-c.org/ns/1.0">Poppadom
<lb/>
Oatmeal
<lb/>
Bubble gum
<lb/>
Cut of veal
<lb/>
Mince for pie
<lb/>
Frozen peas
<lb/>
Video for Guy
<lb/>
Selection of teas
<lb/>
Paper towels/garbage bags
<lb/>
Pasta sauce and Parmesan
<lb/>
Pumpkin seed and olive oil
<lb/>
Cheesy crisps and favourite mags
<lb/>
Kidney beans (1 large can)
<lb/>
Cling film and kitchen foil
<lb/>
</p>
Example 2. A verse-like text, encoded as prose with line breaks (<lb>).

But our shopping list is a specific type of prose: it is a list, for which we could use <list> containing twelve items for which we use <item>. This could result in the following encoding:

<list xmlns="http://www.tei-c.org/ns/1.0">
<item>Poppadom</item>
<item>Oatmeal</item>
<item>Bubble gum</item>
<item>Cut of veal</item>
<item>Mince for pie</item>
<item>Frozen peas</item>
<item>Video for Guy</item>
<item>Selection of teas</item>
<item>Paper towels​/garbage bags</item>
<item>Pasta sauce and Parmesan</item>
<item>Pumpkin seed and olive oil</item>
<item>Cheesy crisps and favourite mags</item>
<item>Kidney beans (1 large can)</item>
<item>Cling film and kitchen foil</item>
</list>
Example 3. A verse-like text, encoded as a list.

On the most basic structural level, this shopping list is thus a named grouping of lines of text. When we use the element <lg> for line group, and the element <l> for line, we arrive at the following encoding:

<lg xmlns="http://www.tei-c.org/ns/1.0">
<l>Poppadom</l>
<l>Oatmeal</l>
<l>Bubble gum</l>
<l>Cut of veal</l>
<l>Mince for pie</l>
<l>Frozen peas</l>
<l>Video for Guy</l>
<l>Selection of teas</l>
<l>Paper towels​/garbage bags</l>
<l>Pasta sauce and Parmesan</l>
<l>Pumpkin seed and olive oil</l>
<l>Cheesy crisps and favourite mags</l>
<l>Kidney beans (1 large can)</l>
<l>Cling film and kitchen foil</l>
</lg>
Example 4. A verse-like text encoded with <l>.

Summary

Up to now we have looked at three different possible encodings for the same document which we identified as a shopping list. The first encoding considers the document as a block of prose in which different lines of text are separated by line breaks. The second one structures the shopping list with the appropriate labels (list and item). The third one makes abstraction of the meaning of these typical labels and considers the shopping list as a group of lines of text. The latter is less specific than the second, but more structurally descriptive than the first.

2.2. Structural Divisions

But is this shopping list just a block of prose, organised as a group of lines? The metrical composition, form, and structure of this shopping list, together with the use of rhyme may suggest that this is rather a piece of poetry which can be encoded as such. The rhyme present in the poem gives away a possible structure which consists not of one but of four line groups or stanzas. We use <lg> to encode them:

<lg xmlns="http://www.tei-c.org/ns/1.0">
<lg>
<l>Poppadom</l>
<l>Oatmeal</l>
<l>Bubble gum</l>
<l>Cut of veal</l>
</lg>
<lg>
<l>Mince for pie</l>
<l>Frozen peas</l>
<l>Video for Guy</l>
<l>Selection of teas</l>
</lg>
<lg>
<l>Paper towels​/garbage bags</l>
<l>Pasta sauce and Parmesan</l>
<l>Pumpkin seed and olive oil</l>
</lg>
<lg>
<l>Cheesy crisps and favourite mags</l>
<l>Kidney beans (1 large can)</l>
<l>Cling film and kitchen foil</l>
</lg>
</lg>
Example 5. Distinction of line groups in a verse-like text, with <lg>.

There is nothing in this encoding, however, which documents whether these four line groups belong together or not. Therefore we can wrap another <lg> element around them inside which they nest comfortably.

Note

Like other text-division elements, <lg> elements can nest hierarchically.

In order to distinguish among the nesting line groups and the parental one, we can add some semantic information in a @type attribute which specifies a name conventionally used for this level of division and label the line groups respectively as "stanza" and "poem" as in the following example:

<lg xmlns="http://www.tei-c.org/ns/1.0" type="poem">
<lg type="stanza">
<l>Poppadom</l>
<l>Oatmeal</l>
<l>Bubble gum</l>
<l>Cut of veal</l>
</lg>
<lg type="stanza">
<l>Mince for pie</l>
<l>Frozen peas</l>
<l>Video for Guy</l>
<l>Selection of teas</l>
</lg>
<lg type="stanza">
<l>Paper towels​/garbage bags</l>
<l>Pasta sauce and Parmesan</l>
<l>Pumpkin seed and olive oil</l>
</lg>
<lg type="stanza">
<l>Cheesy crisps and favourite mags</l>
<l>Kidney beans (1 large can)</l>
<l>Cling film and kitchen foil</l>
</lg>
</lg>
Example 6. Distinguishing nesting line groups with nesting <lg> elements.

Notice that, while the @type attribute can have any value defined by the encoder (as long as it does not contain white space), it is intended solely for conventional names of different classes of text blocks. When <lg> is used to encode paragraphs in prose poetry, the @type attribute value could be "para" or anything else. If the <lg> represents an arbitrary organisation of lines, the @type attribute value could be "free" or anything else. If, for instance, this poem would have been organised differently, say in two quatrains and one sestet, we could have the following encoding:

<lg xmlns="http://www.tei-c.org/ns/1.0" type="poem">
<lg type="quatrain">
<l>Poppadom</l>
<l>Oatmeal</l>
<l>Bubble gum</l>
<l>Cut of veal</l>
</lg>
<lg type="quatrain">
<l>Mince for pie</l>
<l>Frozen peas</l>
<l>Video for Guy</l>
<l>Selection of teas</l>
</lg>
<lg type="sestet">
<l>Paper towels​/garbage bags</l>
<l>Pasta sauce and Parmesan</l>
<l>Pumpkin seed and olive oil</l>
<l>Cheesy crisps and favourite mags</l>
<l>Kidney beans (1 large can)</l>
<l>Cling film and kitchen foil</l>
</lg>
</lg>
Example 7. An alternative grouping of lines, distinguished with the @type attribute on <lg>.

Next, we can number the stanzas and lines in our poem inside an @n attribute and document that this poem has four stanzas and fourteen lines of verse:

Note

For large corpora of verse texts these numberings are commonly added automatically by some sort of programmed routine.
<lg xmlns="http://www.tei-c.org/ns/1.0" type="poem">
<lg type="stanza" n="1">
<l n="1">Poppadom</l>
<l n="2">Oatmeal</l>
<l n="3">Bubble gum</l>
<l n="4">Cut of veal</l>
</lg>
<lg type="stanza" n="2">
<l n="5">Mince for pie</l>
<l n="6">Frozen peas</l>
<l n="7">Video for Guy</l>
<l n="8">Selection of teas</l>
</lg>
<lg type="stanza" n="3">
<l n="9">Paper towels​/garbage bags</l>
<l n="10">Pasta sauce and Parmesan</l>
<l n="11">Pumpkin seed and olive oil</l>
</lg>
<lg type="stanza" n="4">
<l n="12">Cheesy crisps and favourite mags</l>
<l n="13">Kidney beans (1 large can)</l>
<l n="14">Cling film and kitchen foil</l>
</lg>
</lg>
Example 8. Numbering of structural units in a verse text, with @n.

Poems often carry a title which can be encoded using <head>:

<lg xmlns="http://www.tei-c.org/ns/1.0" type="poem">
<head>Shopping</head>
<lg type="stanza" n="1">
<l n="1">Poppadom</l>
<l n="2">Oatmeal</l>
<l n="3">Bubble gum</l>
<l n="4">Cut of veal</l>
</lg>
<lg type="stanza" n="2">
<l n="5">Mince for pie</l>
<l n="6">Frozen peas</l>
<l n="7">Video for Guy</l>
<l n="8">Selection of teas</l>
</lg>
<lg type="stanza" n="3">
<l n="9">Paper towels​/garbage bags</l>
<l n="10">Pasta sauce and Parmesan</l>
<l n="11">Pumpkin seed and olive oil</l>
</lg>
<lg type="stanza" n="4">
<l n="12">Cheesy crisps and favourite mags</l>
<l n="13">Kidney beans (1 large can)</l>
<l n="14">Cling film and kitchen foil</l>
</lg>
</lg>
Example 9. Encoding the title of a poem in <head>.

Preceding quotations introducing the poem as a motto can be encoded using <epigraph>:

<lg xmlns="http://www.tei-c.org/ns/1.0" type="poem">
<head>Shopping</head>
<note type="attribution">To my sweetest son</note>
<epigraph>
<cit>
<quote>Thou shalt not steal</quote>
<bibl>Ex. 20​:15</bibl>
</cit>
</epigraph>
<lg type="stanza">
<l>Poppadom</l>
<l>Oatmeal</l>
<l>Bubble gum</l>
<l>Cut of veal</l>
</lg>
<lg type="stanza">
<l>Mince for pie</l>
<l>Frozen peas</l>
<l>Video for Guy</l>
<l>Selection of teas</l>
</lg>
<lg type="stanza">
<l>Paper towels​/garbage bags</l>
<l>Pasta sauce and Parmesan</l>
<l>Pumpkin seed and olive oil</l>
</lg>
<lg type="stanza">
<l>Cheesy crisps and favourite mags</l>
<l>Kidney beans (1 large can)</l>
<l>Cling film and kitchen foil</l>
</lg>
</lg>
Example 10. Encoding introductory quotations with <epigraph>.

Poems are often signed, which can be encoded using <signed> outside the <lg type="poem"> element:

<lg xmlns="http://www.tei-c.org/ns/1.0" type="poem">
<head>Shopping</head>
<lg type="stanza">
<l>Poppadom</l>
<l>Oatmeal</l>
<l>Bubble gum</l>
<l>Cut of veal</l>
</lg>
<lg type="stanza">
<l>Mince for pie</l>
<l>Frozen peas</l>
<l>Video for Guy</l>
<l>Selection of teas</l>
</lg>
<lg type="stanza">
<l>Paper towels​/garbage bags</l>
<l>Pasta sauce and Parmesan</l>
<l>Pumpkin seed and olive oil</l>
</lg>
<lg type="stanza">
<l>Cheesy crisps and favourite mags</l>
<l>Kidney beans (1 large can)</l>
<l>Cling film and kitchen foil</l>
</lg>
</lg>
<signed xmlns="http://www.tei-c.org/ns/1.0">M. Ystery​-Shopper</signed>
Example 11. Encoding the signature for a poem with <signed>.

Summary

Different document types such as prose, verse, or drama can be considered poetry. Poetry of the document type “verse” consists typically of minimally one group of one or more lines, which may be encoded using <lg> and <l>, respectively. Line groups can nest inside each other. To all instances of line groups and lines, semantic and analytical information can be added in attributes. The title of a poem can be encoded using <head>, motto’s can be encoded using <epigraph>, the signature of the poet can be encoded using <signed>.

2.3. Grouping Structures

Single poems may appear isolated as independent texts, as part of some other document types such as prose and drama, or in combination with other poems as part of composite texts. Typical examples of such composite texts are anthologies, cycles of poems, and composite poems, i.e., poems consisting of other poems. The line between cycles of poems and composite poems, however, is thin and assigning either interpretation to the texts is the encoder’s decision, who can usually depend on how the author or publisher represented the texts in the original publication.

Many encoding strategies can be used to encode either of the composite text types mentioned. In the following paragraphs we suggest only a couple of them.

2.3.1. Composite Poems

Since <lg> elements can nest, it is possible to encode nesting poems using <lg> with a value "poem" for the @type attribute. If we consider the stanza’s of the example poem as poems in their own right, which were numbered by the author, this can result in the following encoding:

<lg xmlns="http://www.tei-c.org/ns/1.0" type="poem">
<head>Shopping</head>
<lg type="poem" n="1">
<head>I.</head>
<l>Poppadom</l>
<l>Oatmeal</l>
<l>Bubble gum</l>
<l>Cut of veal</l>
</lg>
<lg type="poem" n="2">
<head>II.</head>
<l>Mince for pie</l>
<l>Frozen peas</l>
<l>Video for Guy</l>
<l>Selection of teas</l>
</lg>
<lg type="poem" n="3">
<head>I​II.</head>
<l>Paper towels​/garbage bags</l>
<l>Pasta sauce and Parmesan</l>
<l>Pumpkin seed and olive oil</l>
</lg>
<lg type="poem" n="4">
<head>IV.</head>
<l>Cheesy crisps and favourite mags</l>
<l>Kidney beans (1 large can)</l>
<l>Cling film and kitchen foil</l>
</lg>
</lg>
Example 12. Encoding a composite poem.

2.3.2. Cycles of Poems

Cycles of poems are structurally akin to composite poems but the top level element is not a <lg type="poem"> but a <div> with a suggested value "cycle" for the @type attribute. If we reconsider the example poem, this generates the next encoding:

<div xmlns="http://www.tei-c.org/ns/1.0" type="cycle">
<head>Shopping</head>
<lg type="poem" n="1">
<head>I.</head>
<l>Poppadom</l>
<l>Oatmeal</l>
<l>Bubble gum</l>
<l>Cut of veal</l>
</lg>
<lg type="poem" n="2">
<head>II.</head>
<l>Mince for pie</l>
<l>Frozen peas</l>
<l>Video for Guy</l>
<l>Selection of teas</l>
</lg>
<lg type="poem" n="3">
<head>I​II.</head>
<l>Paper towels​/garbage bags</l>
<l>Pasta sauce and Parmesan</l>
<l>Pumpkin seed and olive oil</l>
</lg>
<lg type="poem" n="4">
<head>IV.</head>
<l>Cheesy crisps and favourite mags</l>
<l>Kidney beans (1 large can)</l>
<l>Cling film and kitchen foil</l>
</lg>
</div>
Example 13. Encoding a cycle of poems.

2.3.3. Anthologies

An anthology can as well be represented using a <div type="anthology"> element but it is good practice to use the more powerful <group> element. The <group> element groups together a sequence of distinct texts (or groups of such texts) which are regarded as a unit for some purpose. The <group> element consists of <text> elements which may have optional <front> and <back> elements and mandatory <body> elements. If we reconsider the example poem that way, the following encoding may apply:

Note

Each <text> element can have its own <front> and <back> elements so that it becomes possible, for instance, to encode complete works of poets, maintaining the original front and back materials of the separately published poetry volumes. See Module 1: Common Structure, Elements, and Attributes, section 2.2.
<text xmlns="http://www.tei-c.org/ns/1.0">
<front>
<docTitle>
<titlePart>Shopping</titlePart>
</docTitle>
</front>
<group>
<text>
<body>
<lg type="poem" n="1">
<head>I.</head>
<l>Poppadom</l>
<l>Oatmeal</l>
<l>Bubble gum</l>
<l>Cut of veal</l>
</lg>
</body>
</text>
<text>
<body>
<lg type="poem" n="2">
<head>II.</head>
<l>Mince for pie</l>
<l>Frozen peas</l>
<l>Video for Guy</l>
<l>Selection of teas</l>
</lg>
</body>
</text>
<text>
<body>
<lg type="poem" n="3">
<head>I​II.</head>
<l>Paper towels​/garbage bags</l>
<l>Pasta sauce and Parmesan</l>
<l>Pumpkin seed and olive oil</l>
</lg>
</body>
</text>
<text>
<body>
<lg type="poem" n="4">
<head>IV.</head>
<l>Cheesy crisps and favourite mags</l>
<l>Kidney beans (1 large can)</l>
<l>Cling film and kitchen foil</l>
</lg>
</body>
</text>
</group>
</text>
Example 14. Encoding an anthology with <group>.

Summary

Composite poems, cycles of poems, and anthologies are composite texts which can be encoded using different encoding strategies. Composite poems may be encoded using nesting <lg> elements. Cycles of poems may be encoded using sibling <lg type="poem"> elements. Anthologies may be encoded using the <group> element which may have multiple <text> elements.