Module 5: Drama

3. Basic Structure

3.1. Structural Divisions of the Actual Text

As with any other TEI text, the actual contents of drama texts are encoded in <body>, possibly preceded by front matter in <front> and succeeded by back matter in <back>. Inside <body>, the actual text often has specific structures: acts and scenes. Like parts and chapters in prose works, these can be encoded using the generic TEI element <div> (division). In order to identify the nature of the text divisions, they can be characterised in a @type attribute. For text divisions in drama texts, the values "act" and "scene" can be used, although you can use your own typology if so desired. For our example text, the major structural divisions can be encoded as follows:

<body xmlns="http://www.tei-c.org/ns/1.0">
<div type="act">
<head>ACT I</head>
<div type="scene">
<head>SCENE 1: HELLO WORLD</head>

user, computer

In a clean room without windows, a table stands in the middle, carrying a computer screen.
The screen blinks, the computer emits random beeps.

[A big screen on the left hand side of the stage displays the word HELLO, and will be used to project the contents of the computer screen. A big screen on the right hand stage of the stage displays the word WORD, until the action starts. Then, the user's face is filmed in close up and projected onto this screen.]

(user enters room, walks towards the table, takes a seat, and reads what's on the screen)
[spotlight follows user]
COMP: Hello.
(user looks around and starts typing)
USER: What is on the table?

COMP: The table does not exist.
(user frowns)
USER: There is a table in the room.

COMP: Ok.
USER: What is on the table?

COMP: Nothing.
USER: There is a black box on the table.

COMP: Ok.
USER: What is on the table?

COMP: A black box.
USER: There is a black box on the table.

(computer beeps frantically)
COMP: I know.
USER: There is a red ball in the black box.

COMP: Ok.
USER: Where is the black box?

COMP: The black box is on the table.
USER: What is in the black box?

COMP: A red ball.
USER: Where is the black ball?

COMP: I don't know what black ball you are talking about.
USER: Where is the red ball?

COMP: In the black box.
USER: There is a black ball in the box.

COMP: Ok.
USER: The red ball is on the table.

COMP: In fact the red ball is in the box.
USER: The ball in the box is small.

COMP: I cannot understand such a sentence at the present time.
USER: The black ball is small.

COMP: I see.
USER: What is in the box?

COMP: A small black ball and a red ball.

(user smiles, stands up, takes a small green ball out of his pocket, walks away throwing and catching the ball)
[Kraftwerk's "Computer Love" accompanies the exit]</div>
</div>
</body>
Example 2. Encoding the basic text structure of a drama text.

Of course, this markup is far from complete, and even invalid TEI, as the TEI Guidelines stipulate that <div> text divisions cannot contain unstructured text. We’ll fill in the blanks in the next sections of this tutorial.

Summary

Commonly distinguished large structures in drama texts are acts and scenes. They can be encoded in TEI using the generic <div> element for text divisions. The nature of the text division can be characterised in a @type attribute.

3.2. The Basic Building Block: Speeches

One of the most distinctive organisational principles in drama texts is the grouping of text lines in utterances, spoken by the respective characters. Each line of speech can be encoded in a <sp> (speech) element. When present in the text source, the indications of the speaking character can be encoded in a <speaker> element. The actual text of a speech line should be further encoded as a paragraph (<p>) or verse line (<l>, possibly grouped in <lg>). If the encoder is undecided about the prose or verse qualities of the text lines, <ab> (anonymous block) may also be used.

Note

A dramatic speech line (<sp>) cannot contain unstructured text: it should be encoded further as paragraph (<p>), verse line (<l>, possibly grouped in <lg>), or genre-unspecified anonymous block (<ab>).

For example, the last three speech lines in the example can be encoded as follows:

<sp xmlns="http://www.tei-c.org/ns/1.0">
<speaker>C​OMP</speaker>
<p>I see.</p>
</sp>
<sp xmlns="http://www.tei-c.org/ns/1.0">
<speaker>U​SER</speaker>
<p>What is in the box?</p>
</sp>
<sp xmlns="http://www.tei-c.org/ns/1.0">
<speaker>C​OMP</speaker>
<p>A small black ball and a red ball.</p>
</sp>
Example 3. Encoding of speech lines with <sp>.

Notice how in this example, the colons after the name labels have been left out of the transcription. This is an editorial choice that can be motivated by the fact that the colons themselves are purely typographical separators between the labels and the text, that can systematically be reproduced when this transcription is rendered. Likewise, the blank line between both groups of speeches has been omitted as a purely typographic feature. Typographical line breaks can of course be encoded where they occur with empty <lb> elements.

The <sp> element provides a more formal mechanism for indicating the speaking character, by means of a @who attribute. Its value points to a definition of this character elsewhere in the transcription. The value of this definition’s @xml:id attribute is used in the @who attribute, preceded with a hash character (#), in order to indicate it as the identifier part of a formal URI reference. For example:

<sp xmlns="http://www.tei-c.org/ns/1.0" who="#comp">
<speaker>C​OMP</speaker>
<p>I see.</p>
</sp>
<sp xmlns="http://www.tei-c.org/ns/1.0" who="#user">
<speaker>U​SER</speaker>
<p>What is in the box?</p>
</sp>
<sp xmlns="http://www.tei-c.org/ns/1.0" who="#comp">
<speaker>C​OMP</speaker>
<p>A small black ball and a red ball.</p>
</sp>
Example 4. Formal identification of speakers in a @who attribute on <sp>, and transcription of the labels in the source text with <speaker>.

Notice how this example assumes that both characters are identified elsewhere in the transcription, with "comp" and "user" as respective values for their @xml:id attribute. A common place for such definitions in drama texts is in a “cast list,” introducing all characters of the play. For the encoding of cast lists, see section 4.1 in this tutorial module. When text is spoken by more than one character, the identification codes for all of them can be listed in the @who attribute, separated by white space.

Of course, the encoder is free to choose which identification mechanism will be used for the speakers in the transcription. When the markers are present in the text source, their bare transcription in <speaker> (see example 3) may suffice. It may make equal sense to use both systems (transcription of the actual speaker labels in the source text in <speaker>, and a formal identification in the @who attribute), as in example 4. If the source text has no explicit markers of the speaking characters, or if these markers are present but not deemed relevant to the transcription (and could be reconstructed systematically when rendering the transcription), only the formal identification in the @who may be used:

Note

It is important to point out the different nature of both identification mechanisms for speakers: <speaker> can only contain a literal transcription of a speaker label in the source text; @who can only contain an abstract identification code provided by the encoder.
<sp xmlns="http://www.tei-c.org/ns/1.0" who="#comp">
<p>I see.</p>
</sp>
<sp xmlns="http://www.tei-c.org/ns/1.0" who="#user">
<p>What is in the box?</p>
</sp>
<sp xmlns="http://www.tei-c.org/ns/1.0" who="#comp">
<p>A small black ball and a red ball.</p>
</sp>
Example 5. Formal identification of speakers in a @who attribute on <sp>.

Summary

The speeches of a drama text can be encoded in the <sp> element. In order to identify the speaking character, either the @who attribute may be used on the <sp> element, or the speaker label may be transcribed from the source text in a <speaker> element (or both mechanisms may be used together). Apart from the <speaker> label, speeches must contain structured text, either in <p>, <l> (possibly grouped in <lg>), or genre-neutral <ab> elements.

3.3. Stage Directions

The encoding of all speeches in our example text would still leave some text structures uncovered: all fragments enclosed in parentheses or square brackets. These text fragments are not meant to be spoken in the performance, but rather convey directions on aspects of the performance. Such “stage directions” can be transcribed with the <stage> element. Stage directions can address aspects of the acting, technical circumstances, locations, music, and so on. In order to distinguish between these different types of stage directions, the @type attribute can be used on the <stage> element. The encoder is free to develop an own typology for the kinds of stage directions.

Stage directions can occur in the transcription wherever they appear in the source text. In our example, the stage directions (between parentheses or square brackets) can be encoded as follows:

<div xmlns="http://www.tei-c.org/ns/1.0" type="act">
<head>A​CT I</head>
<div type="scene">
<head>S​CENE 1: H​ELLO W​ORLD</head>
<stage>user, computer</stage>
<stage type="setting">In a clean room without windows, a table stands in the middle, carrying a computer screen. The screen blinks, the computer emits random beeps.</stage>
<stage type="technical">[A big screen on the left hand side of the stage displays the word H​ELLO, and will be used to project the contents of the computer screen. A big screen on the right hand stage of the stage displays the word W​ORD, until the action starts. Then, the user​'s face is filmed in close up and projected onto this screen.]</stage>
<stage type="entrance">(user enters room, walks towards the table, takes a seat, and reads what​'s on the screen)</stage>
<stage type="technical">[spotlight follows user]</stage>
<sp>
<speaker>C​OMP</speaker>
<p>Hello.</p>
</sp>
<stage type="gesture">(user looks around and starts typing)</stage>
<sp>
<speaker>U​SER</speaker>
<p>What is on the table?</p>
</sp>
<sp>
<speaker>C​OMP</speaker>
<p>The table does not exist.</p>
</sp>
<stage type="gesture">(user frowns)</stage>
<sp>
<speaker>U​SER</speaker>
<p>There is a table in the room.</p>
</sp>
<sp>
<speaker>C​OMP</speaker>
<p>Ok.</p>
</sp>
<sp>
<speaker>U​SER</speaker>
<p>What is on the table?</p>
</sp>
<sp>
<speaker>C​OMP</speaker>
<p>Nothing.</p>
</sp>
<sp>
<speaker>U​SER</speaker>
<p>There is a black box on the table.</p>
</sp>
<sp>
<speaker>C​OMP</speaker>
<p>Ok.</p>
</sp>
<sp>
<speaker>U​SER</speaker>
<p>What is on the table?</p>
</sp>
<sp>
<speaker>C​OMP</speaker>
<p>A black box.</p>
</sp>
<sp>
<speaker>U​SER</speaker>
<p>There is a black box on the table.</p>
</sp>
<stage type="technical">(computer beeps frantically)</stage>
<sp>
<speaker>C​OMP</speaker>
<p>I know.</p>
</sp>
<sp>
<speaker>U​SER</speaker>
<p>There is a red ball in the black box.</p>
</sp>
<sp>
<speaker>C​OMP</speaker>
<p>Ok.</p>
</sp>
<sp>
<speaker>U​SER</speaker>
<p>Where is the black box?</p>
</sp>
<sp>
<speaker>C​OMP</speaker>
<p>The black box is on the table.</p>
</sp>
<sp>
<speaker>U​SER</speaker>
<p>What is in the black box?</p>
</sp>
<sp>
<speaker>C​OMP</speaker>
<p>A red ball.</p>
</sp>
<sp>
<speaker>U​SER</speaker>
<p>Where is the black ball?</p>
</sp>
<sp>
<speaker>C​OMP</speaker>
<p>I don​'t know what black ball you are talking about.</p>
</sp>
<sp>
<speaker>U​SER</speaker>
<p>Where is the red ball?</p>
</sp>
<sp>
<speaker>C​OMP</speaker>
<p>In the black box.</p>
</sp>
<sp>
<speaker>U​SER</speaker>
<p>There is a black ball in the box.</p>
</sp>
<sp>
<speaker>C​OMP</speaker>
<p>Ok.</p>
</sp>
<sp>
<speaker>U​SER</speaker>
<p>The red ball is on the table.</p>
</sp>
<sp>
<speaker>C​OMP</speaker>
<p>In fact the red ball is in the box.</p>
</sp>
<sp>
<speaker>U​SER</speaker>
<p>The ball in the box is small.</p>
</sp>
<sp>
<speaker>C​OMP</speaker>
<p>I cannot understand such a sentence at the present time.</p>
</sp>
<sp>
<speaker>U​SER</speaker>
<p>The black ball is small.</p>
</sp>
<sp>
<speaker>C​OMP</speaker>
<p>I see.</p>
</sp>
<sp>
<speaker>U​SER</speaker>
<p>What is in the box?</p>
</sp>
<sp>
<speaker>C​OMP</speaker>
<p>A small black ball and a red ball.</p>
</sp>
<stage type="exit">(user smiles, stands up, takes a small green ball out of his pocket, walks away throwing and catching the ball)</stage>
<stage type="technical">[
<name type="music​.group">Kraftwerk</name>
's "
<name type="music​.song">Computer Love</name>
" accompanies the exit]</stage>
</div>
</div>
Example 6. Encoding of stage directions in <stage>.

The TEI Guidelines provide a specialised element for indicating movements of characters: the empty <move> element. It may be used either to complement implicit movement descriptions in stage directions, or to formally document movement of characters when it is absent from the stage directions. Being an empty element, all information is conveyed in specific attributes. The @type attribute can be used to categorise the movement (for example: "entrance" and "exit" for stage entrance and exit, respectively; or "onStage" for movement on stage). The moving character(s) can be indicated in the @who attribute, as a white space separated list of formal codes with which they are identified elsewhere in the transcription (see section 4.1). The @where attribute may be used to specify the direction of the movement. Any system of location information may be used (for example, "C" for “centre,” "L" for “stage left,” "R" for “stage right”), as long as it is identified in the header of the transcription.

In our example, the encoding of the entrance and exit of the user character could be formally enriched with the <move> element:

<stage xmlns="http://www.tei-c.org/ns/1.0" type="entrance">
<move who="#user" type="entrance" where="C"/>
(user enters room, walks towards the table, takes a seat, and reads what​'s on the screen)</stage>
<!-- ... -->
<stage xmlns="http://www.tei-c.org/ns/1.0" type="exit">
<move who="#user" type="exit" where="L"/>
(user smiles, stands up, takes a small green ball out of his pocket, walks away throwing and catching the ball)</stage>
Example 7. Encoding of movement of actors with <move>.

It is equally possible to distinguish technical stage directions from general stage directions, using specialised elements. Directions regarding sounds can be encoded using the <sound> element. The @type element can indicate what kind of sound is meant, while the @discrete attribute can specify whether the sound overlaps with the surrounding speeches ("false"), or not ("true"). For example:

<stage xmlns="http://www.tei-c.org/ns/1.0" type="setting">In a clean room without windows, a table stands in the middle, carrying a computer screen. The screen blinks,
<sound type="sound​Effect">the computer emits random beeps</sound>
.</stage>
<!-- ... -->
<sound xmlns="http://www.tei-c.org/ns/1.0" type="sound​Effect" discrete="true">(computer beeps frantically)</sound>
<!-- ... -->
<sound xmlns="http://www.tei-c.org/ns/1.0" type="music">[
<name type="music​.group">Kraftwerk</name>
's "
<name type="music​.song">Computer Love</name>
" accompanies the exit]</sound>
Example 8. Encoding of sound effects with <sounnd>.

Descriptions of the contents of a screen (usual in transcriptions of screen plays) can be encoded in a <view> element. In our example, this could apply to the description of both screens on the stage:

<stage xmlns="http://www.tei-c.org/ns/1.0" type="technical">[
<view>A big screen on the left hand side of the stage displays the word H​ELLO, and will be used to project the contents of the computer screen.</view>
<view>A big screen on the right hand stage of the stage displays the word W​ORD, until the action starts. Then, the user​'s face is filmed in close up and projected onto this screen.</view>
]</stage>
Example 9. Encoding of screen contents with <view>.

Specific camera directions can be encoded using the <camera> element, whose @type attribute can specify the direction:

<stage xmlns="http://www.tei-c.org/ns/1.0" type="technical">[
<view>A big screen on the left hand side of the stage displays the word H​ELLO, and will be used to project the contents of the computer screen.</view>
<view>A big screen on the right hand stage of the stage displays the word W​ORD, until the action starts. Then, the user​'s face is filmed in
<camera type="angle">close up</camera>
and projected onto this screen.</view>
]</stage>
Example 10. Encoding of camera directions with <camera>.

Directions about text to be displayed on screen (usual in transcriptions of screen plays) can be encoded in a <caption> element. This could be applied to the directions on both screens on the stage:

<stage xmlns="http://www.tei-c.org/ns/1.0" type="technical">[
<view>A big screen on the left hand side of the stage displays the word
<caption>H​ELLO</caption>
, and will be used to project the contents of the computer screen.</view>
<view>A big screen on the right hand stage of the stage displays the word
<caption>W​ORD</caption>
, until the action starts. Then, the user​'s face is filmed in
<camera type="angle">close up</camera>
and projected onto this screen.</view>
]</stage>
Example 11. Encoding of text in a caption on the screen with <caption>.

Other technical stage directions, not meant for the actors can be encoded with the <tech> element, with a @type attribute for classifying the stage direction. The @type attribute can be used for a typology of such technical directions: "light" (a lighting instruction), "sound" (a sound instruction), "prop" (a direction for props), or "block" (a blocking cue, i.e., related to position and movement on stage).

<tech xmlns="http://www.tei-c.org/ns/1.0" type="light">[spotlight follows user]</tech>
Example 12. Encoding of a technical stage direction with <tech>.

Summary

Stage directions, providing more information about the circumstances of the performance, may be transcribed using the <stage> element. The @type attribute can be used to further specify the kind of stage direction. Movement of characters can be formally encoded using the empty <move> element. The type of movement can be specified in a @type attribute, a pointer to the moving character can be provided in the @who attribute, and the location of the movement can be specified in the @where attribute. Technical stage directions can be distinguished from more general ones by means of specific elements. Sound directions can be encoded with the <sound> element, whose @type attribute can specify the kind of sound described, while the @discrete attribute can specify whether the sound overlaps ("false") with the dialogue or not ("true"). Descriptions of screen contents can be encoded with the <view> element. Specific camera instructions can be encoded with <camera>, whose @type attribute specifies the kind of camera instruction. Directions for screen captions can be encoded using the <caption> element. Other technical stage directions can be encoded in the <tech> element, whose @type attribute can be used to specify whether the stage direction concerns lighting cues ("light"), sound cues ("sound"), prop cues ("prop"), or blocking cues ("block").