abbrabbreviation
contains an abbreviation of any sort.
expan
expansion
gives an expansion of the abbreviation.
CDATA
any string of characters
#IMPLIED
Rd>.
]]>
Only one expansion may be given for an abbreviation; if
different expansions are to be proposed, the tags for critical apparatus
should be used.
resp
responsibility
identifies the individual responsible for the interpretation given
the abbreviation.
CDATA
any string of characters, usually the initials of the
individual, if known, or a descriptive phrase (e.g. 16c hand) if
not.
#IMPLIED
Rd>.
]]>
If no expansion is given, the resp attribute has
no meaning.
type
allows the encoder to classify the abbreviation according to some
convenient typology.
CDATA
suspension the abbreviation provides the first
letter(s) of the word or phrase, omitting the remainder.
contraction the abbreviation omits some letter(s) in
the middle.
brevigraph the abbreviation comprises a special
symbol or mark.
superscription the abbreviation includes writing above
the line.
acronym the abbreviation comprises the initial
letters of the words of a phrase.
title the abbreviation is for a title of address
(Dr, Ms, Mr, ...)
organization the abbreviation is for the name of an
organization.
geographic the abbreviation is for a geographic name.
#IMPLIED
RSVP
SPQR&per;yr hbl servt
]]>
The type attribute is provided for the sake of
those who wish to classify abbreviations at their point of occurrence;
this may be useful in some circumstances, though usually the same
abbreviation will have the same type in all occurrences. As the sample
values make clear, abbreviations may be classified by the method used to
construct them, the method of writing them, or the referent of the term
abbreviated; the typology used is up to the encoder and should be
carefully planned to meet the needs of the expected use.
SPQR
]]>
This tag is the mirror image of the expan tag; both
allow the encoder to transcribe both an abbreviation and its expansion.
In abbr, however, the original is transcribed as the content of
the element and the expansion as an attribute value; expan
reverses this. The choice between the two is up to the user.
The abbr tag is not required; if appropriate, the encoder
may transcribe abbreviations in the source text silently, without
tagging them. If abbreviations are not transcribed directly but
expanded silently, then the TEI header should so indicate.
For a typology of ME abbreviations, see A. G. Petty,
English literary hands from Chaucer to
Dryden (London: Edward Arnold, 1977), pp. 22-25.additional tag set for common core featuresMay contain character data and phrase-level elements.
%phrase.seq;
]]>
]]>
addaddition
contains letters, words, or phrases inserted in the text by a
scribe, annotator, corrector, editor, or encoder.
place
if the the addition is written into the copy text, indicates where
the additional text is written.
CDATA
inline addition is made in a space left in the
witness by an earlier scribe
supralinear addition is made above the line
infralinear addition is made below the line
left addition is made in left margin
right addition is made in right margin
top addition is made in top margin
bottom addition is made in bottom margin
opposite addition is made on opposite page
verso addition is made on verso of sheet
mixed addition is made somewhere, one or more of
other values
#IMPLIED
For additions made by editors or encoders, no location need
be specified.
resp
responsible
identifies the individual responsible for the addition.
CDATA
Any string of characters, e.g. hand 1, hand 2,
author, etc., or the initials of individuals known by name.
#IMPLIED
additional tag set for common core featuresMay contain character data and phrase-level elements.
%specialPara;
]]>
]]>
addresscontains a postal or other address, for example of a
publisher, an organization, or an individualvia Marsala 24>
40126>
Bologna>
Italy>
]]>
P.O. Box 6998>
Chicago>
Illinois>
60680>
USA>
]]>
This is a general purpose crystal: it may be given
only using the defined substructural elements. To tag addresses
in running text without this substructure, use the place
element.
base tag set for common core featuresIf given as running prose, use a consistent format wherever
possible, for example separating lines of the address by commas, and
including any postal code in the standard form.indiv organization
city country org postbox postcode region street
]]>
]]>
agent
agent (individual or corporate body)
defines a group of elements which contain names of individuals
or corporate bodies.
This class is used in the resp element, to allow
a statement of responsibility to apply to an individual or a body.
base tag set for common core features
analyticanalytic level
contains bibliographic elements describing an item (e.g. an
article or poem) published within a monograph or journal and not as an
independent publication.
The analytic element may occur only within
bibliographic citation or reference elements; it is mandatory for
description of the analytic level of bibl.struct elements.
base tag set for common core featuresMay contain titles and statements of responsibility (author,
editor, or other), in any order.
bibl.struct
author editor resp title
]]>
]]>
authorprimary statement of responsibility for a
bibliographic item, for example the name of a single author, institution
or organization, or of several such. British Broadcasting Corporation
La Fayette, Marie Madeleine Pioche de la Vergne,
comtesse de (1634-1693)
]]>
Particularly where cataloguing is likely to be based on
the content of the header, it is advisable to use generally recognized
authority lists for the exact form of personal names.
In the case of a broadcast, use this element for the name of the
company or network which broadcasts the program.
base tag set for common core featuresAny string of characters and phrase-level tags.analytic monogr
%phrase.seq
]]>
]]>
biblbibliographic citationcontains a loosely-structured bibliographic citation of which
the sub-components may or may not be explicitly tagged. Blain, Clements and Grundy: Feminist Companion to Literature in
English (Yale, 1990)
]]>
The Interesting story of the Children in the
Wood.
In Victor E Neuberg,
The Penny Histories.
OUP1968.
]]>
base tag set for common core featuresContains either just prose, or any combination of elements
from the citation classlist.bibl
%m.bibl.part; %m.phrase; #PCDATA
]]>
]]>
biblbibliographic elements. base tag set for common core featuresbibl.part
bibliographic citation part
elements which can appear within bibliographic citations, but
cannot appear freely in running text outside bibliographic citations.
This class is used in defining the content model of
bibl.
base tag set for common core features
bibl.fullcontains a fully-structured bibliographic citation, in which all
components of the TEI file description save the source description
are present.Blain, Virginia
Clements, Patricia
Grundy, Isobel
The Feminist Companion to Literature in English:
women writers from the middle ages to the present
UK edition1231 pp1990
Yale University Press
New Haven and London
1990
No source: this is an original work
]]>
base tag set for common core featureslist.bibl
edition.stmt extent notes.stmt publication.stmt series.stmt source.desc title.stmt
]]>
]]>
biblScopescope of citationdefines the scope of a bibliographic reference, for example as a
list of pagenumbers, or a named subdivision of a larger work.
type
identifies the type of information conveyed by the element, e.g.
pages, volume.
CDATA
volume the element contains a volume number.
issue the element contains an issue number, or volume and
issue numbers.
pages the element contains a page number or page range.
chapterthe element contains a chapter indication (number
and/or title)
part the element identifies a part of a book or collection.
#IMPLIED
pp 12-34
]]>
Volume 2 "My apprenticeship"
]]>
base tag set for common core featuresimprint monogr series
%phrase.seq
]]>
]]>
bibl.structstructured bibliographic citationcontains a structured bibliographic citation, in which only
bibliographic subelements appear and in a specified order. Blain, Virginia
Clements, Patricia
Grundy, Isobel
The Feminist Companion to Literature in
English: women writers from the middle ages
to the present>
Yale University Press>
New Haven and London>
1990>
]]>
base tag set for common core featureslist.bibl
analytic idno monogr note series
]]>
]]>
chunkincludes all elements which can occur between, but not
within, paragraphs and other chunks.
This element class does not contain all those elements which
can appear between chunks: the class inter contains a
set of elements which can appear either within or between chunks.
Unlike elements of that class, chunks cannot occur within chunks.
In prose, this means the
elements in this class can appear between but not within paragraphs.
base tag set for common core features
chunk.seq
chunk sequence
defines a sequence of chunks, i.e. of paragraph-level
elements of the class chunk or of the class
inter.
'(%m.inter; | %m.chunk;)*'
citycitycontains the name of a town, city or village occurring within an address
or imprint.
London
Truth or Consequences
]]>
base tag set for common core featuresaddress
%phrase.seq
]]>
]]>
colcolumn
contains one column of a multi-column reference edition.
<(La)col n=a> ...
<(La)col n=b> ...
<(La)page n=33> ...
]]>
The col tag should be used wherever a standard
reference scheme uses references to individual columns of a reference
edition; otherwise it need not be used.
base tag set for physical description of the copy textMay contain line elements, or character data.
page
#PCDATA line
]]>
]]>
corrcorrection
contains the correct form of a passage apparently erroneous in the
copy text.
sic
gives the original form of the apparent error in the copy text.
CDATA
any string of characters
#IMPLIED
a' babbled
of green fields.
]]>
resp
responsibility
identifies the individual responsible for the correct form given.
CDATA
Sample values include ed, scribe, or the initials
of the individual in question.
#IMPLIED
If all that is desired is to call attention to the
fact that the copy text has been corrected, no attributes are required:
can we prove or disprove anyone's theories?
]]>
It is also possible to provide a correct reading and to identify
the individual responsible for the correction:
can we prove or
disprove anyone's theories?
]]>
The corr tag is a mirror of sic: the
latter leaves the original text untouched, giving the correction as an
attribute value; the former substitutes the correction, leaving the
original reading as an attribute value. The choice between them is up
to the encoder.
additional tag set for common core featuresMay contain character data and phrase-level elements.
%specialPara;
]]>
]]>
countrycountrycontains the largest geographic, administrative or political unit
identified within an address. Bosnia-Herzegovina
]]>
Denmark
]]>
Values for identifying attributes should be taken from recognised
lists of abbreviations, e.g. ISO 3166.
base tag set for common core featuresaddress
%soup
]]>
]]>
dataphrase-level elements containing names, dates, numbers, measures,
and similar data. base tag set for common core featuresdatecontains a date in any format.
calendar
indicates the system or calendar to which the date belongs.
CDATA
Recommended values include:
Gregorian,
Julian,
Roman,
Mosaic,
Revolutionary,
Islamic.
#IMPLIED
Feb. 22, 1732
(Feb. 11, 1731/32,
O.S.).
]]>
value
gives the value of the date in some standard form, usually
yyyy-mm-dd.
CDATA
Any string representing a date in standard format; recommended
form is yyyy-mm-dd, as defined by ISO 8601: 1988, Data
elements and interchange formats --- Information interchange ---
Representation of dates and times.#IMPLIED
27th of May (old style).
]]>
For simple dates, the value should give the Gregorian date
in the form (yyyy-mm-dd) specified by ISO 2014. More complicated dates
or special applications may require another calendar or another form;
these should be documented in the std.vals element in the TEI
Header.
certainty
indicates the degree of certainty attributed to the date.
CDATA
Any appropriate value, e.g.
ca.,
approx,
after,
before.
#IMPLIED
21 Feb 1980
]]>
Twelfth Day of June
in the Year of Our Lord One Thousand Nine Hundred and
Seventy-seven of the Republic the Two Hundredth and first
and of the University the Eighty-Sixth.
]]>
September 1990
]]>
additional tag set for common core featuresMay contain character data and phrase-level elements.
%phrase.seq;
]]>
]]>
date
dates and date ranges
defines a class of elements containing date specifications.
This class allows certain content models to allow either
a single date or a date-range element.
base tag set for common core features
dateRangedate range
contains two dates or another phrase delimiting a time period.
calendar
indicates the system or calendar to which the date belongs.
CDATA
Recommended values include:
Gregorian,
Julian,
Roman,
Mosaic,
Revolutionary,
Islamic.
#IMPLIED
from
indicates the starting point of the period in standard form.
CDATA
any date in a standard form; recommended form is yyyy-mm-dd.
#IMPLIED
The value should conform to the standard form declared in
the std.vals element in the TEI header.
to
indicates the ending point of the period in standard form.
CDATA
any date in a standard form; recommended form is yyyy-mm-dd.
#IMPLIED
The value should conform to the standard form declared in
the stdVals element in the TEI header.
exact
indicates the precision to be attached to either or both dates
specified.
(to|from|both|none)
tothe to date is exact
fromthe from date is exact
bothboth dates are exact
noneboth dates are approximate or unspecified
#IMPLIED
Early 12th century
]]>
date values should conform to the standard form declared in
the stdVals element in the TEI header.
1846-48)
the first collection of Hungarian folk poetry.
]]>
additional tag set for common core featuresMay contain character data and phrase-level elements.
%phrase.seq;
]]>
]]>
deldeletion
contains a word or passage deleted, marked as deleted, or
identified as superfluous or spurious in the copy text.
rend
rendition
indicates how the deletion was indicated in the copy text.
CDATA
subpunction dots below the line indicate matter to be
deleted.
overstrike lines through the text indicated matter to
be deleted.
erasure material to be deleted has been erased (but
remains legible enough to transcribe).
bracketed brackets around the material indicate that
it is spurious or superfluous.
#IMPLIED
type
classifies the type of deletion using any convenient typology.
CDATA
any string identifying the class of deletion.
#IMPLIED
No recommendation of any particular typology is made here;
to record the manner in which the deletion is signaled, use
rend, not type.
status
may be used to indicate faulty deletions, e.g.
strikeouts which include too much or too little text.
CDATA
any description of flaws in the marking of a deletion, e.g.
excess left, excess right, short left, short
right.
'unremarkable'
Status information on each deletion is needed rather rarely
except in critical editions from authorial manuscripts.
resp
responsible
identifies the individual responsible for making the deletion.
CDATA
Any string of characters, e.g. hand 1, hand 2,
etc., or the initials of individuals known by name.
#IMPLIED
Mein Frisch
schwebt weht der Wind
]]>
Cf. omit.
Degrees of uncertainty over what (if anything) can
still be read may be indicated by use of the
certainty element (see ).
This element should be used for deletion of shorter
sequences of text, typically single words or phrases.
The delSpan element should be used for longer
sequences of text, for those containing structural
subdivisions, and for those containing overlapping
additions and deletions.
additional tag set for common core featuresMay contain character data and phrase-level elements.
%phrase.seq;
]]>
]]>
distinctdistinctidentifies any word or phrase which is regarded as linguistically
distinct, for example as archaic, technical, dialectal, non-preferred,
etc., or as forming part of a sublanguage. typespecifies the sublanguage or register to which the word or phrase
is being assignedCDATA
a semi-open user-defined list#IMPLIED
timespecifies how the phrase is distinct diachronically
CDATA
a semi-open user-defined list#IMPLIED
spacespecifies how the phrase is distinct diatopically
CDATA
a semi-open user-defined list#IMPLIED
socialspecifies how the phrase is distinct diastatically
CDATA
a semi-open user-defined list#IMPLIED
fag of
Macrea's, that there was trouble in their midst which
King would fain keep
secret.
]]>
]]>
base tag set for common core features%phrase.seq;
]]>
]]>
div.gentext division generated automatically
indicates the location at which an div generated
automatically by a text-processing application is to appear.
type
specifies what type of generated text division (e.g. index,
table of contents, etc.) is to appear.
CDATA
index an index is to be generated and inserted at this point.
toc a table of contents
figlist a list of figures
tablist a list of tables
#IMPLIED
Valid values are application-dependent; those shown are of
obvious utility in document production, but are by no means exhaustive.
One use for this element is to allow document preparation
software to generate an index and insert it in the appropriate place in
the output. The example below assumes that the index
attribute on the index element has been used to specify index
entries for multiple indices, 1 and 2.
Bibliography
...
]]>
Another use for div.gen is to specify the
location of an automatically produced table of contents:
...
Preface
...
]]>
This element is intended primarily for use in document
production or manipulation, rather than in the transcription of
pre-existing materials; it makes it easier to specify the location of
indices, tables of contents, etc., to be generated by text preparation
or word processing software. The n attribute should be used
to give a title for the text division being generated.
additional tag set for common core featuresEmpty.
EMPTY
]]>
]]>
editphrase-level elements for simple editorial correction and
transcription. base tag set for common core featureseditionEditiondescribes the particularities of one edition of a text.First edition Oct 1990Students' edition
]]>
auxiliary tag set for TEI headersmonogr
%phrase.seq;
]]>
]]>
editoreditorsecondary statement of responsibility for a
bibliographic item, for example the name of an individual, institution
or organization, (or of several such) acting as editor, compiler,
translator, etc. rolespecifies the nature of the intellectual responsibility
CDATA
semi-open list (examples might include: translator, editor,
compiler, illustrator, etc.)
editor
Eric Johnson
John Tenniel
]]>
Particularly where cataloguing is likely to be based on
the content of the header, it is advisable to use generally recognized
authority lists for the exact form of personal names.
base tag set for common core featuresA consistent format should be adoptedanalytic monogr series
%phrase.seq
]]>
]]>
emphemphasized
marks words or phrases which are stressed or emphasized for
linguistic or rhetorical effect.
what?!!
]]>
What it all comes to is this, he said. What
does Christopher Robin do in the morning nowadays?
]]>
base tag set for common core featuresfree prose
%phrase.seq;
]]>
]]>
expanexpansion
contains the expansion of an abbreviation.
abbr
abbreviation
gives the abbreviation in its unexpanded form.
CDATA
any string of characters
#IMPLIED
Road>.
]]>
resp
responsibility
identifies the individual responsible for the interpretation given
the abbreviation.
CDATA
any string of characters, usually the initials of the
individual, if known, or a descriptive phrase (e.g. 16c hand) if
not.
#IMPLIED
type
allows the encoder to classify the abbreviation according to some
convenient typology.
CDATA
any useful classification name, e.g. suspension,
contraction, brevigraph, title,
organization, geographic, etc.
#IMPLIED
The type attribute is provided for the sake of
those who wish to classify abbreviations at their point of occurrence;
this may be useful in some circumstances, though usually the same
abbreviation will have the same type in all occurrences. As the sample
values make clear, abbreviations may be classified by the method used to
construct them, the method of writing them, or the referent of the term
abbreviated; the typology used is up to the encoder and should be
carefully planned to meet the needs of the expected use.
This tag is the mirror image of the abbr tag; both
allow the encoder to transcribe both an abbreviation and its expansion.
In abbr, however, the original is transcribed as the content of
the element and the expansion as an attribute value; expan
reverses this. The choice between the two is up to the user.
The expan tag is not required; if appropriate, the encoder
may expand abbreviations in the source text silently, without
tagging them. If this is done, the TEI header should so indicate.
additional tag set for common core featuresMay contain character data and phrase-level elements.
%phrase.seq;
]]>
]]>
extentdescribes the approximate size of the electronic text as stored on
some carrier medium, specified in any convenient units.
3200 sentences
between 10 and 20 Mb
ten 3.5 inch high density diskettes
]]>
auxiliary tag set for TEI headersbibl.full
%phrase.seq;
]]>
]]>
foreignforeignidentifies a word or phrase as belonging to some language other
than that of the surrounding text. lang
language
identifies the language of the word or phrase marked.
IDREF
contains a language code (associated with some writing system
declaration) for the language in question; where applicable, the
codes of ISO 639 should be used.
#IMPLIED
It is strongly recommended that the lang
attribute be consistently specified on all foreign elements.
lapis philosophicus?
]]>
This element is intended for use only where no other element
is available to mark the phrase or words concerned. The global
lang attribute should be used in preference to this element
where it is intended to mark the language of the whole of some text
element.
The distinct element may be used to identify phrases
belonging to sublanguages or registers not generally regarded as true
languages.
base tag set for common core featuresfree prose%phrase.seq;
]]>
]]>
glossidentifies a phrase or word used to provide a gloss or
definition for some other word or phrase target
identifies the associated term element
IDREF
must be a valid identifier for some term
element in the current document#IMPLIED
discoursal point of view
as the relationship, expressed through discourse
structure, between the implied author or some other addresser,
and the fiction.
]]>
base tag set for common core featuresfree prose
%phrase.seq;
]]>
]]>
headheading
contains any heading, for example, the title of a section,
or the heading of a list or glossary.
The most common use for the head element
is to mark the headings of sections. In older writings, the
headings or incipits may be rather longer than
usual in modern works. If a section has an explicit ending as
well as a heading, it should be marked as a trailer, as
in this example:
In the name of Christ here begins
the first book of the ecclesiastical history of Georgius
Florentinus, known as Gregory, Bishop of Tours.
Chapter-Headings
In the name of Christ here begins Book I of the
history.
Proposing as I do ...
From the Passion of our Lord until the death of Saint Martin
four hundred and twelve years passed.
Here ends the first Book, which covers five thousand,
five hundred and ninety-six years from the beginning of the
world down to the death of Saint Martin.
]]>
The head tag is also used to mark headings
of other units, such as lists:
Connectives
above
accordingly
across from
adjacent to
again
...
]]>
The head tag is used for headings at all
levels; processing programs which treat (e.g.) chapter headings,
section headings, and list titles differently must determine the
proper processing of a head element based on its
structural position. A head occurring as the first
element of a list is the title of that list; one occurring as
the first element of a div1 is the title of that
chapter or section.
additional tag set for common core featuresMay contain character data and phrase-level elements.
lg list list.bibl
%phrase.seq;
]]>
]]>
head.itemheading for list items
contains the heading for the item or gloss column in a
glossary list or similar structured list.
TRITE
SIMPLE, STRAIGHTFORWARD
The head.item element may appear only if
each item in the list is preceded by a label.
additional tag set for common core featuresMay contain character data and phrase-level elements.
list
%phrase.seq;
]]>
]]>
head.labelheading for list labels
contains the heading for the label or term column in a
glossary list or similar structured list.
TRITE
SIMPLE, STRAIGHTFORWARD
bury the hatchet stop fighting, make peace
at loose ends disorganized
on speaking terms friendly
fair and square completely honest
at death's door near death
]]>
The head.label element may appear only if
each item in the list is preceded by a label.
additional tag set for common core featuresMay contain character data and phrase-level elements.
list
%phrase.seq;
]]>
]]>
hihighlightedmarks a word or phrase as typographically distinct from the
surrounding text, for reasons concerning which no claim is made. rend
rendition
describes the rendition or presentation of the word or phrase
highlighted.
CDATA
a set of keywords from some suitable vocabulary; no systematic
recommendations for such a vocabulary are made by these Guidelines.
#IMPLIED
The rend attribute, though optional in general,
is recommended on the hi element.
And this Indenture further witnesseth
that the said Walter Shandy, merchant,
in consideration of the said intended marriage ...
]]>
auxiliary tag set for TEI headersfree prose%phrase.seq
]]>
]]>
hqphrasephrase-level elements related to highlighting. base tag set for common core featureshqinterintermediate-level elements related to highlighting. base tag set for common core featuresidnoidentifying numbersupplies any standard or non-standard number used to identify a
bibliographic item.type
categorizes the number, for example as an ISBN or other
standard series.
CDATA
A name or abbreviation indicating what type of identifying
number is given (e.g. ISBN, LCCN).#IMPLIED
0143-3385
116
]]>
auxiliary tag set for TEI headersbibl.struct
#PCDATA
]]>
]]>
imprintgroups information relating to the publication or distribution
of a bibliographic item. Oxford, Clarendon Press, 1987
OxfordClarendon Press1987
]]>
base tag set for common core featuresmonogr
%m.date %m.place; biblScope publisher
]]>
]]>
indexindex entry
marks a location to be indexed for whatever purpose.
index
index number
indicates which index (of several) the index entry belongs to.
CDATA
any string of characters; valid values are
application-dependent.
#IMPLIED
This attribute makes it possible to create multiple
indices for a text.
level1
first-level index entry
gives the form under which the index entry is to be made.
CDATA
any string of characters.
#REQUIRED
At least one level of entry is required.
level2
second-level index entry
gives the second-level form, if any.
any string of characters.
#IMPLIED
level3
third-level index entry
gives the third-level form, if any.
any string of characters.
#IMPLIED
level4
fourth-level index entry
gives the fourth-level form, if any.
any string of characters.
#IMPLIED
b. Azarya, son of one of the last gaons of Sura,
was David's own first cousin.
]]>
additional tag set for common core featuresEmpty.
EMPTY
]]>
]]>
indivindividual
contains structured information about an individual.
To tag just the name of the individual, use the name
or person tag.
base tag set for common core featuresMay contain the name of the individual, zero or more addresses,
and zero or more affiliations.
address affil name person
]]>
]]>
inter
elements of the intermediate (inter-level) class, which can occur
both within paragraphs and between paragraphs.
This element class contains a subset of those elements which
can appear in the unstructured soup with which
paragraph and other elements at the lowest level of crystal structures
are filled: specifically all the elements which can also occur as
structural elements in their own right. In prose, this means the
elements in this class can appear both within and between paragraphs.
This class is thus distinct from the purely phrase-level elements which
can appear only within soup, and not on their own; the latter class, in
keeping with this metaphor, is called broth; it
is represented by the class phrase. Cf. also the
class chunks.
base tag set for common core features
itemcontains one component of a list.
n
name or number
indicates the number (or letter, or other enumerator) borne by this
list item in the copy text.
CDATA
Whatever string of characters is used to label a list item in
the copy text may be used as the value of n, but it is not
required that numbering be recorded explicitly.
#IMPLIED
In ordered lists, the n attribute on the
item element is by definition synonymous with the use of the
label element to record the enumerator of the list item. In
glossary lists, however, the term being defined should be given with the
label element, not n.
Here begin the chapter headings of Book IV
The death of Queen Clotild.
How King Lothar wanted to appropriate
one third of the Church revenues.
The wives and children of Lothar.
The Counts of the Bretons.
Saint Gall the Bishop.
The priest Cato.
...
]]>
additional tag set for common core featuresMay contain simple prose or a sequence of chunks.
list
%specialPara;
]]>
]]>
lverse linecontains a single, possibly incomplete, line of verse.partspecifies whether or not the line is metrically complete. (Y | N | I | M | F)
Ythe line is metrically incomplete
Neither the line is complete, or no claim is made as to its
completeness
Ithe initial part of an incomplete line
Ma medial part of an incomplete line
Fthe final part of an incomplete line
N
The values I, M, or F should be used only where it is
clear how the line is to be reconstituted.
metcontains a user-specified encoding for the metrical structure of
the line.CDATA
No recommendations are made at this time; some commonly used
example formats are described in section .
#IMPLIED
The conventions used for encoding metrical patterns must be
documented in the TEI Header.
]]>
base tag set for common core featurescontains character data or phrase level elements onlylg
%phrase.seq
]]>
]]>
labelcontains the label associated with an item in a list; in
glossaries, marks the term being defined.
Labels are most commonly used for the headwords in glossary
lists; note the use of the global lang attribute to set the
default language of the glossary list to Middle English, and identify
the glosses and headings as modern English or Latin.
Vocabulary
Middle EnglishNew Englishnu now
lhude loudly
bloweth blooms
med meadow
wude wood
awe ewe
lhouth lows
sterteth bounds, frisks
(cf. Chaucer, K.T.
644: a courser,
sterting as the fyrverteth pedit
murie merrily
swik cease
naver never
]]>
Labels may also be used to record explicitly the numbers or
letters which mark list items in ordered lists, as in this extract from
Gibbon's
Autobiography. In this usage the label
element is synonymous with the n attribute on the
item element.
(1) My first rough manuscript, without any
intermediate copy, has been sent to the press.
(2) Not a sheet has been seen by any human
eyes, excepting those of the author and the printer:
the faults and the merits are exclusively my own.
]]>
Labels may also be used for other structured list items, as
in this extract from the journal of Edward Gibbon:
March 1757. I wrote some critical observations
upon Plautus.
March 8th. I wrote a long dissertation upon
some lines of Virgil.
June. I saw Mademoiselle Curchod --
Omnia vincit amor, et nos cedamus amori.August. I went to Crassy, and staid two days.
]]>
The fourth volume of the History of the
Decline and Fall of the Roman Empirebegun March 1, 1782 -- ended June, 1784.
The fifth volume
begun July, 1784 -- ended May 1, 1786.
The sixth volume
begun May 18, 1786 -- ended June 27, 1787.
]]>
additional tag set for common core featuresMay contain character data and phrase-level elements.
list
%phrase.seq;
]]>
]]>
lgline groupcontains a group of verse lines functioning as a formal unit
e.g. a stanza, refrain, verse paragraph, etc.typecharacterizes the line group in some respect, e.g. as a verse
paragraph, stanza, verse, etc.CDATA
stanzastanzas of regular number of lines
parablank verse paragraph
freearbitrary grouping e.g. in free verse
refraina chorus or refrain
#IMPLIED
rhymecontains a user-specified encoding for the rhyme scheme of this
line group. CDATA
No recommendations are made at this time; some commonly used
example formats are described in section .
'*INHERITED'
The conventions used for encoding rhyme schemes must be
documented in the TEI Header.
partspecifies whether or not the line is metrically complete. (y | n | i | m | f)
ythe line group is metrically incomplete
neither the line group is complete, or no claim is made as to
its completeness.
ithe initial part of an incomplete line group
ma medial part of an incomplete line group
fthe final part of an incomplete line group
N
The values I, M, or F should be used only where it is
clear how the line group is to be reconstituted.
]]>
base tag set for common core featurescontains verse lines only, possibly prefixed by a
heading.lg
head l lg
]]>
]]>
lbline break
marks the start of a new (typographic) line in some
edition or version of a text.
ed
edition
indicates the edition or version in which the line break is located
at this point
CDATA
Any string of characters; usually a siglum conventionally used
for the edition.
#REQUIRED
]]>
n
number or name
indicates the number or other value associated with the line
which follows the point of insertion of this lb.
CDATA
Any string of characters.
#IMPLIED
Encoders should adopt a clear and consistent policy as to
whether the numbers associated with line breaks relate to the physical
sequence number of the line within the page, or to some aspect of the
logical structure of the text.
By convention, lb elements should appear at the start of the
line to which they refer.
Like other forms of milestone tag, lb tags cannot
be automatically verified by SGML; for better validation, a
concurrent markup stream should be used.
The lb tag is intended for making typographic line breaks
in prose. It should be carefully distinguished from the l
element, used to mark lines of verse.
additional tag set for common core featuresEMPTY
]]>
]]>
linecontains one line of a reference edition.
<(La)line n=1> ... [text of edition La, p. 32, l. 1]
<(La)line n=2> ... [text of edition La, p. 32, l. 2]
<(La)line n=3> ... [text of edition La, p. 32, l. 3]
<(La)line n=4> ... [text of edition La, p. 32, l. 4]
<(La)page n=33> ...
]]>
(optional)
base tag set for physical description of the copy textMay contain character data only.
col page
#PCDATA
]]>
]]>
listcontains any sequence of items organized as a list.
type
describes the form of the list.
CDATA
orderedlist items are numbered or lettered.
bulletedlist items are marked with a
bullet or other typographic device.
simplelist items are not numbered or bulleted.
glosseach list item glosses some term or
concept, which is given by a label element preceding
the list item.
simple
The formal syntax of the element declarations allows
label tags to be omitted from lists tagged list
type=gloss; this is however a semantic error.
a butcher
a baker
a candlestick maker, with
rings on his fingers
bells on his toes
]]>
The following example treats the short numbered clauses of
Anglo-Saxon legal codes as lists of items. The text is from an
ordinance of King Athelstan (924-939).
Athelstan's Ordinance
Concerning thieves. First, that no thief is to be
spared who is caught with the stolen goods, [if he is] over
twelve years and [if the value of the goods is] over eightpence.
And if anyone does spare one, he is to pay for the
thief with his wergild -- and the thief is to be no nearer a
settlement on that account -- or to clear himself by an oath of
that amount.
If, however, he [the thief] wishes to defend himself
or to escape, he is not to be spared [whether younger or older
than twelve].
If a thief is put into prison, he is to be in prison
40 days, and he may then be redeemed with 120 shillings; and the
kindred are to stand surety for him that he will desist for ever.
And if he steals after that, they are to pay for him
with his wergild, or to bring him back there.
And if
he steals after that, they are to pay for him with his wergild,
whether to the king or to him to whom it rightly belongs; and
everyone of those who supported him is to pay 120 shillings to
the king as a fine.
Concerning lordless men. And we pronounced about these
lordless men, from whom no justice can be obtained, that one
should order their kindred to fetch back such a person to justice
and to find him a lord in public meeting.
And if they then will not, or cannot, produce him on
that appointed day, he is then to be a fugitive afterwards, and
he who encounters him is to strike him down as a thief.
And he who harbours him after that, is to pay for him
with his wergild or to clear himself by an oath of that amount.
Concerning the refusal of justice. The lord who
refuses justice and upholds his guilty man, so that the king is
appealed to, is to repay the value of the goods and 120 shillings
to the king; and he who appeals to the king before he demands
justice as often as he ought, is to pay the same fine as the
other would have done, if he had refused him justice.
And the lord who is an accessory to a theft by his
slave, and it becomes known about him, is to forfeit the slave
and be liable to his wergild on the first occasionp if he does it
more often, he is to be liable to pay all that he owns.
And likewise any of the king's treasurers or of our
reeves, who has been an accessory of thieves who have committed
theft, is to liable to the same.
Concerning treachery to a lord. And we have pronounced
concerning treachery to a lord, that he [who is accused] is to
forfeit his life if he cannot deny it or is afterwards convicted
at the three-fold ordeal.
]]>
Note that nested lists have been used so the tagging mirrors the
structure indicated by the two-level numbering of the clauses.
The clauses could have been treated as a one-level list with
irregular numbering, if desired.
These decrees, most blessed Pope Hadrian, we propounded in the
public council ... and they confirmed them in our hand in your
stead with the sign of the Holy Cross, and afterwards inscribed
with a careful pen on the paper of this page, affixing thus the
sign of the Holy Cross.
I, Eanbald, by the grace of God archbishop of the holy
church of York, have subscribed to the pious and catholic
validity of this document with the sign of the Holy Cross.
I, Ælfwold, king of the people across the Humber,
consenting have subscribed with the sign of the Holy Cross.
I, Tilberht, prelate of the church of Hexham, rejoicing
have subscribed with the sign of the Holy Cross.
I, Higbald, bishop of the church of Lindisfarne, obeying
have subscribed with the sign of the Holy Cross.
I, Ethelbert, bishop of Candida Casa, suppliant, have
subscribed with thef sign of the Holy Cross.
I, Ealdwulf, bishop of the church of Mayo, have subscribed
with devout will.
I, Æthelwine, bishop, have subscribed through
delegates.
I, Sicga, `patrician', have subscribed with serene mind
with the sign of the Holy Cross.
]]>
additional tag set for common core featuresMay contain an optional heading followed by a series of
items, or a series of label and item pairs, the latter being optionally
preceded by one or two specialized headings.
head head.item head.label item label
]]>
]]>
list.biblcitation listcontains a list of bibliographic citations of any kind.
Works consulted
Blain, Clements and Grundy: Feminist Companion to Literature in
English (Yale, 1990)
The Interesting story of the Children in the Wood
The Penny HistoriesVictor E NeubergOUP1968
]]>
base tag set for common core featuresbibl bibl.full bibl.struct head trailer
]]>
]]>
listslist-like elements.base tag set for common core featureslocelements used for purposes of location and referencebase tag set for common core featuresmeasurecontains a word or phrase referring to a unit of measurement of
any kind, typically associated with a number.
type
specifies the type of units being measured.
CDATA
weightmeasure of weight e.g. kg, pound.
countunit of count, e.g. dozen, score.
lengthmeasure of length, e.g. pole, mm.
areameasure of area e.g. acre, hectare.
volumemeasure of volume e.g. litre, gallon.
currencyunit of currency e.g. ecu, escudo, mark.
#IMPLIED
reg
regularized
gives a standardized form for the name of the measure concerned
CDATA
Any string of characters.
#IMPLIED
In providing a regularized form, no
claim is made that the form in the source text is incorrect; the
regularized form is simply that chosen as the main form for purposes
of unifying variant forms under a single heading.
quid
merks of old extent
]]>
additional tag set for common core featuresMay contain character data and phrase-level elements.
%phrase.seq;
]]>
]]>
mentionedmarks words or phrases mentioned, not used
eluthemen
we were released, accented on
the second syllable of the word, and its
participial derivative lutheisreleased, accented on the last.
]]>
base tag set for common core featuresfree prose
%phrase.seq;
]]>
]]>
milestonemarks the boundary between sections of a text, as indicated by
changes in a standard reference system.
ed
edition
indicates which edition or version the milestone applies to.
CDATA
Any string of characters; usually a siglum conventionally used
for the edition.
#REQUIRED
n
number or name
indicates the new number or other value for the unit which changes
at this milestone.
CDATA
Any string of characters.
#IMPLIED
The special value unnumbered should be used in
passages which fall outside the normal numbering scheme (e.g. chapter
heads, poem numbers or titles, or speaker attributions in verse drama).
unit
indicates what kind of section is changing at this milestone.
CDATA
page page breaks in the reference edition.
column column breaks.
line line breaks.
book any units termed book,
liber, etc.
poem individual poems in a collection.
canto cantos or other major sections of a poem.
stanza stanzas within a poem, book, or canto.
act acts within a play.
scene scenes within a play or act.
sectionsections of any kind.
absent passages not present in the reference edition.
#REQUIRED
If the milestone marks the beginning of a piece of text not
present in the reference edition, the special value absent
may be used as the value of unit. The normal interpretation
is that the reference edition does not contain the text which follows,
until the next milestone tag for the edition in question is
encountered.
In addition to the values suggested, other terms may be appropriate
(e.g. Stephanus for the Stephanus numbers in Plato).
...
...
]]>
Milestones for page and column should precede
milestones for line numbers, but this and other logical requirements
cannot be enforced automatically by SGML; for better validation, a
concurrent markup stream should be used.
additional tag set for common core featuresEMPTY
]]>
]]>
monogrmonographic level
contains bibliographic elements describing an item (e.g. a book
or journal) published as an independent item (i.e. as a separate
physical object).
The monogr element may occur only within
bibliographic citation or reference elements; it is mandatory for
description of the monographic level of bibl.struct elements.
base tag set for common core featuresMay contain specialized bibliographic elements, in a
prescribed order.
bibl.struct
author biblScope edition editor imprint meeting note resp title
]]>
]]>
namename, proper noun
contains a proper noun or noun phrase.
type
indicates the type of the object which is being named by the
phrase.
CDATA
Values such as person, place, institution, product, acronym.
#IMPLIED
Proper nouns referring to people, places, and organizations
may be tagged instead with person, place, or
org.
additional tag set for common core featuresMay contain character data and phrase-level elements.
indiv organization
%phrase.seq;
]]>
]]>
names
elements with proper nouns as content.
key
provides an alternative identifier for the object being named,
such as a database record key.
CDATA
any string
#IMPLIED
Montaillou
is not a large parish.
At the time of the events which led to
Fournier's investigations,
the local population consisted of between 200 and
250 inhabitants.
]]>
The value may be a unique identifier from a database, or
simply a more explicit name for the referent. Its purpose is only to
record an identification; if the analysis leading to the identification
is to be recorded as well, the analytic tags described in chapter
should be used in addition or instead.
reg
regularization
gives a normalized or regularized form of the proper name
CDATA
Any string of characters.
#IMPLIED
Fournier's investigations,
the local population consisted of between 200 and
250 inhabitants.
]]>
In providing a regularized form, no
claim is made that the form in the source text is incorrect; the
regularized form is simply that chosen as the main form for purposes
of unifying variant forms under a single heading.
base tag set for common core features
notecontains a note or annotation.
n
number or symbol
indicates the symbol or number used to mark the note's point of
attachment to the main text.
CDATA
any string of characters
#IMPLIED
The alleged mention of Judah Nagid's mother
in a letter from 1071 is, in fact, a
reference to Judah's children; cf. above,
nn. 111 and 54.
is well known from Geniza documents published by Jacob Mann.
]]>
If notes are numbered in sequence and their numbering can be
reconstructed automatically by processing software, it may not be
considered necessary to record the note numbers.
type
describes the type of note.
CDATA
Values can be taken from any convenient typology of annotation
suitable to the work in hand; e.g. annotation, gloss, citation,
digression, preliminary, temporary, ...
#IMPLIED
resp
responsible
indicates who is responsible for the annotation: author, editor,
translator, etc.
CDATA
auth[or]
note originated with the author of the text.
ed[itor]
note added by the editor of the text.
comp[iler]
note added by the compiler of a collection.
tr[anslator]
note added by the translator of a text.
transcr[iber]
note added by the transcriber of a text into electronic form.
(initials)
note added by the individual indicated by the initials.
#IMPLIED
For specialized types of editorial annotation (e.g. for
marking corrections, normalizations, cruxes, etc.), see chapter
.
place
indicates where the note appears in the source text.
CDATA
foot
note appears at foot of page.
end
note appears at end of chapter or volume.
inline
note appears as a marked paragraph in the
body of the text.
left
note appears in left margin.
right
note appears in right margin.
interlinear
note appears between lines of the text.
app[aratus]
note appears in the apparatus at the foot of the
page.
'unspecified'
For pages with multiple apparatus, values such as
app1 and app2 can be used.
The place attribute can be used to indicate to text
formatting software where a note should be printed. If the locations
indicated do not agree with those in the copy text, that fact should be
indicated in the TEI header.
anchored
indicates whether the copy text shows the exact place of reference
for the note.
(yes | no)
yes
copy text indicates the place of attachment for the note.
no
copy text indicates no place of attachment for the note.
yes
In modern texts, notes are usually anchored by means of
explicit footnote or endnote symbols. An explicit indication of the
phrase or line annotated may however be used instead (e.g. page 218,
lines 3-4). The anchored attribute indicates whether any
explicit location is given, whether by symbol or by prose
cross-reference. If the specific symbols used are to be recorded, use
the n attribute.
target
indicates the point of attachment of a note, or the beginning of
the span to which the note is attached.
IDREF
reference to the id of an element which begins at
the location in question (e.g. the id of an anchor
element).
#IMPLIED
If target and target.end are to be
used to indicate where notes attach to the text, then elements at the
appropriate locations (anchor elements if necessary) must be
given id values to be pointed at.
target.end
points to the end of the span to which the note is attached, if
the note is not embedded in the text at that point.
IDREF
reference to the id of an element which
ends at the location in question, or to an empty element at
the point in question.
#IMPLIED
Malerisch. This
word has, in the German, two distinct meanings, one
objective, a quality residing in the object, the
other subjective, a mode of apprehension and creation.
To avoid confusion, they have been distinguished in
English as picturesque and
painterly respectively. (Tr.)
style of the Dutch genre painters of the seventeenth
century that drapery has this psychological significance.
]]>
additional tag set for common core featuresMay contain character data and phrase-level elements.
bibl.struct monogr
%specialPara;
]]>
]]>
notesnote-like elements. base tag set for common core featuresnumnumber
contains a number, written in any form.
type
indicates the type of numeric value.
CDATA
cardinal absolute number, e.g. 21, 21.5ordinal ordinal number, e.g. 21stfraction fraction, e.g. one half or three halvespercentage e.g. ten percent#IMPLIED
If a different typology is desired, other values can be used
for this attribute.
value
supplies the value of the number in an application-dependent
standard form.
CDATA
any numeric value in the chosen standard form.
#IMPLIED
The standard form used should be described in the
std.vals element in the TEI header. Standard forms may be
defined from scratch, or borrowed from existing practice (e.g.
Standard values are given in the notation defined for numeric
constants in the C language.)
twenty-one
1.5
He stands 1 · 90m. high.
]]>
Detailed analyses of quantities and units of measure in
historical documents should use the feature structure mechanism
described in chapter . The num element is
intended for use in simple applications.
additional tag set for common core featuresMay contain character data and phrase-level elements.
%phrase.seq;
]]>
]]>
omitomitted material
indicates a point where material has been omitted
in a transcription,
whether for editorial reasons described in the TEI header,
as part of sampling practice, or because the material is
illegible or inaudible.
desc
description
gives a description of the omitted text.
CDATA
a prose description of the material omitted.
#IMPLIED
reason
gives the reason for omission. Sample values include
sampling, illegible, inaudible, irrelevant,
canceled, canceled and illegible.
CDATA
any short indication of the reason for the omission.
#IMPLIED
resp
responsibility
indicates the individual responsible for the omission
(in the case of canceled material, the individual
responsible for the cancellation)
CDATA
any string of characters identifying the individual in
question, e.g. 2d hand or the initials of a known individual.
#IMPLIED
extent
indicates approximately how much text has been lost,
in letters, minims, inches, or any appropriate unit, for
cases where a deletion renders the text quite illegible.
CDATA
any string of characters
#IMPLIED
It is possible, but not always necessary, to provide
measurements precise to the millimeter or even to the printer's point.
The degree of precision attempted will vary with the purpose of the
encoding and the nature of the material.
additional tag set for common core featuresEmpty.
EMPTY
]]>
]]>
orgorganization
contains the name of an organization, or a reference to an
organization.
The Ministry of Love
]]>
This tag may be used to mark either all references to
organizations, or only those which use proper nouns; in the latter case
it is synonymous with name type=org. No distinction is made
between organizations and institutions; the org tag may be used
to mark references to either, or both. To provide further information
about the organization, e.g. an address, use the organization
element.
additional tag set for common core featuresMay contain character data and phrase-level elements.
address organization
%phrase.seq;
]]>
]]>
organizationorganization (structured information)
contains structured information about an organization: its name
and address.
ACH/ACL/ALLC Text Encoding IntiativeComputer Center (M/C 135), UIC>
1940 W. Taylor St. Room 124>
Chicago>
Illinois>
60612-7352>
U.S.A.>
Oxford University Computing Service>
13 Banbury Rd.>
Oxford>
OX2 6NN>
England>
]]>
This tag may be used to encode a structured description of
an organization, providing at least its name and its address. Further
elements may be added as they are found useful. To tag only the name of
the organization, use the org or the name element.
additional tag set for common core featuresMay contain the organizations name and zero or more addresses.
address name org
]]>
]]>
origoriginal form
contains the original form of a reading, for which a
regularized form is given in an attribute value.
reg
original
gives a regularized (normalized) form of the text.
CDATA
any string of characters
#IMPLIED
resp
responsibility
identifies the individual responsible for the regularization
of the word or phrase.
CDATA
any string of characters, typically the initials of the
individual involved, or a role identifier like editor if not
known by name.
#IMPLIED
additional tag set for common core featuresMay contain character data and phrase-level elements.
%phrase.seq;
]]>
]]>
pparagraph
marks paragraphs in prose.
Hallgerd was outside. There is blood on your
axe, she said. What have you done?
I have now arranged that you can be married a
second time, replied Thjostolf.
Then you must mean that Thorvald is dead,
she said.
Yes, said Thjostolf. And now you must
think up some plan for me.
]]>
In some contexts, the paragraph may have a specialized meaning,
e.g. in the tag set for dictionaries, p is used to enclose any
running text, and thus does not imply text set off as is conventionally
done in running prose.
additional tag set for common core featuresMay contain character data and phrase-level elements.
%paraContent;
]]>
]]>
pbpage break
marks the boundary between one page of a text and the next
in a standard reference system.
ed
edition
indicates the edition or version in which the page break is located
at this point
CDATA
Any string of characters; usually a siglum conventionally used
for the edition.
#REQUIRED
]]>
n
number or name
indicates the number or other value associated with the page
which follows the point of insertion of this pb.
CDATA
Any string of characters.
#IMPLIED
Encoders should adopt a clear and consistent policy as to
whether the numbers associated with page breaks relate to the physical
sequence number of the page or the page number or signature printed on it.
By convention, pb elements should appear at the start of the
page to which they refer.
Like other forms of milestone tag, pb tags cannot
be automatically verified by SGML; for better validation, a
concurrent markup stream should be used.
additional tag set for common core featuresEMPTY
]]>
]]>
pagemarks pages in a reference edition.
... [text of edition La, page 32]
<(La)page n=32> ... [text of edition La, page 33]
<(La)page n=32> ... [text of edition La, page 34]
<(La)page n=32> ... [text of edition La, page 35]
]]>
The page element in a concurrent page-reference DTD
may be subdivided into columns or lines if desired, or left unanalysed.
base tag set for physical description of the copy textMay contain col elements, line elements, or
character data.
%version vol
#PCDATA col line
]]>
]]>
personcontains a name of a person, or a reference to a person.
This element may be used either for proper nouns only (in
which case it is synonymous with name type='person') or for
any reference to persons it is felt desirable to tag.
This ambiguity may need to be removed. Use broader definition
only and expect name to nest within person if
the tagger wants it that way? -MSM
I think it shd be restricted to names -LB
additional tag set for common core featuresMay contain character data and phrase-level elements.
indiv
%phrase.seq;
]]>
]]>
paraContentparagraph contentdefines the legal contents of paragraphs and similar elements
as comprising character data, phrase-level elements which can occur
only in paragraph content, and some higher-level elements which can
occur either within paragraph content or within sequences of high-level
elements (as defined by the parameter entity seq).
'(#PCDATA | %m.inter; | %m.phrase;)*' phraseincludes elements which can occur at the level of individual
words or phrases.
This class of elements can occur only within larger
elements of the class inter or chunk.
In prose, this means these elements can occur within paragraphs,
list items, lines of verse, etc.
base tag set for common core features
phrase.seq
phrase sequencedefines a sequence of character data and phrase-level elements.
'(#PCDATA | %m.phrase;)*' placecontains the name of, or a reference to, a place.
This element may be used either for proper nouns only (in
which case it is synonymous with name type='place') or for
any reference to places it is felt desirable to tag.
This ambiguity may need to be removed. Use broader definition
only and expect name to nest within place if
the tagger wants it that way? -MSM
I think it shd be restricted to names -LBadditional tag set for common core featuresMay contain character data and phrase-level elements.
%phrase.seq;
]]>
]]>
place
defines a class of elements which contain names of places,
addresses, etc.
This class allows any type of location specification to
be used in certain places (e.g. within the imprint element of
a structured bibliographic reference).
base tag set for common core features
pointerelements which point to other elements, using IDREFs.typecategorizes the pointer in some respect, using any convenient
set of categories.CDATA The type should indicate the intended function of the
pointer, or the rhetorical
relationship between its source and target. #IMPLIEDrespspecifies the creator of the pointer.CDATA any string of characters, usually the initials or name
of the creator.#IMPLIEDcrdatespecifies when the pointer was created.CDATAany string representing a date.#IMPLIEDtargTypespecifies the kinds of elements to which this pointer may point.
NAMES A list of names of SGML elements defined in the DTD of the
current document.#IMPLIED
If this attribute is supplied, every element specified as a
target must be of one or the other of the types specified. An
application may choose whether or not to report failures to satisfy this
constraint as errors, but may not access an element of the right
identifier but the wrong type.
targOrderwhere more than one identifier is supplied as the value of the
target attribute, this attribute specifies whether the order
in which they are supplied is significant.(Y | N | U)YYes: the order in which IDREFs are
specified as the value of a target attribute
should be followed when combining the
targeted elements.
N No: the order in which IDREFs are
specified as the value of a target attribute
has no significance when combining the
pointered elements.
U Unspecified: the order in which IDREFs are
specified as the value of a target attribute
may or may not be significant.
Uevaluatespecifies the intended meaning
when the target of a pointer is itself a pointer.(all | one | none)allif the element pointed to is itself a pointer, then
the target of that pointer will be taken, and so on, until an
element is found which is not a pointer.
one if the element pointed to is itself a pointer, then
its target (whether a pointer or not) is taken as the target of
this pointer.
noneno further evaluation of targets is carried out beyond
that needed to find the element specified in the pointer's target.
#IMPLIED
If no value is given, the application program is
responsible for deciding (possibly on the basis of user
input) how far to trace a chain of pointers.
base tag set for common core featurespostboxpostboxcontains a number or other identifier for some postal delivery
point other than a street address. P.O. Box 280
]]>
Postbus 532
]]>
The position and nature of postal codes is highly
country-specific; the conventions appropriate to the country concerned should
be used.
base tag set for common core featuresaddress
#PCDATA
]]>
]]>
postcodepostcodecontains a numerical or alphanumeric code used as part of a
postal address to simplify sorting or delivery of mail. HR1 3LR
]]>
60142-7
]]>
The position and nature of postal codes is highly
country-specific; the conventions appropriate to the country concerned should
be used.
base tag set for common core featuresaddress
#PCDATA
]]>
]]>
ptrdefines a pointer to another location in the current document.
targetspecifies the destination of the pointer as one or more
SGML identifiersIDREFS Each value specified must be the same as that specified as
value for an ID attribute for some other element in the
current SGML document.#REQUIRED
]]>
base tag set for common core featuresEMPTY
]]>
]]>
publisherprovides the name of the organization responsible for
the publication or distribution of a bibliographic item.Faber and Faber Ltd
OxfordClarendon Press1987
]]>
base tag set for common core featuresUse the full form of the name by which a company is usually
referred to, rather than any abbreviation of it which may appear on a
title pageimprint
%phrase.seq
]]>
]]>
qquoted speech or thoughtcontains a representation of the speech or
thought of a character or speaker in a text. typemay be used to indicate whether the quoted matter is spoken
or thought, or to characterize it more finely.
CDATA
spoken representation of direct speech, usually
marked by quotation marks
thought representation of thought, e.g. internal monologue
#IMPLIED
directmay be used to indicate whether the quoted matter is regarded
as direct or indirect speech
(y | n | unspecified)
y speech or thought is represented directly
n speech or thought is represented indirectly
e.g. by use of a marked verbal aspect
unspecified no claim is made
unspecified
whoidentifies the speaker of a piece of direct speechCDATA
may be an idref#IMPLIED
God Bless Us, Every One!
]]>
base tag set for common core features%phrase.seq;
]]>
]]>
quotequotationcontains a phrase or passage attributed by the narrator or
author to some agency external to the text. You shall know a word by the company it
keeps(Firth, 1957)
]]>
If a citation is supplied for the source of a
quotation, this should be included within the quote
element.
base tag set for common core features%seq;
]]>
]]>
refdefines a reference to another location in the current document,
in terms of one or more identifiable elements, possibly modified by
additional text or comment.targetspecifies the destination of the reference as one or more
SGML identifiersIDREFS Each value specified must be the same as that specified as
value for an ID attribute for some other element in the
current SGML document.#IMPLIED
See especially the second sentence
see also s.v. locution
]]>
base tag set for common core features%phrase.seq
]]>
]]>
refsys
reference system elements
milestone-style elements used in reference systems
base tag set for common core features
regregularization
contains a reading which has been regularized or normalized
in some sense.
orig
original
gives the unregularized form of the text as found in the source
copy.
CDATA
any string of characters
#IMPLIED
Authority
]]>
resp
responsibility
identifies the individual responsible for the regularization
of the word or phrase.
CDATA
any string of characters, typically the initials of the
individual involved, or a role identifier like editor if not
known by name.
#IMPLIED
The reg tag is mirrored by the orig tag,
which allows the unnormalized form of the original to be retained as
the content of the element, while still providing the opportunity to
record the regularized or normalized form postulated by a researcher.
The choice between the two elements is up to the encoder.
additional tag set for common core featuresMay contain character data and phrase-level elements.
%phrase.seq;
]]>
]]>
regionregioncontains the name of any regional or administrative subdivision,
smaller than a country but larger than a city, occurring within an address.
Illinois
]]>
Provence
]]>
base tag set for common core featuresaddress
%phrase.seq
]]>
]]>
respstatement of responsibilitysupplies a statement of responsibility for someone responsible for
the intellectual content of a text, edition, recording, or series, where
the author, editor and other specialized elements do
not suffice.
transcribed from original msClaus Huitfeldt
]]>
converted to SGML encodingAlan Morrison
]]>
base tag set for common core featuresanalytic monogr series
%m.agent role
]]>
]]>
rolecontains a phrase describing the nature of a person's
intellectual responsibility.compilerEdward Child
]]>
auxiliary tag set for TEI headersresp
%phrase.seq;
]]>
]]>
sarbitrary segment
contains any arbitrary phrase-level unit of text.
type
indicates what sort of segment is marked.
CDATA
any string of characters.
#IMPLIED
The type attribute may be used to provide any
suitable classification for the s elements tagged in a text.
It should not be used to tag features for which other tags are defined
in these Guidelines.
When are you leaving?
Tomorrow.
]]>
The s element may be used to mark orthographic
sentences, or other arbitrary low-level segments of the text.
additional tag set for common core featuresMay contain character data and phrase-level elements.
%phrase.seq
]]>
]]>
segelements for arbitrary segmentation. base tag set for common core featuresseriesseries information
contains information about the series in which a book or other
bibliographic item has appeared.
base tag set for common core featuresMay contain character data and phrase-level elements.
bibl.struct
biblScope editor resp title
]]>
]]>
siccontains text reproduced although apparently incorrect or
inaccurate.
corr
correction
gives a correction for the apparent error in the copy text.
CDATA
any string of characters
#IMPLIED
a Table
of green fields.
]]>
resp
responsibility
identifies the individual responsible for the correct form given.
CDATA
Sample values include author, ed, scribe,
or the initials of the individual in question.
#IMPLIED
a Table
of green fields.
]]>
If all that is desired is to call attention to the
apparent problem in the copy text, no attributes are required:
we can prove or disprove anyone's theories?
]]>
It is also possible to provide a correct reading and to identify
the individual responsible for the correction:
we can prove or
disprove anyone's theories?
]]>
The sic tag is a mirror of corr: the
former leaves the original text untouched, giving the correction as an
attribute value; the latter substitutes the correction, leaving the
original reading as an attribute value. The choice between them is up
to the encoder.
additional tag set for common core featuresMay contain character data and phrase-level elements.
%specialPara;
]]>
]]>
soCalledso calledcontains a word or phrase for which the author or narrator
indicates a disclaiming of responsibility, for example by the use
of scare quotes or italics nuts to Scrooge.
]]>
base tag set for common core features%phrase.seq;
]]>
]]>
spspeech
An individual speech in a performance text, or a passage
presented as such in a prose or verse text.
who
identifies the speaker of the part by supplying an ID.
IDREFS
The IDREFS are derived from the ID attribute on the role
elements in the cast list or from a list of the participants.
#IMPLIED
The reverend Doctor Opimiam
I do not think I have named a single unpresentable fish.
Mr Gryll
Bream, Doctor: there is not much to be said for bream.
The Reverend Doctor Opimiam
On the contrary, sir, I think there is much to be said for him.
In the first place....
Fish, Miss Gryll -- I could discourse to you on fish by the hour:
but for the present I will forbear...
]]>
The who attribute on this element may
be used either in addition to the speaker
element or as an alternative.
base tag set for common core featuresLines or paragraphs, stagedirections and phrase level elements.
%specialPara speaker
]]>
]]>
spanattaches simple analytic categories to an arbitrary passage of
text.
resp
responsibility
indicates who is responsible for the annotation being
performed.
CDATA
initials of the researcher providing the analysis.
#IMPLIED
type
indicates what type of phenomenon is being noted in the passage.
CDATA
image
an image in the passage is being identified.
character
the span is associated with a particular character.
theme
the span tag identifies a theme in the passage.
allusion
the span tag identifies an allusion to another
text.
discourse type
the span tag identifies a passage marked by a
particular discourse type.
#IMPLIED
value
identifies the specific phenomenon being annotated.
CDATA
A value from any convenient vocabulary for the type of
phenomenon involved: names for images, themes, characters, types of
discourse, etc.
#IMPLIED
Automatic processing is easier if the values for this
attribute come from a controlled vocabulary.
target
indicates the beginning of the span.
IDREF
reference to the id of an element which begins at
the location in question (e.g. the id of an anchor
element).
#IMPLIED
If no value is supplied, the passage is assumed to begin at
the point where the span element occurs. If the id
value given is that of an element with content, and no value is given
for target.end, then the passage is assumed coterminous with
the element pointed at.
target.end
points to the end of the span being annotated.
IDREF
reference to the id of an element which
ends at the location in question, or to an empty element at
the point in question.
#IMPLIED
If no value is supplied, the annotation is assumed to apply
to the span or single point indicated by the target value.
]]>
The span element is intended for simple analyses
of text, in which specific values (for example names of themes,
characters, allusions, etc.) are
associated with specific passages or spans of text. The type
and value attributes allow the researcher to use whatever
categories and values seem apt to the purpose. For more discursive
annotation which requires less restrictive markup, the note
element may be used. For analysis with more internal structure, use the
feature structure mechanisms described in chapter
.
Note that since the span tag points both to the beginning
and to the ending of the passage it annotates, it need not be located at
the passage itself; this allows the base text and annotation to be kept
separate if that is desired.
additional tag set for common core featuresEmpty.
EMPTY
]]>
]]>
speakerA specialized form of heading or label, giving the name of
one or more speakers in a dramatic text or fragment.
Nancy and Robert(speaking simultaneously)
The future? ...
]]>
The who attribute on the sp element may
be used either in addition to this element or as an alternative.
base tag set for common core featuresAny sequence of phrase level data
sp
%phrase.seq
]]>
]]>
specialPara
special paragraph
defines the behavior of certain special
paragraph-like elements, which can either behave like a single paragraph
(containing character data and phrases) or contain a series of
paragraphs and paragraph-level objects.
This parameter entity is used to define the contents of
several elements, including item, note, and
sp.
'(((%m.chunk;), (%m.chunk; | %m.inter;)*) |
(%paraContent;))'
stagestage directioncontains any kind of stage direction within a dramatic text or
fragment.typeindicates the kind of stage direction.CDATA
setting describes a setting.
entrance describes an entrance.
exit describes an exit.
business describes stage business.
novelistic is a narrative, motivating stage direction.
delivery describes how a character speaks.
modifier gives some detail about a character.
location describes a location.
mixed more than one of the above
mix
A curtain being drawn.
MusicEnter Husband as being thrown off his
horse.Exit pursued by a bear.He quickly takes the stone out.To Lussurioso.Having had enough, and embarrassed for
the family.Disguised as Ansaldo.At a window.Aside.
]]>
base tag set for common core features%para.seq
]]>
]]>
streetstreeta full street address including any name or number identifying a
building as well as the name of the street or route on which it is
located.
via della Faggiola, 36
]]>
Duntaggin,
110 Southmoor Road
]]>
The order and presentation of house names and numbers and
street names, etc., may vary considerably in different countries. The
encoding should reflect the order which is appropriate in the country
concerned.
base tag set for common core featuresRunning prose onlyaddress
%phrase.seq
]]>
]]>
termcontains a word or phrase regarded as a technical term.
parser, and much
of the history of NLP over the last 20 years has been occupied
with the design of parsers.
]]>
discoursal point of view
as the relationship, expressed through discourse
structure, between the implied author or some other addresser,
and the fiction.
]]>
base tag set for common core featuresfree prose
%phrase.seq;
]]>
]]>
timecontains a phrase defining a time of day in any format.
value
gives the value of the time in a standard form.
CDATA
Any string representing a time in standard format; recommended
form is hhmm, using the 24 hour clock.
#IMPLIED
four o' clock
]]>
If the normalized form of time is not given using the twenty-four
hour clock, this fact should be documented in the stdVals
element in the TEI Header.
the quarter to twelve.
]]>
Twelfth Day of June
in the Year of Our Lord One Thousand Nine Hundred and
Seventy-seven of the Republic the Two Hundredth and first
and of the University the Eighty-Sixth.
]]>
September 1990
]]>
additional tag set for common core featuresMay contain character data and phrase-level elements.
%phrase.seq;
]]>
]]>
timeRangetime range
contains two times or another phrase indicating a time period.
from
indicates the starting point of the time period in standard form.
CDATA
a time of day; recommended form is to use the 24 hour clock.
#IMPLIED
The value should conform to the standard form declared in
the stdVals element in the TEI header.
to
indicates the ending point of the time period in standard form.
CDATA
a time of day; recommended form is to use the 24 hour clock.
#IMPLIED
The value should conform to the standard form declared in
the stdVals element in the TEI header.
exact
indicates the precision to be attached to either or both times
specified.
(to|from|both|none)
tothe to time is exact
fromthe from time is exact
bothboth times are exact
noneboth times are approximate or unspecified
#IMPLIED
Just after teatime
]]>
time values should conform to the standard form declared in
the stdVals element in the TEI header.
additional tag set for common core featuresMay contain character data and phrase-level elements.
%phrase.seq;
]]>
]]>
titlecontains the title of a work, whether article, book, journal, or
series, including any alternative titles or subtitles.level
bibliographic level (or class) of title
indicates whether this is the title of an article, book, journal,
series, or unpublished material.
(a | m | j | s | u)
a
analytic title (article, poem, or other item published as
part of a larger item)
m
monographic title (book, collection, or other item published
as a distinct item, including single volumes of multi-volume
works)
j
journal title
s
series title
u
title of unpublished material (including theses and
dissertations unless published by a commercial press)
#IMPLIED
If the title appears directly enclosed within an
analytic element, the level, if given, must be
a; if it appears directly enclosed within a monogr
element, level must be m, j, or u;
when title is directly enclosed by series,
level must be s.
type
type of title
classifies the title according to some convenient typology.
CDATA
main main title
subordinate subtitle, title of part
parallel alternate title, often in another language,
by which the work is also known
abbreviated abbreviated form of title
#IMPLIED
This attribute is provided for convenience in analysing
titles and processing them according to their type; where such
specialized processing is not necessary, there is no need for such
analysis, and the entire title, including subtitles and any parallel
titles, may be enclosed within a single title element.
La vie mode d'emploi. Romans.
]]>
Synthese: an international journal for epistemology, methodology
and history of science
]]>
Information Technology and the Research Process: Proceedings
of a conference held at Cranfield Institute of Technology, UK, 18-21
July 1989
]]>
Hardy's Tess of the D'Urbervilles: a machine readable edition
]]>
base tag set for common core featuresanalytic monogr series
%soup;
]]>
]]>
%versionversion name
serves as the root element for a concurrent markup stream which
will be used to mark page and line numbers of a reference edition of the
text.
]>
<(TEI.2)TEI.2><(La)La>
<(TEI.2)TEI.Header> ...
(TEI.2)TEI.Header>
<(TEI.2)text>
<(La)page n='[1]'>
...
<(La)page n='4'>
...
<(La)page n='5'>
...
(TEI.2)text>
(La)La>
(TEI.2)TEI.2>
]]>
The version name should be short (one or two characters), as
it must be prefixed to all page and line tags in the concurrent markup
stream.
base tag set for physical description of the copy textMay contain character data, vol, and page
elements.
#PCDATA page vol
]]>
]]>
version
version
defines the name to be used for the root element of a concurrent
markup stream for marking pages and lines of some reference edition.
Some name for the edition should be supplied by defining
this parameter entity within the appropriate DTD subset. If none is
defined, the parameter entity (and thus the document type) default to
ref.
'ref'
volvolume
marks the individual volumes of a reference edition.
... [text of edition La, volume 2]
<(La)page n=32> ... [text of edition La, vol. 2, page 32]
<(La)page n=32> ... [text of edition La, vol. 2, page 33]
<(La)page n=32> ... [text of edition La, vol. 2, page 34]
<(La)page n=32> ... [text of edition La, vol. 2, page 35]
]]>
The vol may be used if the reference edition has
more than one volume; otherwise it need not be used. Any data contained
within a vol element but not within a page element is
assumed not to appear in the edition from which the reference scheme
derives.
base tag set for physical description of the copy textMay contain character data and page elements.
%version
#PCDATA page
]]>
]]>