Names and Dates

This chapter describes an additional tag set which may be used for the encoding of proper names and other phrases descriptive of persons, places, organizations, and also of dates and times, in a manner more detailed than that possible using the elements already provided for these purposes in the core tag set described in chapter .

In section it was noted that the elements provided in the core allow the encoder to specify that a given text segment is a proper noun, or a referring string, and to specify the kind of object named or referred to only by supplying a value for the type attribute. The elements provided by the present tag set allow the encoder both to supply a detailed sub-structure for such referring strings, and also to distinguish explicitly between names of persons, places or organizations.

Similarly, the elements provided here allow the encoder to supply a detailed analysis of the component parts of any expression which denotes a date or time, which is not possible using the elements described in section .

It should be noted however that no provision is made by the present tag set for the representation of the abstract structures, or virtual objects to which names or dates may be said to refer. In simple terms, where the core tag set allows one to represent a name, this additional tag set allows one to represent a personal name, but neither provides for the direct representation of a person. Appropriate mechanisms for the encoding of such interpretative gestures may be found in chapters and .

To enable the additional tag set described in the present chapter, a parameter entity TEI.names.dates must be declared in the document type subset with the value INCLUDE, as further described in section . A document using the prose base tag set and this additional tag set will thus begin as follows: ]> ]]>

The chapter begins by discussing additional tags for the encoding of component parts of personal names (section , place names (section ) and organizational names (section ). Detailed encoding of dates and times is described in section .

The additional tag set for names and dates, included in the file teind2.dtd, has the following overall structure: ]]>

When this tag set is enabled, three additional element classes called persPart, placePart, and temporalExpr are declared. The parameter entities corresponding with these classes are declared in the file teind2.ent, as follows: ]]> Personal Names

The core rs and name elements can distinguish names in a text but are insufficiently powerful to mark their internal components or structure. To conduct nominal record linkage or even to create an alphabetically sorted list of personal names, it is important to distinguish between a family name, a forename and an honorary title. Similarly, when confronted with a referencing string such as John, by the grace of God, king of England, lord of Ireland, duke of Normandy and Aquitaine, and count of Anjou, the analyst will often wish to distinguish amongst components giving some hint as to the status, occupation or residence of the person to whom the name belongs. The following elements are provided for these and related purposes: contains a proper noun or proper-noun phrase referring to a person, possibly including any or all of the person's forename, surname, honorofic, added names, etc. contains a family (inherited) name, as opposed to a given, baptismal, or nick name. contains a forename, given or baptismal name. contains a name component which indicates that the referent has a particular role or position in society, such as an official title or rank. contains an additional name component, such as a nickname, epithet, or alias, or any other descriptive phrase used within a personal name. contains a connecting phrase or link used within a name but not regarded as part of it, such as van der or of. contains a name component used to indicating generational information, such as Junior, or a number used in a monarch's name.

As members of the names class, all of these elements share the following attributes: provides an alternative identifier for the object being named, such as a database record key. gives a normalized or regularized form of the name used.

Additionally, all of the above elements except for persName are members of the class personPart, and thus share the following attributes: provides more culture- linguistic- or application- specific information used to categorize this name component. indicates whether the name component is given in full, as an abbreviation or simply as an initial. Legal values are: the name component is spelled out in full. the name component is given in an abbreviated form. the name component is indicated only by one initial. specifies the sort order of the name component in relation to others within the personal name.

The persName element may be used in preference to the general name element irrespective of whether or not the components of the personal name are also to be marked. Its key and reg attributes are used in exactly the same way as those on the rs and name elements (see section ). The tag persName is synonymous with the tag name type=person, except that its type attribute allows for further subcategorization of the personal name for example as a married, maiden, pen, pseudo or religious name. Consequently the following examples are equivalent: David Paul Brown, has suffered the furniture of his office to be seized the third time for rent. That silly man David Paul Brown has suffered the furniture of his office to be seized the third time for rent. That silly man David Paul Brown has suffered ... That silly man David Paul Brown has suffered ... ]]>

The persName element is more powerful than the rs and name elements because distinctive name components occurring within it can be marked as such.

Many cultures distinguish between a family or inherited surname and additional personal names, often known as given names. These should be tagged using the surname and forename elements respectively and may occur in any order: Roosevelt, Franklin Delano Franklin Delano Roosevelt ]]>

The type attribute may be used with both forename and surname elements to provide further culture- or project- specific detail about the name component, for example: Franklin Delano Roosevelt Margaret Hilda Roberts Thatcher Muhammad Ali ]]> In the following two examples the type attribute of the surname element is used to indicate so-called double-barrelled or hyphenated surnames: Kara Hattersley-Smith Norman St John Stevas ]]>

In most cases, patronymics should be treated as forenames, thus: Snorri Sturluson to combine the two traditions in cyclic form. ]]> When a patronymic is used as a surname, however (e.g. by an individual who otherwise would have no surname, but lives in a culture which requires surnames), it may be tagged as such: Finnur Jonsson acknowledged the artificiality of the procedure: As Njála</> now begins, no original saga ever began.</q> ]]> </eg> <!-- Clover, p. 77. --> In the following example, the <att>type</att> attribute is used to distinguish a patronymic from other forenames: <eg><![ CDATA[ <persName key=pn9> <forename sort=2>Sergei</forename> <forename sort=3 type='patronym'>Mikhailovic</forename> <surname sort=1>Uspensky</surname> </persName> ]]> </eg> <p>This example also demonstrates the use of the <att>sort</att> attribute common to all members of the <ident type=class>personPart</ident> class; its effect is to state the sequence in which <gi>forename</gi> and <gi>surname</gi> elements should be combined when constructing a sort key for the name. <p>Some names include generational or dynastic information, such as <q>Junior</q> or <q>senior</q>, or a number: the <gi>genName</gi> element may be used to distinguish these from other parts of the name, as in the following examples: <eg><![ CDATA[ <persName key=HEMA1> <surname>Marques</surname> <genName>Junior</genName>, <forename>Henrique</forename> </persName> <persName> <foreName>Charles</foreName> <genName>II</genName> </persName> ]]> </eg> <p>It is also often convenient to distinguish phrases (historically similar to the generational labels mentioned above) used to link parts of a name together, such as <q>von</q>, <q>of</q>, <q>de</q> etc. It is often a matter of arbitrary choice whether or not such components are regarded as part of the surname or not; the <gi>nameLink</gi> element is provided as a means of making clear what the correct usage should be in a given case, as in the following examples: <eg> <![ CDATA [ <persName key=DUDO1> <roleName type=honorific full=abb>Mme</roleName> <nameLink>de la</nameLink> <surname>Rochefoucault</surname> </persName> <persName> <foreName>Walter</foreName> <surname>de la Mare</surname> </persname> ]]> </eg> <p>Finally, the <gi>addName</gi> and <gi>roleName</gi> elements are used to mark all name components other than those already listed. The distinction between them is that a <gi>roleName</gi> encloses an associated name component such as an aristocratic or official title which exists in some sense independently of its bearer. The distinction is not always a clear one. As elsewhere, the <att>type</att> attribute may be used with either element to supply culture- or application- specific distinctions. Some typical values for this attribute for names in the Western European tradition follow: <list type=gloss> <label>nobility<item>An inherited or life-time title of nobility such as Lord, Viscount, Baron, etc. <label>honorific<item>An academic or other honorific prefixed to a name e.g. Dr., Professor, Mrs., etc. <label>office<item>Membership of some elected or appointed organization such as President, Governor, etc. <label>military<item>Military rank such as Colonel. <label>epithet<item>A traditional descriptive phrase or nick-name such as The Hammer, The Great, etc. </list> <p>Here are some further examples of the usage of these elements: <eg><![ CDATA[ <persName key=PGK1> <roleName type=nobility>Princess</roleName> <forename>Grace</forename> </persName> <persName key=GRMO1 type=pseudo> <addName type=honorific>Grandma</addName> <surname>Moses</surname> </persName> <persName key=MRSRO1> <addName type=honorific>Mrs</addName> <surname>Robinson</surname> </persName> <persName key=STAU1> <roleName type=office>Saint</addName> <forename>Augustine</forename> </persName> <persName key=SLWICL1> <roleName type=office>President</roleName> <foreName>Bill</foreName> <surname>Clinton</surname> </persName> <persName key=MOGA1> <roleName type=military>Colonel</roleName> <surname>Gaddafi</surname> </persName> <persName key=FRTG1> <forename>Frederick</forename> <addname type=epithet>the Great</addname> </persName> ]]> </eg> <p>A name may have any combination of the above elements: <eg><![ CDATA[ <persName key=EGBR1> <roleName type=office>Governor</roleName> <forename sort=2>Edmund <forename sort=3 full=init reg='Gerald'>G</forename>. <addName type=epithet>Jerry</addName> <addName type=epithet>Moonbeam</addName> <surname sort=1>Brown</surname> <relationship full=abb>Jr</relationship>. </persName> ]]> </eg> <p>Although highly flexible, these mechanisms for marking personal name components will not cater for every personal name and processing need. Where the internal structure of personal names is highly complex or where name components are particularly ambiguous, feature structures are recommended as the most appropriate mechanism to mark and analyze them, as further discussed in chapter <ptr target=FS>. <p>The elements discussed in this section are formally defined as follows: <eg id=dndper> <![ CDATA [ <!-- 20.1: Personal names --> <!ELEMENT persName - - (%m.personPart; | %m.phrase; | #PCDATA)* > <!ATTLIST persName %a.global; %a.names; type CDATA #IMPLIED > <!ELEMENT surname - - (%phrase.seq;) > <!ATTLIST surname %a.global; %a.personPart; > <!ELEMENT forename - - (%phrase.seq;) > <!ATTLIST forename %a.global; %a.personPart; > <!ELEMENT genName - - (%phrase.seq;) > <!ATTLIST genName %a.global; %a.personPart; > <!ELEMENT nameLink - - (%phrase.seq;) > <!ATTLIST nameLink %a.global; %a.personPart; > <!ELEMENT addName - - (%phrase.seq;) > <!ATTLIST addName %a.global; %a.personPart; > <!ELEMENT roleName - - (%phrase.seq) > <!ATTLIST roleName %a.global; %a.personPart; > <!-- This fragment is used in sec. 20 --> ]]> </eg> <div2 id=NDplac><head>Place Names</head> <p>Like other proper nouns or noun phrases used as names, place names can simply be marked up with the <gi>rs</gi> element, or with the <gi>name</gi> element.For cartographers and historical geographers, however, the component parts of a place name provide important information about the relation between the name and some spot in space and time. They also provide importance evidence in historical linguistics. For such applications and others in which the internal structure of a place name is to be encoded, the <gi>placeName></gi> element and its subcomponents should be used. <list type=gloss> <label><gi>placeName</gi></label> <item>contains an absolute or relative place name. </item> <label><gi>settlement</gi></label> <item>contains the name of the smallest component of a place name expressed as a hierarchy of geo-political or administrative units as in <q>Rochester</q>, New York; <q>Glasgow</q>, Scotland.</item> <label><gi>region</gi></label> <item>in an address, contains the state, province, county or region name; in a place name given as a hierarchy of geo-political units, the <gi>region</gi> is larger or administratively superior to the <gi>settlement</gi> and smaller or administratively less important than the <gi>country</gi>.</item> <label><gi>country</gi></label> <item>in an address, gives the name of the nation, country, colony, or commonwealth; in a place name given as a hierarchy of geo-political units, the <gi>country</gi> is larger or administratively superior to the <gi>region</gi> and smaller than the <gi>bloc</gi>.</item> <label><gi>bloc</gi></label> <item>a geo-political unit containing one or more nation states.</item> <label><gi>geogName</gi></label> <item>a name associated with some geographical feature such as <q>Windrush Valley</q> or <q>Mount Sinai</q>.</item> <label><gi>geog</gi></label> <item>contains a common noun identifying some geographical feature contained within a geographic name, such as <q>valley</q>, <q>mount</q> etc.</item> <label><gi>distance</gi></label> <item>that part of a relative temporal or spatial expression which indicates the distance between the place or time denoted by it and the place or time referred to within it.</item> <label><gi>offset</gi></label> <item>that part of a relative temporal or spatial expression which indicates the direction of the offset between the two place names, dates, or times involved in the expression.</item> </list> <p>As members of the <ident type=class>names</ident> class, all these elements share the following attributes: <list type=gloss> <label><att>key</att></label> <item>provides an alternative identifier for the object being named, such as a database record key.</item> <label><att>reg</att></label> <item>gives a normalized or regularized form of the name used.</item> </list> <p>Additionally, all of the above elements are members of the class <ident type=class>placePart</ident>, and thus share the following attributes: <list type=gloss> <label><att>type</att></label> <item>provides more culture- linguistic- or application- specific information used to categorize this name component.</item> <label><att>full</att></label> <item>indicates whether the place name component is given in full, as an abbreviation or simply as an initial Legal values are: <list type=gloss> <label><term>yes</term></label> <item>the name component is spelled out in full.</item> <label><term>abb</term></label> <item>the name component is given in an abbreviated form.</item> <label><term>init</term></label> <item>the name component is indicated only by one initial.</item> </list></item> </list> <p>Like the <gi>persName</gi> element discussed in section <ptr target=NDper>, the <gi>placeName</gi> element may be regarded simply as an abbreviation for the tags <tag>name type=place</tag> or <tag>rs type=place</tag>. The following encodings are thus equivalent: <note>Strictly, a suitable value such as <code>figurative</code> should be added to the two place names which are presented periphrastically in the second example here, in order to preserve the distinction indicated by the choice of <gi>rs</gi> rather than <gi>name</gi> to encode them in the first version.</note> <eg><![CDATA[ After spending some time in our <rs type=place key=NY1>modern <name type=place key=BA1>Babylon</name> </rs>, <name type=place key=NY1>New York</name>, I have proceeded to the <rs type=place key=PH1>City of Brotherly Love</rs>. After spending some time in our <placeName key=NY1>modern <placeName key=BA1>Babylon</placeName> </placeName>, <placeName key=NY1>New York</placeName>, I have proceeded to the <placeName key=PH1>City of Brotherly Love</placeName>. ]]> </eg> <!-- from letter from 'JK' found in Poulson's Daily Advertiser --> <!-- 8 October, 1835, --> <p>As indicated above, the <gi>placeName</gi> may simply contain a character string and its <att>type</att> attribute may be used to provide a sub-categorization of place names. Alternatively, it may contain more detailed sub components. A place name may be analysed in several different ways: as a geo-political unit, using a hierarchy of descriptive names (see section <ptr target=NDplgu>); in terms of geographic features such as mountains and rivers (see section <ptr target=NDplgf>); relative to other place names (see section <ptr target=NDplr>. <div3 id=NDplgu><head>Geo-political Place Names <p>A place name is sometimes given as sequence of geo-political or administrative units, often arranged in ascending sequence according to their size or administrative importance, for example: <q>Rochester, New York</q>, or as a single such unit, for example <q>Belgium</q>. The more detailed component elements listed above (<gi>settle</gi> for a settlement, such as a village, town or city; <gi>region</gi> for any administrative unit such as a county, parish or state; <gi>country</gi> for a politically recognized national entity; or <gi>bloc</gi> for any grouping of such entities) have been chosen for their generality of application. They may be tailored more closely to project- and culture-specific needs by specifying appropriate values in their respective <att>type</att> attributes, as in the following example: <eg><![ CDATA[ <placeName key=RNY1> <settlement type=city>Rochester</settlement>, <region type=state>New York</region> </placeName> <placeName key=LSEA1> <country type=nation>Laos</country>, <bloc type=sub-continent>Southeast Asia</bloc> </gu> ]]> </eg> <p>Note that, even in the case where only one of these component place name elements is used, the <gi>placeName</gi> element must still be present. <eg><![ CDATA[ I'd rather be in <placeName> <settlement key=RNY1 type=city>Rochester</settlement> </placeName> than any other place I know. ]]> </eg> <div3 id=NDplgf><head>Geographic Names <p>Places may also be named in terms of geographic features such as mountains, lakes or rivers, independently of geo-political units. The <gi>geogName</gi> is provided to mark up such names, as an alternative to the <gi>placeName</gi> element discussed above. It contains a sequence of phrase level elements, optionally extended by the following special element: <list type=gloss> <label><gi>geog</gi></label> <item>contains a common noun identifying some geographical feature contained within a geographic name, such as <q>valley</q>, <q>mount</q> etc.</item> </list> <p>For example: <eg> <![CDATA[ <geogName key=MIRI1 type=river> Mississippi River </geogName> ]]> </eg> <p>Where the <gi>geog</gi> element is used to characterize the kind of geographic feature being named, the <gi>name</gi> element will generally also be used to mark the associated proper noun or noun phrase: <eg> <![CDATA[ <geogName key=MIRI1 type=river> <name>Mississippi</name> <geog>River</geog> </geogName> ]]> </eg> A more complex example, showing a variety of practices, follows: <eg><![CDATA[ The isolated ridge separates two great corridors which run from <name key=GLCO1 type=place>Glencoe</name> into <geogName key=GLET1 type=glen> <geog reg='glen'>Glen</gfn> <name>Etive</name> </geogName>, the <geogName key=LAGA1 type=hill> <geog lang='gaelic' reg='sloping hill face'>Lairig</gfn> <name type=>Gartain</name> </geogName> and the <geogName key=LAEI1 type=hill> <geog lang='gaelic' reg='sloping hill face'>Lairig</gfn> <name>Eilde</name> </geogName> ]]> </eg> <div3 id=NDplr><head>Relative Place Names <p>All the place name specifications so far discussed are <term rend=noindex>absolute</term>, in the sense that they define only one place. A place may however be specified in terms of its relationship to another place, for example <q>10 miles northeast of Paris</q> or <q>near the top of Mount Sinai</q>. These <term>relative place names</term> will contain a place name which acts as a referent (e.g. <q>Paris</q> and <q>Mount Sinai</q>). They will also contain a word or phrase indicating the the position of the place being named in relation to the referent (e.g. <q>the top of</q>, <q>north of</q>). A distance, possibly only vaguely specified, between the referent place and the place being indicated may also be present (e.g. <q>10 miles</q>, <q>near</q>) <p>Relative place names may be encoded using the following elements in combination with either a <gi>placeName</gi> or a <gi>geogName</gi> element. <list type=gloss> <label><gi>offset</gi></label> <item>that part of a relative temporal or spatial expression which indicates the direction of the offset between the two place names, dates, or times involved in the expression.</item> <label><gi>distance</gi></label> <item>that part of a relative temporal or spatial expression which indicates the distance between the place or time denoted by it and the place or time referred to within it.</item> </list> Some examples of relative place names are: <eg> <![CDATA[ <placeName key=NRPA1> <offset>near</offset> <geog> the top of <geogName>Mount</geogName> <name>Sinai</name> </geog> </placeName> <placeName key=NEPA1> <distance>10 miles</distance> <offset>north of</offset> <settlement type=city>Paris </settlement> </placeName> ]]> </eg> <p>The internal structure of place names is like that of personal names - complex and subject to an enormous amount of variation across time and different cultures. The recommendations in this section will be adequate for a majority of users and applications. They may not, however, satisfy the most specialized inquiries and/or applications in which case it is recommended that the internal structure of place names be represented using feature structures <ptr target=FS>. <p>The elements discussed in this section are formally defined as follows: <eg id=dndplac> <![ CDATA [ <!-- 20.2.3: Names for places --> <!ELEMENT placeName - - ((%m.placePart; | %m.phrase; | #PCDATA)+) > <!ATTLIST placeName %a.global; type CDATA #IMPLIED full (yes | abb | init) yes %a.names; > <!ELEMENT settlement - - (%phrase.seq;) > <!ATTLIST settlement %a.global; %a.placePart; > <!ELEMENT region - - (%paraContent) > <!ATTLIST region %a.global; %a.placePart; > <!ELEMENT country - o (%paraContent) > <!ATTLIST country %a.global; %a.placePart; > <!ELEMENT bloc - - (%phrase.seq) > <!ATTLIST bloc %a.global; %a.placePart; > <!ELEMENT offset - - (#PCDATA) > <!ATTLIST offset %a.global; value CDATA #IMPLIED %a.placePart; > <!ELEMENT distance - - (%phrase.seq) > <!ATTLIST distance %a.global; value CDATA #IMPLIED type CDATA #IMPLIED full (yes | abb | init) yes exact (Y | N | U) U reg CDATA #IMPLIED > <!ELEMENT geogName - - (geog | name | #PCDATA)* > <!ATTLIST geogName %a.global; %a.placePart; > <!ELEMENT geog - - (#PCDATA) > <!ATTLIST geog %a.global; %a.placePart; > <!-- This fragment is used in sec. 20 --> ]]> </eg> <div2 id=NDorg><head>Organization Names</head> <p>Like names of persons or places, organization names can be marked as referent strings or as proper names with the <gi>rs</gi> and <gi>name</gi> elements. For certain applications it may be desirable to mark the component parts of an organization. In some historical and social scientific studies, for example, the component parts of an organization names may give crucial clues which help to characterizing the organization in terms of its geographical location, ownership, likely number of employees, management structure etc. The elements discussed in this section are recommended for this purpose and include: <list type=gloss> <label><gi>orgName</gi></label> <item>contains an organizational name. Attributes include: <list type=gloss> <label><att>type</att></label> <item>more fully describes the organization indicated in the organizational name. Possible values include <q>voluntary</q>, <q>political</q>, <q>governmental</q>, <q>industrial</q>, <q>commercial</q>, etc.</item> <label><att>key</att></label> <item>provides an alternative identifier for the organization being named, such as a database record key.</item> <label><att>reg</att></label> <item>gives a normalized or regularized form of the organization name</item> </list> </item> <label><gi>orgtitle</gi></label> <item>contains the proper name component of an organizational name. Attributes include: <list type=gloss> <label><att>type</att></label> <item>more fully describes the organization title. Possible values include <q>formal</q>, <q>colloquial</q>, <q>acronym</q>, etc.</item> <label><att>reg</att></label> <item>gives a normalized or regularized form of the organization title.</item> </list> </item> <label><gi>orgtype</gi></label> <item>indicates a part of the organization name which contains information about the organization's structure or function. Attributes include: <list type=gloss> <label><att>type</att></label> <item>more fully describes the organization type specified in the name component. Possible values include <q>function</q>, <q>structure</q>, etc.</item> <label><att>reg</att></label> <item>gives a normalized or regularized form of the organization type</item> </list> </item> <label><gi>orgdivn</gi></label> <item>indicates a division, branch or department specified in an organizational name. Attributes include: <list type=gloss> <label><att>type</att></label> <item>more fully describes the organization division specified in the name component. Possible values include <q>branch</q>, <q>department</q>, <q>section</q>, <q>division</q>, etc.</item> <label><att>reg</att></label> <item>gives a normalized or regularized form of the organizational division.</item> </list> </item> </list> <p>The <gi>orgname</gi> element should be used when it is desirable to mark an organization name irrespective of whether or not its components are also to be marked. In effect the <gi>orgname</gi> element is a special case of a <gi>name</gi> and thus of an <gi>rs</gi> element. Consequently, the following examples are synonymous, though the last is preferred: <eg> <![ CDATA [ About a year back, a question of considerable interest was agitated in the <rs type=org key=PAS1>Pennsyla. Abolition Society</rs>. About a year back, a question of considerable interest was agitated in the <rs type=org key=PAS1><name>Pennsyla. Abolition Society</name></rs>. About a year back, a question of considerable interest was agitated in the <name type=org key=PAS1>Pennsyla. Abolition Society</name>. About a year back, a question of considerable interest was agitated in the <orgname key=PAS1 type=voluntary reg=Pennsylvania Abolition Society'>Pennsyla. Abolition Society</orgname>. ]]> </eg><!-- From the Diary of Thomas Pym Cope --> Like the <gi>rs</gi> and <gi>name</gi> elements, the <gi>orgname</gi> element has a <att>key</att> attribute with which an external identifier such as a database key can be assigned to the organization name. It also has a <att>type</att> attribute with which the organization named in the expression can be described, and a <att>reg</att> attribute with which the organization name can be presented in a regularized form. <p>The <gi>orgtitle</gi> element is used to mark the expression which provides the proper name component of an organization name for example: <eg><![CDATA[ Mr Frost will be able to earn an extra fee from <orgname type=media key=BSB1> <orgtitle type=acronym>BSkyB</orgtitle> </orgname> rather than the <orgname type=media key=BBC1> <orgtitle type=acronym reg='British Broadcasting Corporation'> BBC</orgtitle> </orgname> ]]></eg> <p>Where personal names are encountered as component parts of an organization's title, as in <q>Ernst & Young</q>, these may be tagged with the appropriate personal name elements as discussed in <ptr type=div2 target=NDPer>. Examples include: <eg><![ CDATA[ <orgname type='accountancy partnership' key=EY1> <orgtitle> <persname><surname>Ernst</surname></persname> & <persname><surname>Young</surname></persname> </orgtitle> </orgname> ]]></eg> <p>Organization names may also contain within them place names which, in some applications, may yield vital clues as to the organization's location and or sphere of influence. These components should be tagged with the approprate place name tags <ptr type=div2 target=NDplac>. Examples include: <eg><![ CDATA[ A spokesman from <orgname type=computers key=IBM1> <orgtitle reg='International Business Machines'>IBM</orgtitle> <country reg='United Kingdom' key=UNKI1>UK</country> </orgname> said... The feeling in <country type=nation key=CAN1>Canada</country> is one of strong aversion to the <orgname type=government key=USG1> <country type=nation key=US1>United States</country> Government </orgname>, and of predilection for self-government under the <orgname type=government reg='British monarchy'>English Crown</orgname> ]]></eg><!-- Anthony Trollope, North America, Penguin ed., 51 --> <p>The <gi>orgtype</gi> element is used to mark those components of an organizatino name which indicate something about the structure or function of the organization. Examples include: <eg><![ CDATA[ <orgname type='utility company' key=WWPC1> <region type=state>Washington</region> <orgtype type=function>Water Power</orgtype> <orgtype reg='incorporated' type=structure>Inc.</orgtype> </orgname> THE TICKET which you will receive herewith has been formed by the <orgname type=political reg='Whig party' key=WHI1> <orgtitle>Democratic Whig</orgtitle> <orgtype type=function>Party</orgtype> </orgname> after the most careful deliberation, with a reference to all the great objects of NATIONAL, STATE, COUNTY and CITY concern, and with a single eye to the <emph>Welfare and Best Interests of the Community</emph>. ]]></eg> <p>Organizational names may also be specified hierarchically particularly where the named organization is itself a department or a branch of a larger organizational entity. <q>The Department of Modern History, Glasgow University</q> is an example. The <gi>orgdivn</gi> element is recommended wherever it is desirable to isolate the independent levels of an organizational hierarchy that are specified in an organization name. Examples include: <eg><![ CDATA[ <orgname type=academic key=DMHGU1> <orgdivn type=department>Department of <orgtype type=function>Modern History</orgtype> </orgdivn>, <settle type=city>Glasgow</settle> <orgtype type=function>University</orgtype> </orgname> ]]></eg> <p>Although highly flexible, the mechanisms discussed here for marking the components of organization names will not cater for every processing need or organizational name that is encountered. Where the internal structure of organization names is highly complex, where name components are particularly ambiguous, or where it is important to indicate the assumptions made in the evaluation of an organization name, then feature structure notation is recommended <ptr target=FS> <p>The formal declaration of the elements discussed in this section include: <eg id=dndorg> <![ CDATA [ <!-- 20.3: Organization names --> <!ELEMENT orgName - - (%phrase.seq) > <!ATTLIST orgName %a.global; type CDATA #IMPLIED key CDATA #IMPLIED reg CDATA #IMPLIED > <!ELEMENT orgtitle - - (%phrase.seq) > <!ATTLIST orgtitle %a.global; type CDATA #IMPLIED reg CDATA #IMPLIED > <!ELEMENT orgtype - - (%phrase.seq) > <!ATTLIST orgtype %a.global; type CDATA #IMPLIED reg CDATA #IMPLIED > <!ELEMENT orgdivn - - (%phrase.seq) > <!ATTLIST orgdivn %a.global; type CDATA #IMPLIED reg CDATA #IMPLIED > <!-- This fragment is used in sec. 20 --> ]]> </eg> <div2 id=NDdate><head>Dates and Time</head> <p>The following elements for the encoding of dates and times were introduced in section <ptr target=COnada>: <list type=gloss> <label><gi>date</gi></label> <item>contains a date in any format. Attributes include: <list type=gloss> <label><att>calendar</att></label> <item>indicates the system or calendar to which the date belongs.</item> <label><att>value</att></label> <item>gives the value of the date in some standard form, usually yyyy-mm-dd.</item> <label><att>certainty</att></label> <item>indicates the degree of precision to be attributed to the date.</item> </list> </item> <label><gi>time</gi></label> <item>contains a phrase defining a time of day in any format. Attributes include: <list type=gloss> <label><att>zone</att></label> <item>indicates time zone or place name wherever this is necessary to evaluate a temporal expression.</item> <label><att>value</att></label> <item>gives the value of the time in a standard form.</item> <label><att>type</att></label> <item>indicates something about the type of temporal expression being tagged. Legal values are: <list type=gloss> <label><term>am</term></label> <item>indicates a temporal expression made on the basis of a twelve-hour clock and referring to a time between midnight and noon.</item> <label><term>pm</term></label> <item>indicates a temporal expression made on the basis of a twelve-hour clock and referring to a time between noon and midnight.</item> <label><term>24hour</term></label> <item>indicates a temporal expression made on the basis of a twenty-four-hour clock.</item> <label><term>descriptive</term></label> <item>indicates a temporal expression made in descriptive terms, e.g. <q>noon</q>.</item> </list></item> </list> </item> </list> <p>While adequate for many applications, these elements do not allow for the representation of the internal structure of expressions indicating dates or times, which may however be of importance for the correct interpretation of such expressions, or for certain kinds of analytic applications. In this section, we introduce the following special-purpose elements, for use when the internal structure of a temporal expression is to be encoded: <list type=gloss> <label><gi>dateStruct</gi></label> <item>contains an internally structured representation of a date.</item> <label><gi>timeStruct</gi></label> <item>contains an internally structured representation for a time of day.</item> </list> <p>Two types of temporal expressions are envisaged for dates and times: absolute and relative. An <term>absolute temporal expression</term> is composed of a sequence of the following elements, possibly interspersed with character data: <list type=gloss> <label><gi>day</gi></label> <item>the day component of a structured date.</item> <label><gi>week</gi></label> <item>the week component of a structured date.</item> <label><gi>month</gi></label> <item>the month component of a structured date.</item> <label><gi>year</gi></label> <item>the year component of a date.</item> <label><gi>second</gi></label> <item>the second component of a structured time-expression.</item> <label><gi>minute</gi></label> <item>the minute component of a structured time-expression.</item> <label><gi>hour</gi></label> <item>the hour component of a temporal expression such as</item> <label><gi>occasion</gi></label> <item>a temporal expression (either a date or a time) given in terms of a named occasion such as a holiday, a named time of day, or some notable event.</item> </list> <p>A <term>relative temporal expression</term> describes a date or time with reference to some other (absolute) temporal expression, and thus contains the following elements in addition to those listed above: <list type=gloss> <label><gi>distance</gi></label> <item>that part of a relative temporal or spatial expression which indicates the distance between the place or time denoted by it and the place or time referred to within it.</item> <label><gi>offset</gi></label> <item>that part of a relative temporal or spatial expression which indicates the direction of the offset between the two place names, dates, or times involved in the expression.</item> </list> <p>As members of the class <ident type=class>temporalExpr</ident> (temporal expression) these elements all share the following attributes: <list type=gloss> <label><att>value</att></label> <item>supplies the value of a date or time in a standard form.</item> <label><att>type</att></label> <item>provides any application-, linguistic- or culture-specific classication for the component.</item> <label><att>reg</att></label> <item>gives a normalized or regularized form of the temporal expression.</item> </list> <div3 id=NDdatea><head>Absolute Dates and Times <p>An absolute temporal expression which is a date will contain only a sequence of <gi>day</gi>, <gi>month</gi> <gi>week</gi>, <gi>year</gi> or <gi>occasion</gi> elements, as in the following examples: <eg><![ CDATA [ The university's view of American affairs produced a stinging attack by Edmund Burke in the Commons debate of <dateStruct value='26-10-1775'> <day value='26'>26</day> <month value='10'>October</month> <year value='1775'>1775</year> </dateStruct> ]]> </eg><!-- The History of the University of Oxford, vol v, 178 --> Component elements of a <gi>dateStruct</gi> may be repeated, provided that only a single temporal expression is intended: <eg><![ CDATA [ <dateStruct value='14-05-1993'> <day type=name>Friday</day>, <day type=number>14</day> <month>May</month> <year>1993</year> </dateStruct> ]]> </eg><!-- the headline on the Independent Newspaper on that day --> <p>The <gi>occasion</gi> element may be used for any component of a temporal expression which is given in terms of a named event, such as a public holiday for dates, or a named time such as <q>tea time</q> or <q>matins</q>: <eg> <![ CDATA [ In New York, <dateStruct value='1-1'> <occasion type=holiday>New Years Day</occasion> </dateStruct> is the quietest of holidays, <dateStruct value='4 July'> <occasion type=holiday>Independence Day</occasion> </dateStruct> the most turbulent. ]]> </eg> <p>These components may be applied to dates using any calendar system using subcomponents equivalent to those listed above: <eg> <![ CDATA [ <title>Le Vieux Cordelier: Journal rédigé par Camille Desmoulins, Quintidi Pluviose 2e décade, l'an 2 de la République Indivisible ]]>

Absolute temporal expressions denoting times which are given in terms of seconds, minutes, hours or of well defined events (e.g. noon, sunset) may similarly be represented using the timeStruct element: 13:45 At sunset we walked to the beach. The train leaves for Boston at a quarter of two ]]>

The type attribute may be used to distinguish sub-types of component elements (for example, months or days presented as words or as numbers) or to provide additional information about the function of this particular component (for example, to distinguish types of occasion). The value and reg attributes are both used to provide a standardized or regularized form of the content of an element. The distinction is that the value specified by the reg attribute is simply that chosen as a convenient way of grouping together a number of variant forms, whereas that specified for the value attribute must always be given in some application-dependent standard form, described in the stdVals element of the TEI header.

For example: June 9th : The period is approaching which will terminate my present copartnership. On the 1st Jany. next, it expires by its own limitation. ]]> Relative Dates and Times

As noted above, relative dates and times such as in the Two Hundredth and First Year of the Republic, twenty minutes before noon, and, more ambiguously, after the lamented death of the Doctor or an hour after the game have two distinct components. As well as the absolute temporal expression or event to which reference is made (e.g. noon, the game, the death of the Doctor [the foundation of] the Republic), they also contain a description of the distance between the time or date which is indicated and the referent expression (e.g. the Two Hundredth and First Year, twenty minutes, an hour); and (optionally) an offset describing the direction of the distance between the time or date indicated and the referent expression (e.g. of implying after, before, after).

The elements distance (or measure) and offset are used to encode these last two components within a dateStruct or timeStruct. The absolute temporal expression contained within the relative expression may be encoded using a occasion element, or by a nested dateStruct or timeStruct, or by a simple date or time. This allows for endlessly recursive structures such as the third Sunday after the first Monday before Lammastide in the fifth year of the King's second marriage ... --- but so does natural language. In the following examples, the reg attribute has been used to simplify processing of variant forms of expression: A fortnightbefore Christmas 1786 I reached the station about a half hour after the departure of the afternoon train to Boston ]]>

In the following example, the exact attribute has been used to indicate a lack of precision in the distance stated: just before sundown ]]>

In the following example, a nested dateStruct element is used to show that my birthday and the cited date are parts of the same temporal expression, and hence to disambiguate the phrase A week before my birthday on 9th December: A week before my birthday on 9thDecember ]]> The alternative reading of this phrase would be encoded as follows: A week before my birthday on 9thDecember ]]>

Where more complex or ambiguous expressions are involved, and where it is desirable to make more explicit the interpretive processes required, the feature structure notation described in chapter is recommended. Consider, for example, the following temporal expression which occurs in the Scottish Temperance Review of August 1850, referring to the summer holiday known in Glasgow simply as the Fair: during the Fair, a horrible nucleus of immorality and wickedness; it sends our multitudes to pollute and demoralize the country. ]]>

For the definition of the ana attribute, see chapter . It is used here to link the temporarl phrase with an interpretation of it. Like most traditional fairs and market days, the Glasgow Fair was established by local custom and could vary from year to year. Consequently, in order to provide such an interpretation, it is necessary to drawn upon additional information which may or may not be located in the particular text in question. In this case, it is necessary at least to know the spatial and temporal context (year and place) of the fair referred to. These and other features required for the analysis of this particular temporal expression may be combined together as one feature structure of type date-analysis: the Fair Glasgow 08-08-1850 19-09-1850 ]]> The elements described in this section are formally defined as follows: ]]>