germamania.blogg.se

Text encoding initiative tei
Text encoding initiative tei












text encoding initiative tei

These rules include a complete list of allowable elements and attributes, special character entities, rules for external files (such as images), as well as the hierarchical structure of all elements. The DTD defines the structural rules of a type of document.

text encoding initiative tei

It is important have a DTD that is appropriate for the project. TEI includes tags that are specific to a particular genre - drama, poetry, prose. TEI may be customized to fit the needs of the project. It is this strict tree structure that makes it possible to reliably search a TEI document and to apply stylesheets for display to the user. The entire document is considered the " root element", with other features, such as sections, chapters, pages, paragraphs, titles, etc., branching off of the root. TEI is used to organize text into a strict " document tree". ^ is represented with character entity ^ How Does TEI Work? Character entities may be internal or external to the XML document. A character entity file is an index of the special characters and is accessed when displaying a document. More than 65,536 characters can be represented using Unicode.

#TEXT ENCODING INITIATIVE TEI CODE#

The middle component is a code from the Unicode 16-bit character set. In XML, codes for special characters typically begin with "&#" and end with a semicolon ( ). In XML, an ampersand is coded as & #x0026. For example, an ampersand is coded as & in HTML. How these characters are represented varies in HTML and XML. Examples include characters with diacritics and special symbols, such as the copyright sign or an ampersand. Special characters include characters that are not found on a standard English-language keyboard or that are not one of the 128 characters of the US-ASCII character code set. In addition, tags may be placed around typographical characteristics such as text that is underlined, italicized, superscripted, etc., and around text that needs special emphasis such as foreign words, misspellings, proper names, etc. TEI tags describe the characteristics of a given text.įor example, TEI tags may be used to indicate paragraph and line breaks, pagination, and major divisions of a text such as volumes, chapters, and sections. These are the basic tags that almost all TEI documents include: TEI tags describe the structural hierarchies, divisions, and characteristics of a given document. The Electronic Text Center uses Text Encoding Initiative (TEI) tag sets and rules, an application of the Extensible Markup Language (XML), to encode texts. TEI Lite, a somewhat smaller version of TEI, includes a subset of the whole TEI tag set selected to include the most commonly needed tags. TEI, the Text Encoding Initiative was founded in 1987 to develop guidelines for encoding machine-readable texts of interest to the humanities and social sciences.














Text encoding initiative tei