Unitary and composite textsUsing <text> to encode both individual texts and groups of texts
Groups of textsCriteria for deciding whether to use <group>
Embedding <text>Using <text> for encoding embedded narratives, letters, and other documents
Letters embedded within other worksMore details on encoding embedded letters, a special case of embedded texts.
Unique identification of the <text> elementDetails on using the id= attribute on <text>, both for independent documents and for embedded texts
Front matterGeneral notes on encoding front matter, including the various types of prefatory material
Back matterGeneral notes on encoding back matter, including the various types of concluding material
Divisions of the text: generalGeneral notes on the function and use of <div>, including its basic internal structure
Divisions of the text: types of <div>Specific discussion of possible types of <div>
Numbering: generalUse of the n= attribute and handling of line numbering
Numbering of divisionsUse of numbered versus unnumbered <div> elements
FrontispiecesUse of <div type="frontispiece">
AdvertisementsEncoding of two different kinds of advertisements: advertisements for other books, and preliminary addresses to the reader
Tables of contentsEncoding of tables of contents with <list> inside <div type="contents">, with internal encoding to capture the functional parts of the table of contents information, such as page numbers and titles.
Comparison of indexes and tables of contentsDifferences between tables of contents (ordered by location in the book) and indexes (ordered by topic).
HeadingsUse of <head> to encode headings, and permissible values for type= attribute
Heads and labelsComparison of headings and labels, and the use of <head> and <label>
ArgumentDefinition of argument, and use of <argument> element
EpigraphsEncoding of epigraphs with either <epigraph> or <div type="epigraph"> depending on their structural location.
DedicationsEncoding dedications using <div type="dedication">; distinguishing between dedications and other prefatory material
OpenersUsing <opener> to group together information at the top of a division (especially in letters); the usual contents of <opener>
Closers and trailers Using <closer> to group together information at the bottom of a division (especially in letters); the usual contents of <closer>; difference between <closer> and <trailer>
Salutes and signaturesEncoding of salutations and signatures in letters using <salute> and <signed>
Lists: subscriber listsEncoding subscriber lists using <list type="subscriber">, including handling of “ditto” used for repeated items
Lists: errataEncoding of errata lists with <div type="corrigenda"> and a nested <list type="errata">. Within each list item, further encoding captures the functional components such as the error, the page number, etc.
Overlapping and fragmented elementsStrategies for handling overlapping textual features, particularly quotations and poetry
Letters: general notesGeneral information on encoding letters in three different contexts: within collections, embedded in other works, and as prefatory material
Letters as prefatory materialEncoding letters which serve as prefatory material
Collections of lettersEncoding collections of letters using <div type="letter">, and description of internal structure
Postscripts, <ps>Encoding of postscripts using the WWP element <ps>
EssaysIdentification and encoding of essays
Journal and diary entriesEncoding of diary and journal entries using <div type="entry">
Lists: general notesEncoding lists, including discussion of criteria for identifying lists
QuotationsEncoding of quotations, distinction between use of <q> and <quote>, treatment of quotation marks
FiguresEncoding of figures and illustrations using <figure>; handling of text within figures; discussion of the WWP’s changes to the content model of <figure>
Encoding verse: general notesGeneral discussion of encoding poetry, including the use of <text>, <div>, and <lg> to encode basic poetic structures
Verse linesEncoding of verse lines and line breaks within verse lines
Types of poemEncoding basic verse types, including possible values for type= on <div> for poetry
Fixed poetic formsEncoding of clearly defined poetic forms such as sonnets, including possible values for type= on <div>, and instructions for marking internal subdivisions
Stanzas and generic formsEncoding of stanz
Specific line groupsEncoding of specific types of line group, such as couplets, quatrains, Spenserian stanzas, etc.
Excerpted and quoted poemsHandling of excerpted and quoted poems, including cases where the full extent of the original poem is unknown
Poetry and dramaDiscussion of the intersection between poetry and drama, including verse drama, poetry and songs in drama, and dramatic verse
Line breaks in verseLine breaks within individual verse lines should be encoded with <lb>.
Acts and scenesEncoding of dramatic acts and scenes, using <div type="act"> and <div type="scene">
Cast listsEncoding of cast lists using <castList>, situations where the original cast list is missing or incomplete, discussion of WWP changes to the TEI DTD
Speeches and speakersEncoding of dramatic speeches and speakers using <sp> and <speaker>, use of the who= attribute.
Stage directionsEncoding of stage directions in drama and verse dialogues, position of stage directions, identification of speakers within stage directions
<stage> type= attributeCategorization of stage directions using the type= attribute, with a list of permissible values
Simultaneous action in dramaHandling of simultaneous action in drama, in particular the encoding of cases where simultaneous action is marked with a printed brace
Principles of transcription: generalGeneral principles of transcription, including details of what is and is not captured, and the order in which it is represented
Regularization: silentFeatures which the WWP silently regularizes, including details of spacing, delimiters, type size, and typography
Regularization: <orig>Explicit regularization using <orig>
Punctuation: general Transcription of punctuation, including treatment of hard and soft hyphens
Punctuation and elementsPosition of punctuation relative to element boundaries
Transcription of primary sourcesUse of elements from the TEI tagset on transcription of primary sources
Features omitted from transcriptionUse of <gap/> to encode explicit omissions from the transcription, and cases where silent omission is allowed
Typography: I, J, U and V, generalTranscription and encoding of early typography using <orig>
Tagging the letter, tagging the wordApplication of <sic>, <orig>, and <abbr> at the word and letter level
Typography: recognizing difficult letter formsDiscussion of specific letterforms in the WWP collection, including long s, disambiguation of I and J, U and V
Special characters: entity referencesUse of entity references for special characters, boilerplate, and decorative features of the text
Special characters: ordinary characters requiring special treatmentFurther detail on ordinary characters which must be encoded with entity references in particular contexts or because they serve special functions
Special characters: brevigraphs and diacritical marksUsing entity references to transcribe brevigraphs and characters with diacritical marks
Special characters: miscellaneousDetails of various kinds of special characters not covered elsewhere
EllipsisEncoding of ellipsis using the entity reference &hellip;
Roman numeralsTranscription of roman numerals, and regularization of roman numeral dates
Errors in the originalEncoding of errors in the document source using <sic>; situations where corr= is and is not used; distinguishing between error and old spelling
Sequencing errorsEncoding of errors in sequencing, such as scene or page numbering
Reading orderDiscussion of the principle of “reading order” to guide the order of transcription in cases where the text flow contains parallel or non-sequential segments
Handwriting: the hand= attribute and the <hand> elementIdentification of handwriting using <hand> and the hand= attribute
Handwriting: additions and deletionsEncoding handwritten additions and deletions using <add>, <addSpan>, <del>, and <gap/>
Unclear textHandling damaged, unclear, or illegible text, including missing or deleted letters, damage to the original, or unclarity in the reproduction, using <sic>, <del>, <unclear>, <supplied>, and <gap/>
Gap: generalGeneral notes on the use of <gap/> to encode material omitted from transcription
Gap: use of the extent attributeDetailed notes on the use of the extent= attribute on <gap/> to indicate the extent of text being omitted from transcription
Gap: use of the extent attribute, advancedExcruciatingly detailed information on the use of the extent= attribute on <gap/> to encode the signature sequences of pages omitted from transcription
Encoding document appearance: renditional informationGeneral notes on what kinds of renditional information we do and do not capture
Renditional distinction: overviewThe WWP uses a decision tree to help determine how to encode different kinds of renditionally distinct phrase-level text features.
Special typographyGeneral notes on what aspects of typography the WWP does and does not capture
Small capsEncoding of small capital letters, including notes on how the WWP defines and recognizes small capitals, and how they should be transcribed
Spacing and sizingRegularization of sizing and spacing, including regularization of vertical and horizontal space and of type size
Decorative capitalizationDecorative capitalization should be encoded with <hi>, with an optional type= attribute if categorization is useful.
DashesEncoding of dashes, including em-dashes and en-dashes, using entity references
Special characters: inverted charactersTreatment of characters which are printed upside down in the source
Rules and ornaments: definitionsTranscription of rules and ornaments using an entity reference
Quotation marksQuotation marks should be captured where possible as renditional information modifying the element that motivates their appearance.
Punctuation and quotesTranscription of punctuation in relation to quotation marks and the <q> and <quote> elements
Punctuation and fontTreatment of the font of punctuation, particularly in cases where the font is not accurately captured by the element context
Font of numbersTreatment of the font of numbers
LeadersTranscription of leaders (separators within column-formatted lists such as tables of contents)
ColumnsEncoding of multi-column layouts using the columns keyword in the renditional ladder
Special characters: superscriptionTreatment of superscripted characters using the rend= attribute or an entity reference
IndentationTreatment of indentation using the indent keyword in the rendition ladder; handling of indentation resulting from an enlarged initial capital letter
White spaceTreatment of vertical and horizontal white space
Type size and faceTreatment of type size and type face, information the WWP does and does not record
Rules and ornaments: use as delimitersEncoding of rules and ornaments as delimiters on elements, using the rend= attribute
Renditional defaultsMethods of setting renditional defaults, using the <tagsDecl> in the TEI header
Encoding document appearance: rendition ladders overviewGeneral notes on the use of the rendition ladder in the rend= attribute, overview of keyword/value structure
Rendition ladders: common keywords and valuesOverview of keywords and values used in the rendition ladder
Renditional keywords: break, and line break defaults for WWP elementsUse of the break keyword to capture line breaks between elements, including defaults assumed in WWP practice
Renditional keywords: slant and weightUse of the slant and weight keywords to capture italicization and bold type
Renditional keywords: pre, postUse of the pre and post keywords to capture characters printed before or after an element (used as delimiters)
Renditional keywords: caseUse of the case keywordto capture case, and approaches to transcription
Renditional keywords: general points on indentationOverview of the encoding of indentation, including absolute and relative indentation, first-line indentation, and negative indentation
Renditional keywords: indentSpecifics on the use of the indent keyword to encode indentation (absolute and relative)
Renditional keywords: first-indent and right-indentSpecifics on the use of the first-indent keyword and right-indent keyword to encode first-line indentation and right indentation
Renditional keywords: alignmentUse of the align keyword to encode horizontal alignment of elements whose position on the page is vertically constrained
Renditional keywords: placeUse of the place keyword to encode the vertical and horizontal position of elements whose position on the page is unconstrained
Renditional keywords: sub and supUse of the sub and sup keywords to encode subscription and superscription of letters
Renditional keywords: columnsUse of the columns keyword to indicate the number of columns in a page layout
Renditional keywords: posUse of the pos keyword to indicate the position of the remainder of a verse line which is printed on the line above or below
Renditional keywords: bracedUse of the braced keyword to encode bracing used to group together multiple lines (e.g. lines of poetry)
Renditional keywords: fill(Non)use of the fill keyword
Rendition ladders: borderUse of the border keyword to encode borders around elements
Renditional keywords: bestow and bequeathUse of the bestow and bequeath keywords to propagate renditional information from an element to its children or descendants
Renditional keywords: get(Non)use of the get keyword to duplicate the renditional features of a given element on other elements
Phrase-level encoding: general notesThe WWP does not include phrase-level encoding in textual apparatus that duplicates content elsewhere in the text.
Names: general notesOverview of the WWP’s encoding of names, including personal names, place names, organizational names, and the names of objects
Names of humansDiscussion of the encoding of human names using <persName>, including criteria for identifying creatures as human, and guidelines for nesting name elements
Names of placesDiscussion of encoding the names of places using <placeName>, including definition of “place” and relationship between place names and personal names
Names of non-humans and thingsDiscussion of the encoding of the names of non-human creatures, things, and events using <name>
Names of collectivities and organizationsDiscussion of encoding the names of collectivities and organizations using <name> and <orgName>, including distinctions between collectivities and organizations
Names: difficult casesDiscussion of some difficult cases in the encoding of names, including lists of boundary cases
Names: problems of multiple referenceDiscussion of encoding personal names that refer to more than one person
Names: abbreviationsEncoding of abbreviated versions of names
Name keysUse of the key= attribute on <persName> to uniquely identify individuals
Special terminology, irony, and other forms of textual highlightingEncoding of specialized language, including technical terminology, ironic usage, and words which are being discussed as words rather than used
EmphasisThe <emph> element should be used for linguistic emphasis, where that can be distinguished from casual or decorative highlighting and from other motivating factors such as titles, foreign words, and so forth.
AbbreviationsEncoding of abbreviations using <abbr>, including a list of common abbreviations which are not tagged, and treatment of punctuation
Abbreviations and <orig>Use of the <abbr> element in connection with old-style typography
Authors in the main textEncoding of authors in bibliographic entries, using <author> and <persName>
Titles in the main textEncoding of titles in bibliographic entries and in running prose, using <title>, including criteria for identifying titles
Foreign words and phrasesEncoding foreign-language words and phrases using the lang= attribute on existing elements, and the <foreign> element when necessary
<mcr>The WWP uses <mcr> to encode phrase-level renditionally distinct words and phrases that cannot be assigned to any more specific category.
Simple highlightingEncoding of simple renditional highlighting using <hi>
Proper adjectivesEncoding of proper adjectives using <mcr>
Referencing strings (the <rs> element)Use of the <rs> element
Measures and numbersEncoding of numbers and measurements using <measure>
Dates: general Encoding dates using <date> and the value= attribute, including detailed instructions on the ISO8601 standard for date values
Dates: date rangesEncoding date ranges using the <date> element rather than <dateRange>
Dates, errors inEncoding errors in dates
Dates: BC datesEncoding of BC dates
Dates: Julian calendar and old-style datesEncoding of old-style dates and dates expressed in the Julian calendar
TimeEncoding of time using <time> and the value= attribute; our usage limited to cases which are used to structure a set of entries in a journal or log
<unknown>Use of the <unknown> element as a placeholder to flag textual features for which the correct encoding is uncertain
Title pagesEncoding of title pages using <titleBlock>, including a description of possible values for type=, and the various parts of the title page and how to encode them
Document titlesEncoding of document titles on the title page using <docTitle> and <titlePart>, including possible values for the type= attribute of <titlePart>
Authorship of the documentEncoding attributions of responsibility using <respLine>
ColophonsEncoding colophons using <titleBlock type="colophon">
Forme work (metawork): generalEncoding various types of forme work (including page numbers, line numbers, catchwords, press figures, signatures, and a few other features) using the <mw> element
Forme work, encoding withinDiscussion of the types of encoding which may appear within the <mw> element
Forme work: renditional issuesEncoding renditional distinctions within <mw>
Page breaks and page numberingEncoding of page breaks and page numbering using the <pb/> element and its n= attribute, including guidelines for creating idealized page number sequences
Signatures Encoding of the collation of the document, recording both printed signatures as they appear on the page using <mw type="signature"> and also an idealized signature sequence using <milestone unit="sig"/>
Line breaks: generalLine breaks in general are encoded with <lb>, with the exception of verse lines.
Line numbersEncoding line numbers that are printed in the original text using <mw type="lineNum">
Milestones, <mw>, and <div> boundariesDiscussion of the order and location of elements associated with page breaks (catchwords, milestones, page numbers, etc.)
CatchwordsEncoding of catchwords using <mw type="catch">, including how to handle discrepancies between the catchword and the matching word in the text
Running headers(Non)encoding of running headers
Hyphens, soft and hardEncoding of hard and soft hyphens, including guidelines for determining when a line-end hyphen is soft
Notes, endnotes, and footnotes: overviewGeneral overview of encoding notes, including footnotes, endnotes, marginal notes, and inline notes, and giving a summary of how notes are linked to the main text
EndnotesEncoding of endnotes using <note>, within a separate <div type="endnotes">
Inline notesEncoding of inline notes, using <note rend="place(inline)">
Notes in the TEI headerEncoding of textual notes in the <notesStmt> of the TEI header
Notes: encoding the note itselfEncoding the text of a note, including details of the WWP’s changes to the content model of note, and discussion of the various things that appear in notes
Notes: linking the note and the textNotes (including footnotes, endnotes, and marginal notes) should be linked to their anchor point using a bi-directional link which explicitly identifies both the anchor point and the note.
Notes: resp= and type=Use of the resp= and type= attributes on <note> to capture the authorship of the note
Notes, page breaks withinEncoding page breaks within footnotes, using a second <pb/> element which points to the main <pb/> element
Notes: revised content modelDescription of the WWP’s revised content model for notes
<hyperDiv>Use of the WWP <hyperDiv> element as a container for notes and other hypertextual components of the text, such as acrostics and supplemental cast lists
Bibliographic references Encoding of bibliographic references using <bibl>, including guidelines for identifying bibliographic references and when not to encode them.
Links and cross-referencesEncoding of links and cross-references using <ref> and <xref>
<xref>, <xptr>Details of the WWP’s use of <xref> and <xptr/>
Repetitions in listsEncoding repetitions signalled with “ditto”, “ibid”, and similar markers, using the sameAs= attribute
AcrosticsEncoding acrostics, and in particular capturing the encrypted word or words from the acrostic in a searchable form, using the WWP <acrostic> element
TEI HeaderGeneral notes on the TEI header and its use at the WWP
TEI Header, ID attribute ofUse of the id= attribute on <teiHeader>, including the format for its value
Titles in TEI headerEncoding two different forms of the document title in the TEI header, in <titleStmt> and in <sourceDesc>
Key= in TEI HeaderEncoding key= values on names in the TEI header
ID values: general notesGeneral information on use of id= in WWP texts, including guidelines for choosing values
ID values of texts and sub-textsGuidelines for encoding id= values on <text>

list all entries