Insert title here

Table of contents

1. Style markup

Doth he like encoding stylistic stuff? It's a form of intellectual manual labor.

Question: why do values of @ana start with a # hash symbol?

Hint:

Question: what does the value of @ana point to?

2. Punctuation

You've now seen a few uses of markup. Some less familiar than others, e. g. the · lmiddle dot ·

3. An online part of speech (POS) tagger

Online POS tagging 1 (part of speech only):

Question: How can I embed that very colorful output in my encoding (in my TEI XML file)?

4. Another online POS tagger

Question: How can I embed this output in my encoding (in my TEI XML file)?

5. Encoding POS tags with @ana

Linguistic Annotation by Means of Generic TEI DevicesDefinite articleAdverbConjunctionRelative thatNoun singularNoun pluralProper nounGenitive markerPrepositionVerb past tense

Encoding sample taken from TEI Guidelines, 17.4.1:

The victim 's friends told police that Kruger drove into the quarry and never surfaced

Question: why do values of @ana start with a # hash symbol? (Hint: see above)

Question: do the tags in the tagset above (AT0, AD0 etc.) only represent parts of speech (article, adverb etc.), or also other information? If so, what additional information?

Exercise: In the previous source code, edit the values of @ana as suggested in the comments.

6. Encoding POS tags with @pos

The victim 's friends told police that Kruger drove into the quarry and never surfaced

Question: why do values of @pos do not start with a # hash symbol?

Hint:

Exercise: In the previous source code, edit the values of @pos as suggested in the comments.

7. Complete linguistic annotation

Question: Compare the tagset used in the sample from the TEI Guidelines with those used in the aihaiyang.com website (Stanford parser). Do you see any differences?

The victim 's friends told police

Exercise 1: edit the source code above and replace the Guidelines POS tags ('AT0' etc.) with the aihaiyang.com / Stanford parser tags ('DT' etc.).

Esercise 2: edit the same source code above and provide attributes for @lemma and @msd. Note that:

8. Real-world examples of TEI-encoded linguistic annotation

(A) My edition of Ursus Beneventanus medieval latin, grammar book), lemmatized with TreeTagger.

differt dictio a syllaba

(B) Same text, same tagger, but tagged with a different "parameter file".

differt dictio a syllaba

Question: what differences do you notice in the encoding of 'w' elements the above edited code snippets and the original markup in the online TEI files?

(C) The Ancient Greek and Latin Dependency Treebank:

Question(s): is the "Ancient Greek and Latin Dependency Treebank" source code XML? Is it TEI? If it's not, what differences do you notice? Why do you think that they chose that encoding?

9. German cases: @ana

German casesNominativeGenitiveDativeAccusative

Ein Mops kam in die Küche Und stahl dem Koch ein Ei

10. German cases: @msd

Exercise: edit the source code below.

Ein Mops kam in die Küche

Question: if you use @msd instead of @ana, do you need to include 'interp' elements?

Insert author here. Date: Rhetoric figuresMetaphorAnaphoraEnthymemeApophasis (praeteritio)AposiopesisSententiaHypophoraOxymoronHypotyposisAntiphrasisPolyptoton Lexical markersLolloquial languageArchaismJargonPidginCreole languageSociolectMilitary languageReligious Function of punctuation marksAbbreviation markElisionArchaic punctuationSyntactical function (division and organization of clauses)Citation Ethno-phoneticsUnstressed vowel not pronouncedMispronunciation of /r/Missing apheresis Textual linguisticsDeixis