Old Catalan Morphosyntax: Developing an Annotated Corpus
This paper presents a full procedure for the development of a Part-of-Speech (POS) tagged corpus of Old Catalan.As an extremely low-resource language DVD with rich inflection and frequent homographs, Old Catalan poses non-trivial problems in the development of a searchable constituency-based treebank.We demonstrate, however, that a semi- supervised