Informace o publikaci

KernelTagger – a PoS Tagger for Very Small Amount of Training Data



Rok publikování 2017
Druh Článek ve sborníku
Konference Proceedings of the Eleventh Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2017
Fakulta / Pracoviště MU

Fakulta informatiky

Obor Informatika
Klíčová slova PoS tagging; morphological tagging; language model; Czech
Popis The paper describes a new Part of speech (PoS) tagger which can learn a PoS tagging language model from very short annotated text with the use of much bigger non-annotated text. Only several sentences could be used for training to achieve much better accuracy than a baseline. The results cannot be compared to the results of state-of-the-art taggers but it could be used during the annotation process for a pre-annotation.
Související projekty: