Publication details

Annotated Corpus of Czech Case Law for Reference Recognition Tasks

Authors

HARAŠTA Jakub, ŠAVELKA Jaromír, KASL František, KOTKOVÁ Adéla, LOUTOCKÝ Pavel, MÍŠEK Jakub, PROCHÁZKOVÁ Daniela, PULLMANNOVÁ Helena, SEMENIŠÍN Petr, ŠEJNOVÁ Tamara, ŠIMKOVÁ Nikola, VOSINEK Michal, ZAVADILOVÁ Lucie, ZIBNER Jan

Type Article in proceedings
Conference Text, Speech, and Dialogue: 21st International Conference
MU Faculty or unit

Faculty of Law

Citation
DOI http://dx.doi.org/10.1007/978-3-030-00794-2_26
Keywords Reference recognition; dataset; legal texts; manual annotation
Description We describe an annotated corpus of 350 decisions of Czech top-tier courts, gathered for a project assessing the relevance of court decisions in Czech law. We describe two layers of processing of the corpus: every decision was annotated by two trained annotators and then manually adjudicated by one trained curator to resolve any disagreements between the annotators. The corpus was developed as training and testing material for reference recognition tasks, which will in turn be used for research on the assessment of legal importance. However, the overall shortage of available research corpora of annotated legal texts, particularly in the Czech language, leads us to believe that other research teams may also find it useful.
Related projects: