Publication details

Annotated Corpus of Czech Case Law for Reference Recognition Tasks

Authors

HARAŠTA Jakub; ŠAVELKA Jaromír; KASL František; KOTKOVÁ Adéla; LOUTOCKÝ Pavel; MÍŠEK Jakub; PROCHÁZKOVÁ Daniela; PULLMANNOVÁ Helena; SEMENIŠÍN Petr; ŠEJNOVÁ Tamara; ŠIMKOVÁ Nikola; VOSINEK Michal; ZAVADILOVÁ Lucie; ZIBNER Jan

Year of publication 2018
Type Article in Proceedings
Conference Text, Speech, and Dialogue: 21st International Conference
MU Faculty or unit

Faculty of Law

Doi http://dx.doi.org/10.1007/978-3-030-00794-2_26
Keywords Reference recognition; dataset; legal texts; manual annotation
Description We describe an annotated corpus of 350 decisions of Czech top-tier courts, gathered for a project assessing the relevance of court decisions in Czech law. We describe two layers of processing of the corpus; every decision was annotated by two trained annotators and then manually adjudicated by one trained curator to resolve any disagreements between the annotators. The corpus was developed as training and testing material for reference recognition tasks, which will be further used for research on the assessment of legal importance. However, the overall shortage of available research corpora of annotated legal texts, particularly in the Czech language, leads us to believe that other research teams may also find it useful.