Publication details

Automatic Segmentation of Czech Court Decisions into Multi-Paragraph Parts

Authors

HARAŠTA Jakub, ŠAVELKA Jaromír, KASL František, MÍŠEK Jakub

Type: Article in Periodical
Journal / Source: Jusletter IT
MU Faculty or unit

Faculty of Law

Keywords: text segmentation; NLP; case law
Description: The authors describe a tool for automatically segmenting decisions of the Czech top-tier courts (Supreme Court, Supreme Administrative Court, and Constitutional Court) into multi-paragraph parts. The tool segments a decision into Header, Party Response, Proceeding Summary, Court Argumentation, Footer, Dissent, and Footnotes. Segmenting text into multi-paragraph parts makes it possible to treat different parts differently even when they contain similar linguistic or other features. This is useful in data processing pipelines; the tool is planned for use in automatic reference recognition.
Related projects: