Information Extraction for Czech Based on Syntactic Analysis
|Year of publication
|Article in Proceedings
|Human Language Technology Challenges for Computer Science and Linguistics
|MU Faculty or unit
|information extraction; Czech language; syntactic analysis
|We present a complex pipeline of natural language processing tools for Czech that performs extraction of basic facts presented in a text. The input for the tool is a plain text, the output contains verb and noun phrases with basic semantic classification. Automatic syntactic analysis of Czech plays a crucial role in the pipeline. In this paper, we describe the particular tools used in the system, then we give an example of its usage and conclude with a basic evaluation of the overall system accuracy.