Publication details

Idiomatic Expressions in VerbaLex



Type Article in Proceedings
Conference Proceedings of the Eleventh Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2017
MU Faculty or unit

Faculty of Informatics

Field Informatics
Keywords idioms; verb phrases; verb frames; valency lexicon; corpus
Description Idiomatic expressions are part of everyday language, therefore NLP applications that can ``understand'' idioms are desirable. The nature of idioms is somewhat heterogenous - idioms form classes differing in many aspects (e.g. syntactic structure, lexical and syntactic fixedness). Although dictionaries of idioms exist, they usually do not contain information about fixedness or frequency since they are intended to be used by humans, not computer programs. In this work, we propose how to deal with idioms in the valency lexicon VerbaLex using automatically extracted information from the largest dictionary Czech idioms and a web corpus.
Related projects: