Publication details

Lexicographic Tools to Build New Encyclopaedia of the Czech Language

Authors

HORÁK Aleš RAMBOUSEK Adam

Year of publication 2016
Type Article in Periodical
Magazine / Source The Prague Bulletin of Mathematical Linguistics
MU Faculty or unit

Faculty of Informatics

Citation
Web https://ufal.mff.cuni.cz/pbml/106/art-horak-rambousek.pdf
Doi http://dx.doi.org/10.1515/pralin-2016-0019
Field Informatics
Keywords encyclopaedia; lexicographic tools; DEB platform
Attached files
Description The first edition of the Encyclopaedia of the Czech Language was published in 2002 and since that time it has established as one of the basic reference books for the study of the Czech language and related linguistic disciplines. However, many new concepts and even new research areas have emerged since that publication. That is why a preparation of a complete new edition of the encyclopaedia started in 2011, rather than just re-printing the previous version with supplements. The new edition covers current research status in all concepts connected with the linguistic studies of (prevalently, but not solely) the Czech language. The project proceeded for five years and it has finished at the end of 2015, the printed edition is currently in preparation. An important innovation of the new encyclopaedia lies in the decision that the new edition will be published both as a printed book and as an electronic on-line encyclopaedia, utilizing the many advantages of electronic dictionaries. In this paper, we describe the lexicographic platform used for the Encyclopaedia preparation and the process behind the work flow consisting of more than 3,000 pages written by nearly 200 authors from all over the world. The paper covers the process of managing entry submissions, the development of tools to convert word processor files to an XML database, tools to cross-check and connect bibliography references from free text to structured bibliography entries, and the preparation of data for the printed publication
Related projects: