Publication details

Expanding hyphenation patterns across Slavic languages

Authors

SOJKA Ondřej

Year of publication 2024
Type Article in Periodical
Magazine / Source TUGboat : The Communications of the TeX Users Group
Citation
web https://tug.org/TUGboat/tb45-2/tb140sojka-slavic.pdf
Doi https://doi.org/10.47397/tb/45-2/tb140sojka-slavic
Attached files
Description So far, TEX hyphenation patterns, even for related languages, have been developed separately for each language, splitting scarce human resources. As languages develop and (especially) English terms creep into formerly monolingual texts, hyphenation patterns, especially for medium- and low-resource languages which often lack quality generated patterns, are due for an update. In this article, we explore the possibilities for transfer learning of hyphenation rules between related Slavic languages. We present new hyphenation patterns for multiple Slavic languages, developed using transfer learning from various sources.

You are running an old browser version. We recommend updating your browser to its latest version.

More info