Publication details

Text Segmentation Using Context Overlap

Authors

ŘEHŮŘEK Radim

Year of publication 2007
Type Article in Periodical
Magazine / Source Progress in Artificial Intelligence
MU Faculty or unit

Faculty of Informatics

Citation
Web http://www.springerlink.com/content/k820g107h7067383/?p=9cc314a6a70b4ca286722b609e097494&pi=0
Field Informatics
Keywords text segmentation; LSI; latent semantic indexing
Description In this paper we propose features desirable of linear text segmentation algorithms for the Information Retrieval domain, with emphasis on improving high similarity search of heterogeneous texts. We proceed to describe a robust purely statistical method, based on context overlap exploitation, that exhibits these desired features. Experimental results are presented, along with comparison to other existing algorithms.
Related projects:

You are running an old browser version. We recommend updating your browser to its latest version.

More info