Web Interface and Collection for Mathematical Retrieval : WebMIaS and MREC
|Year of publication
|Article in Proceedings
|DML 2011: Towards a Digital Mathematics Library
|MU Faculty or unit
|math indexing and retrieval; mathematical digital libraries; information systems; information retrieval; mathematical content search; document ranking of mathematical papers; math text mining; WebMIaS; MIaS; Tralics; TeX; UMCL; Lucene
|We demonstrate searching of mathematical expressions in technical digital libraries on a MREC collection of 439,423 real scientific documents with more than 158 million mathematical formulae. Our solution - the WebMIaS system - allows the retrieval of mathematical expressions written in TEX or MathML. TEX queries are converted on-the-fly into tree representations of Presentation MathML, which is used for indexing. WebMIaS allows complex queries composed of plain text and mathematical formulae, using MIaS (Math Indexer and Searcher), a math aware search engine based on the state-of-the-art system Lucene. MIaS implements proximity math indexing with a subformulae similarity search.