Lingo: Search Results Clustering Algorithm Based on Singular Value Decomposition
[ 1 ] Instytut Informatyki (II), Wydział Informatyki i Zarządzania, Politechnika Poznańska | [ P ] pracownik
2004
rozdział w monografii naukowej / referat
angielski
EN Search results clustering problem is defined as an automatic, on-line grouping of similar documents in a search results list returned from a search engine. In this paper we present Lingo—a novel algorithm for clustering search results, which emphasizes cluster description quality. We describe methods used in the algorithm: algebraic transformations of the term-document matrix and frequent phrase extraction using suffix arrays. Finally, we discuss results acquired from an empirical evaluation of the algorithm.
359 - 368
International IIS: IIPWM‘04 Conference, 17-20.05.2004, Zakopane, Polska