Hierarchical clustering of text corpora using suffix trees
[ 1 ] Instytut Informatyki (II), Wydział Informatyki i Zarządzania, Politechnika Poznańska | [ P ] pracownik
2003
referat
angielski
EN We present a novel method for hierarchical clustering of text corpora, which proves especially suitable for online clustering. Information overload — the current phenomenon in electronic document repositories and the Internet in particular — constitutes an unceasing challenge for researchers. Clustering has been proposed as a comprehensive information access method. We describe a system, which automatically builds a navigable hierarchy of meaningful document groups. We claim that our system addresses two chief needs of the Web users: the need for efficient access to the up-to-date information on every available topic and the need for an organized and meaningful presentation of the desired information.
179 - 188