Depending on the amount of data to process, file generation may take longer.

If it takes too long to generate, you can limit the data by, for example, reducing the range of years.

Chapter

Download BibTeX

Title

Hierarchical clustering of text corpora using suffix trees

Authors

[ 1 ] Instytut Informatyki (II), Wydział Informatyki i Zarządzania, Politechnika Poznańska | [ P ] employee

Year of publication

2003

Chapter type

paper

Publication language

english

Abstract

EN We present a novel method for hierarchical clustering of text corpora, which proves especially suitable for online clustering. Information overload — the current phenomenon in electronic document repositories and the Internet in particular — constitutes an unceasing challenge for researchers. Clustering has been proposed as a comprehensive information access method. We describe a system, which automatically builds a navigable hierarchy of meaningful document groups. We claim that our system addresses two chief needs of the Web users: the need for efficient access to the up-to-date information on every available topic and the need for an organized and meaningful presentation of the desired information.

Pages (from - to)

179 - 188

DOI

10.1007/978-3-540-36562-4_19

URL

https://link.springer.com/chapter/10.1007/978-3-540-36562-4_19

Book

Intelligent Information Processing and Web Mining : proceedings of the International IIS : IIPWM'03 Conference held in Zakopane, Poland, June 2-5, 2003

Presented on

Intelligent information processing and web mining IIPWM'03, 2-5.06.2003, Zakopane, Polska

This website uses cookies to remember the authenticated session of the user. For more information, read about Cookies and Privacy Policy.