Hierarchical clustering of text corpora using suffix trees

Irmina Masłowska; Roman Słowiński

doi:10.1007/978-3-540-36562-4_19

Scientific Information System of the Poznań University of Technology

PL EN

Main page / Publications / Hierarchical clustering of text corpora using suffix trees

Submit a comment

Chapter

Download BibTeX

Title

Hierarchical clustering of text corpora using suffix trees

Authors

Irmina Masłowska ^{[ 1 ][ P ]}
Roman Słowiński ^{[ 1 ][ P ]}

^{[ 1 ]} Instytut Informatyki (II), Wydział Informatyki i Zarządzania, Politechnika Poznańska | ^{[ P ]} employee

Year of publication

2003

Chapter type

paper

Publication language

english

Abstract

EN We present a novel method for hierarchical clustering of text corpora, which proves especially suitable for online clustering. Information overload — the current phenomenon in electronic document repositories and the Internet in particular — constitutes an unceasing challenge for researchers. Clustering has been proposed as a comprehensive information access method. We describe a system, which automatically builds a navigable hierarchy of meaningful document groups. We claim that our system addresses two chief needs of the Web users: the need for efficient access to the up-to-date information on every available topic and the need for an organized and meaningful presentation of the desired information.

Pages (from - to)

179 - 188

DOI

10.1007/978-3-540-36562-4_19

URL

https://link.springer.com/chapter/10.1007/978-3-540-36562-4_19

Book

Intelligent Information Processing and Web Mining : proceedings of the International IIS : IIPWM'03 Conference held in Zakopane, Poland, June 2-5, 2003

Presented on

Intelligent information processing and web mining IIPWM'03, 2-5.06.2003, Zakopane, Polska

System created by Poznań University of Technology and Poznan Supercomputing and Networking Center

Log in through eKonto to add to SIS