Depending on the amount of data to process, file generation may take longer.

If it takes too long to generate, you can limit the data by, for example, reducing the range of years.

Chapter

Download BibTeX

Title

Similarity Forests Revisited: A Swiss Army Knife for Machine Learning

Authors

[ 1 ] Instytut Informatyki, Wydział Informatyki i Telekomunikacji, Politechnika Poznańska | [ P ] employee

Scientific discipline (Law 2.0)

[2.3] Information and communication technology

Year of publication

2021

Chapter type

chapter in monograph / paper

Publication language

english

Keywords
EN
  • decision trees
  • random forests
  • similarity forests
Abstract

EN Random Forests are one of the most reliable and robust general-purpose machine learning algorithms. They provide very competitive baselines for more complex algorithms. Recently, a new algorithm has been introduced into the family of decision tree learners – Similarity Forests, aiming at mitigating some of the well-known deficiencies of Random Forests. In this paper we extend the originally proposed Similarity Forests algorithm to one-class classification, multi-class classification, regression and metric learning tasks. We also introduce two new criteria for split evaluation in regression learning. The results of conducted experiments show that Similarity Forests can be a competitive alternative to Random Forests, in particular, when high quality data representation is difficult to obtain.

Date of online publication

08.05.2021

Pages (from - to)

42 - 53

DOI

10.1007/978-3-030-75765-6_4

URL

https://link.springer.com/chapter/10.1007/978-3-030-75765-6_4

Book

Advances in Knowledge Discovery and Data Mining : 25th Pacific-Asia Conference, PAKDD 2021, Virtual Event, May 11–14, 2021, Proceedings, Part II

Presented on

25th Pacific-Asia Conference on Knowledge Discovery and Data Mining PAKDD 2021, 11-14.05.2021, Delhi, India

Ministry points / chapter

20

Ministry points / conference (CORE)

140

This website uses cookies to remember the authenticated session of the user. For more information, read about Cookies and Privacy Policy.