Depending on the amount of data to process, file generation may take longer.

If it takes too long to generate, you can limit the data by, for example, reducing the range of years.

Chapter

Download BibTeX

Title

On Missing Labels, Long-tails and Propensities in Extreme Multi-label Classification

Authors

[ 1 ] Instytut Informatyki, Wydział Informatyki i Telekomunikacji, Politechnika Poznańska | [ D ] phd student | [ P ] employee

Scientific discipline (Law 2.0)

[2.3] Information and communication technology

Year of publication

2022

Chapter type

chapter in monograph / paper

Publication language

english

Keywords
EN
  • extreme classification
  • multi-label classification
  • propensity model
  • missing labels
  • long-tail labels
  • recommendation
Abstract

PL The propensity model introduced by Jain et al has become a standard approach for dealing with missing and long-tail labels in extreme multi-label classification (XMLC). In this paper, we critically revise this approach showing that despite its theoretical soundness, its application in contemporary XMLC works is debatable. We exhaustively discuss the flaws of the propensity-based approach, and present several recipes, some of them related to solutions used in search engines and recommender systems, that we believe constitute promising alternatives to be followed in XMLC.

Date of online publication

14.08.2022

Pages (from - to)

1547 - 1557

DOI

10.1145/3534678.3539466

URL

https://dl.acm.org/doi/pdf/10.1145/3534678.3539466

Book

Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining KDD '22

Presented on

28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '22), 14-18.08.2022, Washington, United States

License type

copyright

Open Access Mode

publisher's website

Open Access Text Version

final published version

Date of Open Access to the publication

at the time of publication

Ministry points / chapter

20

Ministry points / conference (CORE)

200

This website uses cookies to remember the authenticated session of the user. For more information, read about Cookies and Privacy Policy.