Artikel in einem Konferenzbericht,

The Impact of Negative Relevance Judgments on NDCG

L. Gienapp, M. Fröbe, M. Hagen, und M. Potthast.
Proceedings of the 29th ACM International Conference on Information & Knowledge Management, Seite 2037–2040. New York, NY, USA, Association for Computing Machinery, (2020)
DOI: 10.1145/3340531.3412123

Zusammenfassung

NDCG is one of the most commonly used measures to quantify system performance in retrieval experiments. Though originally not considered, graded relevance judgments nowadays frequently include negative labels. Negative relevance labels cause NDCG to be unbounded. This is probably why widely used implementations of NDCG map negative relevance labels to zero, thus ensuring the resulting scores to originate from the 0,1 range. But zeroing negative labels discards valuable relevance information, e.g., by treating spam documents the same as unjudged ones, which are assigned the relevance label of zero by default. We show that, instead of zeroing negative labels, a min-max-normalization of NDCG retains its statistical power while improving its reliability and stability.

BibTeX-Schlüssel: 10.1145/3340531.3412123
Eintragstyp: inproceedings
Adresse: New York, NY, USA
Buchtitel: Proceedings of the 29th ACM International Conference on Information & Knowledge Management
Jahr: 2020
Seiten: 2037–2040
Verlag: Association for Computing Machinery
Reihe: CIKM '20
isbn: 9781450368599
numpages: 4
location: Virtual Event, Ireland
DOI: 10.1145/3340531.3412123
URL: https://doi.org/10.1145/3340531.3412123

PUMA

The Impact of Negative Relevance Judgments on NDCG

Zusammenfassung

Tags

Nutzer

Kommentare und Rezensionenanzeigen / verbergen

Zitieren Sie diese Publikation

Mehr Zitationsstile

Suchen auf