When rank order isn't enough: New statistical-significance-aware correlation measures

Kutlu M.; Elsayed T.; Hasanain M.; Lease M.

Author	Kutlu M.
Author	Elsayed T.
Author	Hasanain M.
Author	Lease M.
Available date	2020-02-24T08:57:14Z
Publication Date	2018
Publication Name	International Conference on Information and Knowledge Management, Proceedings
Resource	Scopus
URI	http://dx.doi.org/10.1145/3269206.3271751
URI	http://hdl.handle.net/10576/13016
Abstract	Because it is expensive to construct test collections for Cranfield-based evaluation of information retrieval systems, a variety of lower-cost methods have been proposed. The reliability of these methods is often validated by measuring rank correlation (e.g., Kendall's t) between known system rankings on the full test collection vs. observed system rankings on the lower-cost one. However, existing rank correlation measures do not consider the statistical significance of score differences between systems in the observed rankings. To address this, we propose two statistical-significance-aware rank correlation measures, one of which is a head-weighted version of the other. We first show empirical differences between our proposed measures and existing ones. We then compare the measures while benchmarking four system evaluation methods: pooling, crowdsourcing, evaluation with incomplete judgments, and automatic system ranking. We show that use of our measures can lead to different experimental conclusions regarding reliability of alternative low-cost evaluation methods.
Sponsor	This work was made possible by NPRP grant# NPRP 7-1313-1-245 from the Qatar National Research Fund (a member of Qatar Foundation). The statements made herein are solely the responsibility of the authors.
Language	en
Publisher	Association for Computing Machinery
Subject	Evaluation IR System Ranking Rank Correlation
Title	When rank order isn't enough: New statistical-significance-aware correlation measures
Type	Conference Paper
Pagination	397 - 406

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

Computer Science & Engineering [‎2127‎ items ]

Show simple item record

When rank order isn't enough: New statistical-significance-aware correlation measures

Files in this item

This item appears in the following Collection(s)

Video