Rank-Biased Centroids (RBC)

The Rank-Biased Centroids (RBC) rank fusion method to combine multiple-rankings of objects.

This code implements the RBC rank fusion method, as described in:

bibtex @inproceedings{DBLP:conf/sigir/BaileyMST17, author = {Peter Bailey and Alistair Moffat and Falk Scholer and Paul Thomas}, title = {Retrieval Consistency in the Presence of Query Variations}, booktitle = {Proceedings of the 40th International {ACM} {SIGIR} Conference on Research and Development in Information Retrieval, Shinjuku, Tokyo, Japan, August 7-11, 2017}, pages = {395--404}, publisher = {{ACM}}, year = {2017}, url = {https://doi.org/10.1145/3077136.3080839}, doi = {10.1145/3077136.3080839}, timestamp = {Wed, 25 Sep 2019 16:43:14 +0200}, biburl = {https://dblp.org/rec/conf/sigir/BaileyMST17.bib}, }

The fundamental step in the working of RBC is the usage a persistence parameter (p or phi) to to fusion multiple ranked lists based only on rank information. Larger values of p give higher importance to elements at the top of each ranking. From the paper:

As extreme values, consider p = 0 and p = 1. When p = 0, the agents only ever examine the first item in each of the input rankings, and the fused output is by decreasing score of firstrst preference; this is somewhat akin to a first-past-the-post election regime. When p = 1, each agent examines the whole of every list, and the fused ordering is determined by the number of lists that contain each item – a kind of "popularity count" of each item across the input sets. In between these extremes, the expected depth reached by the agents viewing the rankings is given by 1/(1 − p). For example, when p = 0.9, on average the first 10 items in each ranking are being used to contribute to the fused ordering; of course, in aggregate, across the whole universe of agents, all of the items in every ranking contribute to the overall outcome.

Example fusion

For example (also taken from the paper) for diffent rank orderings (R1-R4) of items A-G:

|Rank| R1 | R2 | R3 | R4 | | ---| --- | --- | --- | --- | | 1 | A | B | A | G | | 2 | D | D | B | D | | 3 | B | E | D | E | | 4 | C | C | C | A | | 5 | G | - | G | F | | 6 | F | - | F | C | | 7 | - | - | E | - |

Depending on the persistence parameter p will result in different output orderings based on each items accumulated weights:

|Rank| p=0.6 | p=0.8 | p=0.9 | | ---| ------ | ------ | ------ | | 1 | A(0.89) | D(0.61) | D(0.35) | | 2 | D(0.89) | A(0.50) | C(0.28) | | 3 | B(0.89) | B(0.49) | A(0.27) | | 4 | G(0.89) | C(0.37) | B(0.27) | | 5 | E(0.89) | G(0.37) | G(0.23) | | 6 | C(0.89) | E(0.31) | E(0.22) | | 7 | F(0.89) | F(0.21) | F(0.18) |

Code Example:

```rust use rankbiasedcentroids::rbcwithscores;

let r1 = vec!['A', 'D', 'B', 'C', 'G', 'F']; let r2 = vec!['B', 'D', 'E', 'C']; let r3 = vec!['A', 'B', 'D', 'C', 'G', 'F', 'E']; let r4 = vec!['G', 'D', 'E', 'A', 'F', 'C']; let p = 0.9; let result = rbcwithscores(vec![r1, r2, r3, r4], p).unwrap(); let expected = vec![ ('D', 0.35), ('C', 0.28), ('A', 0.27), ('B', 0.27), ('G', 0.23), ('E', 0.22), ('F', 0.18), ]; for ((c, s), (expectedc, expectedscore)) in result.intoiter().zip(expected.intoiter()) { asserteq!(c, expectedc); approx::assertabsdiffeq!(s, expectedscore, epsilon = 0.005); }

```

License

MIT