ICTIR 2016 Slides - The Impact of Fixed-Cost Pooling Strategies on Test Collection Bias
Presentation, September 2016. Available at: https://www.researchgate.net/publication/308120269
The Impact of Fixed-Cost Pooling Strategies on Test Collection Bias
Aldo Lipani (1), Guido Zuccon (2), Mihai Lupu (1), Bevan Koopman (3), Allan Hanbury (1)
(1) Vienna University of Technology
(2) Queensland University of Technology
(3) Australian e-Health Research Centre
15 Sep 2016 ICTIR 2016 - Newark (DE)
In the context of building a test collection for an IR task modelled by precision-based IR evaluation measures (P@10 and RBP) in the presence of a fixed budget, which of the tested pooling strategies introduces the least pool bias?
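The two measures named in the question can be sketched in a few lines of Python. This is a minimal sketch assuming binary relevance; the function names and the example relevance vector are illustrative and do not come from the slides.

```python
from typing import Sequence


def precision_at_k(rels: Sequence[int], k: int = 10) -> float:
    """P@k: fraction of the top-k ranked documents that are relevant."""
    return sum(rels[:k]) / k


def rbp(rels: Sequence[int], p: float = 0.8) -> float:
    """Rank-biased precision: (1 - p) * sum_i rel_i * p**(i - 1), ranks from 1."""
    # enumerate() starts at 0, so p**i is the weight of rank i + 1.
    return (1 - p) * sum(r * p**i for i, r in enumerate(rels))


# Hypothetical run: binary relevance of the documents at ranks 1..10.
example = [1, 0, 1, 1, 0, 0, 0, 1, 0, 0]
print(precision_at_k(example, 10))
print(rbp(example, 0.8))
```

Both measures reward relevant documents near the top of the ranking, which is why unjudged documents at early ranks matter so much for them.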
The Pooling Method
What is the Pool Bias?
Pool bias is the effect whereby documents that were not selected for the pool built from the original runs are never considered relevant.
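A toy illustration of this effect, as a minimal Python sketch. It assumes a standard depth-k pool and treats unjudged documents as non-relevant; the runs, document IDs, and relevance set are hypothetical.

```python
def depth_k_pool(runs, k):
    """Union of the top-k documents of each pooled run."""
    return {doc for ranking in runs.values() for doc in ranking[:k]}


def precision_at_k(ranking, relevant, k):
    """P@k where any document outside `relevant` counts as non-relevant."""
    return sum(doc in relevant for doc in ranking[:k]) / k


# Hypothetical ground truth, normally unknown to the collection builder.
true_relevant = {"d1", "d3", "d7", "d9"}

# Runs that contributed to the pool.
pooled_runs = {
    "r1": ["d1", "d2", "d3", "d4"],
    "r2": ["d3", "d1", "d5", "d6"],
}

# A new run that did not contribute to the pool.
new_run = ["d7", "d1", "d9", "d2"]

pool = depth_k_pool(pooled_runs, k=3)
# Only pooled documents were ever judged, so only they can be relevant.
judged_relevant = true_relevant & pool

true_score = precision_at_k(new_run, true_relevant, k=3)
pooled_score = precision_at_k(new_run, judged_relevant, k=3)
print(true_score, pooled_score)
```

The new run retrieves relevant documents d7 and d9, but neither was judged, so its score under the pooled judgments is lower than its true score.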
Tested Pooling Strategies
Standard pooling strategies:
Take@10
[Figure: Take@10 illustrated on runs r1 to r5, with labels ρ1 to ρ10]
RBP-based pooling strategies*:
RBPBasedA@10&0.8
RBPBasedB@10&0.8
RBPBasedC@10&0.8
[Figure: each strategy illustrated on runs r1 to r5, with labels ρ1 to ρ10]
* A. Moffat, W. Webber, and J. Zobel. Strategic system comparisons via targeted relevance judgments.
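The slides name the RBP-based variants without defining them. The sketch below shows only the general idea behind such strategies, under stated assumptions: each document is scored by the sum of its RBP weights across runs, and the highest-scoring documents are judged until the budget is spent. Reading "@10&0.8" as a budget of 10 documents and p=0.8 is an assumption, as are the function name, the tie-breaking rule, and the toy runs. It is not claimed to reproduce variant A, B, or C.

```python
from collections import defaultdict


def rbp_weight_pool(runs, budget, p=0.8):
    """Select `budget` documents by summed RBP weight (illustrative only)."""
    scores = defaultdict(float)
    for ranking in runs.values():
        for i, doc in enumerate(ranking):
            # RBP weight of rank i + 1 is (1 - p) * p**i.
            scores[doc] += (1 - p) * p**i
    # Highest score first; ties broken by document ID (an assumption).
    ranked = sorted(scores, key=lambda d: (-scores[d], d))
    return ranked[:budget]


# Hypothetical runs, each a ranked list of document IDs.
runs = {
    "r1": ["d1", "d2", "d3"],
    "r2": ["d2", "d4", "d1"],
    "r3": ["d5", "d2", "d6"],
}
print(rbp_weight_pool(runs, budget=3))  # ['d2', 'd1', 'd5']
```

Unlike a fixed-depth pool, the number of judged documents here is set directly by the budget, and documents ranked highly by several runs are preferred.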
Methodology
Test Collections
Ad Hoc 2-8, Web 9, Web 2001, Robust 2005, Genomics 2005, Legal 2006, Medical 2011, Microblog 2011
Domains
News, Web, Genomics, Legal, Medical, Microblog
We compared
Standard pooling strategies
RBP-based pooling strategies (with p=0.80 and p=0.73)
Metrics of Error
Mean Absolute Error (MAE)
System Rank Error (SRE)
SRE with statistical significance (p<0.05) (SRE*)
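The first two error metrics can be sketched as follows. MAE is the usual mean absolute difference between scores. The slides do not define SRE; the version here assumes it is the sum of absolute changes in each run's rank position, which is one plausible reading and may differ from the authors' definition. SRE* is omitted because it needs per-topic scores and a significance test. All names and numbers are hypothetical.

```python
def mean_absolute_error(reference, estimated):
    """Mean absolute difference between per-run scores."""
    return sum(abs(reference[r] - estimated[r]) for r in reference) / len(reference)


def rank_positions(scores):
    """Map each run to its 1-based position when sorted by descending score."""
    ordered = sorted(scores, key=lambda r: (-scores[r], r))
    return {run: pos for pos, run in enumerate(ordered, start=1)}


def system_rank_error(reference, estimated):
    """Assumed definition: total absolute change in rank position."""
    ref = rank_positions(reference)
    est = rank_positions(estimated)
    return sum(abs(ref[r] - est[r]) for r in reference)


# Hypothetical scores: with the reference pool and with a reduced pool.
reference = {"r1": 0.50, "r2": 0.40, "r3": 0.30}
estimated = {"r1": 0.45, "r2": 0.20, "r3": 0.30}

mae = mean_absolute_error(reference, estimated)
sre = system_rank_error(reference, estimated)
print(mae, sre)
```

In this toy case r2 and r3 swap places under the reduced pool, so each moves by one position.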
Summary of the Results for MAE
[Figure: three panels titled P@10, RBP (p=0.80), and # Relevant Documents; strategies T, T+, A80, B80, C80, A73, B73, C73; legend 1st, 2nd, 3rd; axis from 2 to 16]
Conclusion
In the context of building a test collection for an IR task modelled by precision-based IR evaluation measures in the presence of a fixed budget, the best pooling strategy is
However, due to its limitations, we recommend
Thank you for your attention!
PoolBiasEstimators VisualPooling