1 Outline

2 Precision and recall

3 Precision and recall

4 Precision-recall curve

5 F-measure

6 Any problems with the metrics so far?

7 Outline

8 Precision and recall

9 Other common metrics

10 Any problems with the metrics so far?

11 Outline

12 User search behavior

13 Discounted cumulative gain (DCG)

14 Rank-biased precision (RBP)

15 Rank-biased precision (RBP)

16 Expected reciprocal rank (ERR)

17 Expected reciprocal rank (ERR)

18 Offline evaluation summary

19 Materials

20 Materials
