Table 4 RRF@10 values, as percentages, reported as \(\bar{x}\) (\(\hat{s}\)) for all model-dataset combinations. For every dataset, the best result is in bold. The last row gives the median (MAD) across models for each dataset.

From: Performance of active learning models for screening prioritization in systematic reviews: a simulation study into the Average Time to Discover relevant records

 

| Model | Nudging | PTSD | Software | ACE | Virus | Wilson |
| --- | --- | --- | --- | --- | --- | --- |
| SVM + TF-IDF | 60.2 (3.12) | 98.6 (1.40) | 99.0 (0.00) | 86.2 (5.25) | 73.4 (1.62) | 90.6 (1.17) |
| NB + TF-IDF | 65.3 (2.61) | 99.6 (0.95) | 98.2 (0.34) | **90.5 (1.40)** | **73.9 (1.70)** | 87.3 (2.55) |
| RF + TF-IDF | 53.6 (2.71) | 94.8 (1.60) | 99.0 (0.00) | 82.3 (2.75) | 62.1 (3.19) | 86.7 (5.82) |
| LR + TF-IDF | 62.1 (2.59) | **99.8 (0.70)** | 99.0 (0.00) | 88.5 (5.16) | 73.7 (1.48) | 89.1 (2.30) |
| SVM + D2V | 67.3 (3.00) | 97.8 (1.12) | **99.3 (0.44)** | 84.2 (2.78) | 73.6 (2.54) | **91.5 (4.16)** |
| RF + D2V | 62.6 (5.47) | 97.1 (1.90) | 99.2 (0.34) | 80.8 (5.72) | 67.3 (3.19) | 75.5 (14.35) |
| LR + D2V | **67.5 (2.59)** | 98.6 (1.40) | 99.0 (0.00) | 81.7 (1.81) | 70.6 (2.21) | 90.6 (5.00) |
| median (MAD) | 62.6 (3.89) | 98.6 (1.60) | 99.0 (0.00) | 84.2 (3.71) | 73.4 (0.70) | 89.1 (2.70) |
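
As an illustration of how the median (MAD) row can be read, the following minimal sketch recomputes it for the ACE column from the rounded per-model values in the table. The scaling constant 1.4826 is an assumption: it is a common convention that makes the MAD comparable to a standard deviation under normality, but the table does not state which MAD definition was used. The function name `scaled_mad` is hypothetical. Because the inputs here are rounded to one decimal, recomputing other columns this way need not match the published row to the last digit.

```python
from statistics import median

# RRF@10 means (%) for the ACE column, copied from the table,
# in row order: SVM, NB, RF, LR (TF-IDF), then SVM, RF, LR (D2V).
ace = [86.2, 90.5, 82.3, 88.5, 84.2, 80.8, 81.7]


def scaled_mad(values, constant=1.4826):
    """Median absolute deviation around the median, times `constant`.

    The default constant is an assumption, not something the table states.
    """
    center = median(values)
    deviations = [abs(v - center) for v in values]
    return constant * median(deviations)


col_median = median(ace)
col_mad = scaled_mad(ace)

# Formatted the same way as the table's summary row.
summary = f"{col_median:.1f} ({col_mad:.2f})"
print(summary)  # prints "84.2 (3.71)"
```

Under the assumed constant, the result agrees with the ACE entry of the median (MAD) row.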