Skip to main content

Table 3 WSS@95 values (as a percentage \(\bar{x} (\hat{s})\)) for all model-dataset combinations. For every dataset, the best results are in bold. Median (MAD) is given for all datasets

From: Performance of active learning models for screening prioritization in systematic reviews: a simulation study into the Average Time to Discover relevant records

 

Nudging

PTSD

Software

ACE

Virus

Wilson

SVM + TF-IDF

66.2 (2.90)

91.0 (0.41)

92.0 (0.10)

75.8 (1.95)

69.7 (0.81)

79.9 (2.09)

NB + TF-IDF

71.7 (1.37)

91.7 (0.27)

92.3 (0.08)

82.9 (0.99)

71.2 (0.62)

83.4 (0.89)

RF + TF-IDF

64.9 (2.50)

84.5 (3.38)

90.5 (0.34)

71.3 (4.03)

63.9 (3.54)

81.6 (3.35)

LR + TF-IDF

66.9 (4.01)

91.7 (0.18)

92.0 (0.10)

81.1 (1.31)

70.3 (0.65)

80.5 (0.65)

SVM + D2V

70.9 (1.68)

90.6 (0.73)

92.0 (0.21)

78.3 (1.92)

70.7 (1.76)

82.7 (1.44)

RF + D2V

66.3 (3.25)

88.2 (3.23)

91.0 (0.55)

68.6 (7.11)

67.2 (3.44)

77.9 (3.43)

LR + D2V

71.6 (1.66)

90.1 (0.63)

91.7 (0.13)

77.4 (1.03)

70.4 (1.34)

84.0 (0.77)

Median (MAD)

66.9 (3.05)

90.6 (1.53)

92.0 (0.47)

77.4 (5.51)

70.3 (0.90)

81.6 (2.48)