Fig. 5From: Previously unidentified duplicate registrations of clinical trials: an exploratory analysis of registry data worldwideEstimated number of unknown duplicates. The number of unknown duplicates estimated by randomly sampling from the pairs of records that are not known to be duplicates. The investigated range of title similarity scores (0.7–1.0) contains 76 % of all known duplicatesBack to article page