Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021;126(6):5285-5304.
doi: 10.1007/s11192-021-03900-7. Epub 2021 Apr 18.

Day-to-day discovery of preprint-publication links

Affiliations

Day-to-day discovery of preprint-publication links

Guillaume Cabanac et al. Scientometrics. 2021.

Abstract

Preprints promote the open and fast communication of non-peer reviewed work. Once a preprint is published in a peer-reviewed venue, the preprint server updates its web page: a prominent hyperlink leading to the newly published work is added. Linking preprints to publications is of utmost importance as it provides readers with the latest version of a now certified work. Yet leading preprint servers fail to identify all existing preprint-publication links. This limitation calls for a more thorough approach to this critical information retrieval task: overlooking published evidence translates into partial and even inaccurate systematic reviews on health-related issues, for instance. We designed an algorithm leveraging the Crossref public and free source of bibliographic metadata to comb the literature for preprint-publication links. We tested it on a reference preprint set identified and curated for a living systematic review on interventions for preventing and treating COVID-19 performed by international collaboration: the COVID-NMA initiative (covid-nma.com). The reference set comprised 343 preprints, 121 of which appeared as a publication in a peer-reviewed journal. While the preprint servers identified 39.7% of the preprint-publication links, our linker identified 90.9% of the expected links with no clues taken from the preprint servers. The accuracy of the proposed linker is 91.5% on this reference set, with 90.9% sensitivity and 91.9% specificity. This is a 16.26% increase in accuracy compared to that of preprint servers. We release this software as supplementary material to foster its integration into preprint servers' workflows and enhance a daily preprint-publication chase that is useful to all readers, including systematic reviewers. This preprint-publication linker currently provides day-to-day updates to the biomedical experts of the COVID-NMA initiative.

Keywords: COVID-19; Data linking; Living systematic review; Preprint; Publication.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Crossref documentation on preprint metadata updates expected from content publishers. Excerpt of the Introduction to posted content (including preprints) available from https://www.crossref.org/education/content-registration/content-types-intro/posted-content-includes-preprints/
Fig. 2
Fig. 2
The medRxiv preprint doi:10.1101/2020.04.07.20056424 with linked paper in JAMA Network Open
Fig. 3
Fig. 3
Excerpt of the listing of all medRxiv preprints, available from https://api.biorxiv.org/details/medrxiv/2000-01-01/3000-01-01/1 in JSON format. Each preprint comes with its associated metadata (e.g., title, authors, version) and is optionally linked to a publication (e.g., see the third record with 10.1136/bmj.l7078)
Fig. 4
Fig. 4
Validated preprint–publication links shown on the ‘COVID19 Preprint Tracker’ used by COVID-NMA and hosted at https://www.irit.fr/~Guillaume.Cabanac/covid19-preprint-tracker
Fig. 5
Fig. 5
Evaluation of the preprint servers on the 343 preprint reference set, as of 23 October 2020
Fig. 6
Fig. 6
Evaluation of the preprint–publication linker on the 343 preprint reference set, as of 23 October 2020

References

    1. Abdill, R. J., Adamowicz, E. M., & Blekhman, R. (2020). International authorship and collaboration across biorxiv preprints. eLife, 9, e58496. 10.7554/elife.58496 - PMC - PubMed
    1. Abdill, R. J., & Blekhman, R. (2019). Tracking the popularity and outcomes of all bioRxiv preprints. eLife, 8, e45133. 10.7554/elife.45133 - PMC - PubMed
    1. Anderson KR. bioRxiv: Trends and analysis of five years of preprints. Learned Publishing. 2020;33(2):104–109. doi: 10.1002/leap.1265. - DOI
    1. Baumann, A., & Wohlrabe, K. (2020). Where have all the working papers gone? Evidence from four major economics working paper series. Scientometrics, 124(3), 2433-2441. 10.1007/s11192-020-03570-x
    1. Boutron I, Chaimani A, Devane D, Meerpohl JJ, Rada G, Hróbjartsson A, Ravaud P. Interventions for the prevention and treatment of COVID-19: a living mapping of research and living network meta-analysis. Cochrane Database of Systematic Reviews. 2020 doi: 10.1002/14651858.cd013769. - DOI

LinkOut - more resources