F1000Res. 2025 Apr 22;13:ELIXIR-1547.
doi: 10.12688/f1000research.158264.3. eCollection 2024.

Using text-mining to measure the scientific impact and legacy of ELIXIR, a distributed research infrastructure for life science data

Francesca De Leo et al. F1000Res.

Abstract

Background: ELIXIR is a pan-European, publicly funded research infrastructure dedicated to life science data. As such, it must demonstrate public value to its funders and stakeholders. We present methods to inventory the research publications that have received ELIXIR funding and support, together with their citation metrics, which serve as performance metrics for these audiences.

Methods: To overcome the challenges inherent in ELIXIR's distributed structure, and the fact that those publishing ELIXIR-supported work typically work only part-time on ELIXIR matters, we present a semi-automated approach consisting of text-mining followed by manual curation. A country-level case study (ELIXIR Italy) refines and expands the methods, notably by introducing more granularity in the curation process (e.g. considering all national-level grants and examining affiliations to report publications per institute) and by additionally examining the scientific impact of the resources developed and operated by the Italian Node of ELIXIR.
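The semi-automated approach (text-mining followed by manual curation) can be illustrated with a minimal sketch. The abstract does not state the actual query or curation workflow, so everything here is an assumption made for illustration: the search terms, the use of Europe PMC's `ACK_FUND` and `FIRST_PDATE` search fields, and the hypothetical helper names `build_harvest_url`, `curation_queue` and `apply_decisions`. The sketch only builds the search URL and filters records; it makes no network call.

```python
from urllib.parse import urlencode

# Public Europe PMC REST search endpoint.
EUROPE_PMC_SEARCH = "https://www.ebi.ac.uk/europepmc/webservices/rest/search"


def build_harvest_url(terms, first_day, last_day, page_size=100):
    """Build a search URL for one monthly 'harvest' (illustrative query only)."""
    # Assumption: look for the terms in acknowledgement/funding sections.
    mentions = " OR ".join(f'ACK_FUND:"{term}"' for term in terms)
    # Assumption: restrict to first-publication dates within the month.
    query = f"({mentions}) AND FIRST_PDATE:[{first_day} TO {last_day}]"
    params = {"query": query, "format": "json", "pageSize": page_size}
    return f"{EUROPE_PMC_SEARCH}?{urlencode(params)}"


def curation_queue(harvested, already_curated_ids):
    """Keep only harvested records that no curator has assessed yet."""
    return [rec for rec in harvested if rec["id"] not in already_curated_ids]


def apply_decisions(queue, decisions):
    """Return the records a curator accepted; undecided ones are held back."""
    return [rec for rec in queue if decisions.get(rec["id"]) == "accept"]
```

Keeping the automated harvest separate from the curator's accept/reject decisions mirrors the two stages named in the Methods: the text-mining step may over-collect, and only manually accepted records would enter the inventory.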

Results: Overall, the methods described in this article have been shown to: (1) be repeatable with acceptable levels of accuracy and consistency (notably across curators); (2) require reasonable effort to curate the monthly 'harvests' of publications obtained by text-mining; and (3) be well adapted to ELIXIR's distributed nature.

Conclusions: Concrete examples are provided of downstream uses of the inventoried publications and their citations, both for ELIXIR as a whole and for the Italian case study. Limitations of the methods are discussed, particularly the challenges associated with using an 'Open literature' database (Europe PMC) for the text-mining, and the constraints related to curation capacity. The methods, along with the lessons learned during their development, are sufficiently generic and pragmatic to be readily adapted by other, similar research infrastructures.

Keywords: KPI; bioinformatics; database; funder; literature; metric; performance; resource.

Conflict of interest statement

No competing interests were disclosed.

Figures

Figure 1. Publications and citations supported by ELIXIR (2011–2023) in Open literature (EuropePMC).
Figure 2. Publications and citations supported by ELIXIR Italy (2011–2023) in Open literature (EuropePMC).
Figure 3. Distribution of ELIXIR Italy publications (double counted) across its Node institutes.
Acronyms: see https://elixir-italy.org/about/members/.
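The "double counted" qualifier in Figure 3 can be made concrete with a small sketch. It assumes, as an interpretation not spelled out in the caption, that a publication with authors from several Node institutes is counted once for each of those institutes. The record layout, the institute names and the function name `count_per_institute` are hypothetical.

```python
from collections import Counter


def count_per_institute(publications):
    """Count publications per institute, once for every institute involved."""
    counts = Counter()
    for pub in publications:
        # set() ensures one publication adds at most 1 to a given institute.
        for institute in set(pub["institutes"]):
            counts[institute] += 1
    return counts


# Hypothetical records: P1 is shared by two institutes.
publications = [
    {"id": "P1", "institutes": ["Institute A", "Institute B"]},
    {"id": "P2", "institutes": ["Institute A"]},
]
per_institute = count_per_institute(publications)
```

Under this assumption the per-institute counts sum to 3 although there are only 2 distinct publications, so such a distribution should not be summed to obtain a Node-wide total.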
