Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Feb 26;22(1):95.
doi: 10.1186/s12859-021-04044-4.

Biomedical articles share annotations with their citation neighbors

Affiliations

Biomedical articles share annotations with their citation neighbors

Raul Rodriguez-Esteban. BMC Bioinformatics. .

Abstract

Background: Numerous efforts have been poured into annotating the wealth of knowledge contained in biomedical articles. Thanks to such efforts, it is now possible to quantitatively explore relations between these annotations and the citation network at large scale.

Results: With the aid of several large and small annotation databases, this study shows that articles share annotations with their citation neighborhood to the point that the neighborhood's most common annotations are likely to be those appearing in the article.

Conclusions: These findings posit that an article's citation neighborhood defines to a large extent the article's annotated content. Thus, citations should be considered as a foundation for future knowledge management and annotation of biomedical articles.

Keywords: Biomedical database; Citation network; Document annotation.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
Recall, precision and mean average precision (MAP) for the gene2pubmed (first row), dbSNP (second row), MeSH (third row) and UniProt (fourth row) datasets for first-degree neighbors with annotations (first column), including second-degree neighbors with annotations (second column), and randomized network (third column)
Fig. 2
Fig. 2
a Recall for BC2GN annotations based on the number of annotated neighbors. A sigmoidal function fit (R2 = 0.61) with 95% confidence intervals is shown alongside (dotted lines). The sigmoidal function was chosen because it is a monotonic function that is bounded within a range. b Including second-degree neighbors (sigmoidal function fit with R2 = 0.85). c For the randomized network
Fig. 3
Fig. 3
Recall, precision and MAP for the BC2GN dataset: a for first-degree neighbors with annotations, b including second-degree neighbors with annotations, and c for the randomized network. Values shown are noisy due to the low number of samples associated to each data point
Fig. 4
Fig. 4
Recall, precision and MAP for MeSH term overlap analysis on the a NLM2007 and b L1000 datasets using first-degree neighbors with annotations. Values are noisy for small datasets, such as these, due to the low number of samples associated to each data point. Thus, values associated to NLM2007, with 200 articles, are noisier than for L1000, with 1000 articles
Fig. 5
Fig. 5
Mean average precision (MAP) for the gene2pubmed datasets for first-degree neighbors with annotations: a considering citing and cited references. b Considering citing references alone, and c cited references alone

Similar articles

Cited by

References

    1. Delbecque T, Zweigenbaum P. Using co-authoring and cross-referencing information for MEDLINE indexing. AMIA Annu Symp Proc. 2010;13(2010):147–151. - PMC - PubMed
    1. Mao Y, Lu Z. MeSH now: automatic MeSH indexing at PubMed scale via learning to rank. J Biomed Semant. 2017;8(1):15. doi: 10.1186/s13326-017-0123-3. - DOI - PMC - PubMed
    1. Peroni S., Shotton D., Vitali F. One year of the opencitations corpus. In: d'Amato C, et al. (eds) The semantic web—ISWC 2017. ISWC 2017. Lecture Notes in Computer Science 2017; 10588. Springer, Cham.
    1. Rodriguez-Esteban R. Semantic persistence of ambiguous biomedical names in the citation network. Bioinformatics. 2019;36(7):2224–2228. doi: 10.1093/bioinformatics/btz923. - DOI - PubMed
    1. Arighi CN, Lu Z, Krallinger M, et al. Overview of the biocreative III workshop. BMC Bioinform. 2011;12(Suppl 8):S1. doi: 10.1186/1471-2105-12-S8-S1. - DOI - PMC - PubMed

LinkOut - more resources