Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Apr 4;24(1):128.
doi: 10.1186/s12859-023-05195-2.

Shine: A novel strategy to extract specific, sensitive and well-conserved biomarkers from massive microbial genomic datasets

Affiliations

Shine: A novel strategy to extract specific, sensitive and well-conserved biomarkers from massive microbial genomic datasets

Cong Ji et al. BMC Bioinformatics. .

Abstract

Background: Concentrations of the pathogenic microorganisms' DNA in biological samples are typically low. Therefore, DNA diagnostics of common infections are costly, rarely accurate, and challenging. Limited by failing to cover updated epidemic testing samples, computational services are difficult to implement in clinical applications without complex customized settings. Furthermore, the combined biomarkers used to maintain high conservation may not be cost effective and could cause several experimental errors in many clinical settings. Given the limitations of recent developed technology, 16S rRNA is too conserved to distinguish closely related species, and mosaic plasmids are not effective as well because of their uneven distribution across prokaryotic taxa.

Results: Here, we provide a computational strategy, Shine, that allows extraction of specific, sensitive and well-conserved biomarkers from massive microbial genomic datasets. Distinguished with simple concatenations with blast-based filtering, our method involves a de novo genome alignment-based pipeline to explore the original and specific repetitive biomarkers in the defined population. It can cover all members to detect newly discovered multicopy conserved species-specific or even subspecies-specific target probes and primer sets. The method has been successfully applied to a number of clinical projects and has the overwhelming advantages of automated detection of all pathogenic microorganisms without the limitations of genome annotation and incompletely assembled motifs. Using on our pipeline, users may select different configuration parameters depending on the purpose of the project for routine clinical detection practices on the website https://bioinfo.liferiver.com.cn with easy registration.

Conclusions: The proposed strategy is suitable for identifying shared phylogenetic markers while featuring low rates of false positive or false negative. This technology is suitable for the automatic design of minimal and efficient PCR primers and other types of detection probes.

Keywords: Biomarker; Conservation; Microbial genome; Sensitivity; Specificity.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
Schematic map of Shine. This new strategy was used to explore specific, sensitive and conserved biomarkers to cover all members of defined populations. The total pipeline was divided into two directions, i.e., to search specific regions preferentially or to search sensitive regions preferentially. At last, searching for consensus sequences for specific and repeated regions was to be available for the best sets of primers and probes
Fig. 2
Fig. 2
Illustration of the submodule preferentially searching for the specific regions. (1) the microbial target fragments were compared with the whole-genome sequences of one or more comparison strains one to one to obtain several residual fragments as the first-round cut fragments T1-Tn; (2) then compared with whole-genome sequences of the remaining comparison strains, to obtain the collection of residual cut fragments as the candidate specific regions of the microbial target fragments; and (3) the specific regions were then verified and obtained to determine
Fig. 3
Fig. 3
Illustration of the submodule preferentially searching for multicopy regions. (1) for searching candidate multicopy regions, internal alignments were performed on the microbial target fragments; (2) for verifying and obtaining the multicopy regions, including by determining the positions of each candidate multicopy region on the microorganism target fragments, obtaining the numbers of other candidate multicopy regions covering the positions of each base of the to-be-verified candidate multicopy regions, and calculating the median values of the copy numbers of the to-be-verified candidate multicopy regions

Similar articles

References

    1. Priyanka B, et al. A review on detection methods used for foodborne pathogens. Indian J Med Res. 2016;144(3):327–338. - PMC - PubMed
    1. Upadhyay A, Yang H, Zaman B. ZnO nanoflower-based NanoPCR as an efficient diagnostic tool for quick diagnosis of canine vector-borne pathogens. Pathogens. 2020;9(2):122. - PMC - PubMed
    1. Fox RTV. The present and future use of technology to detect plant pathogens to guide disease control in sustainable farming systems. Agr Ecosyst Environ. 1997;64(2):125–132.
    1. Anahtar MN, Shaw BM, Slater D, Byrne EH, Botti-Lodovico Y, Adams G, et al. Development of a qualitative real-time RT-PCR assay for the detection of SARS-CoV-2: a guide and case study in setting up an emergency-use, laboratory-developed molecular microbiological assay. Journal of Clinical Pathology, 2021: p. jclinpath-2020-207128. - PMC - PubMed
    1. Rajapaksha P, et al. A review of methods for the detection of pathogenic microorganisms. Analyst. 2019;144(2):396–411. - PubMed