Orphan Genes Shared by Pathogenic Genomes Are More Associated with Bacterial Pathogenicity
- PMID: 30801025
- PMCID: PMC6372840
- DOI: 10.1128/mSystems.00290-18
Orphan Genes Shared by Pathogenic Genomes Are More Associated with Bacterial Pathogenicity
Abstract
Orphan genes (also known as ORFans [i.e., orphan open reading frames]) are new genes that enable an organism to adapt to its specific living environment. Our focus in this study is to compare ORFans between pathogens (P) and nonpathogens (NP) of the same genus. Using the pangenome idea, we have identified 130,169 ORFans in nine bacterial genera (505 genomes) and classified these ORFans into four groups: (i) SS-ORFans (P), which are only found in a single pathogenic genome; (ii) SS-ORFans (NP), which are only found in a single nonpathogenic genome; (iii) PS-ORFans (P), which are found in multiple pathogenic genomes; and (iv) NS-ORFans (NP), which are found in multiple nonpathogenic genomes. Within the same genus, pathogens do not always have more genes, more ORFans, or more pathogenicity-related genes (PRGs)-including prophages, pathogenicity islands (PAIs), virulence factors (VFs), and horizontal gene transfers (HGTs)-than nonpathogens. Interestingly, in pathogens of the nine genera, the percentages of PS-ORFans are consistently higher than those of SS-ORFans, which is not true in nonpathogens. Similarly, in pathogens of the nine genera, the percentages of PS-ORFans matching the four types of PRGs are also always higher than those of SS-ORFans, but this is not true in nonpathogens. All of these findings suggest the greater importance of PS-ORFans for bacterial pathogenicity. IMPORTANCE Recent pangenome analyses of numerous bacterial species have suggested that each genome of a single species may have a significant fraction of its gene content unique or shared by a very few genomes (i.e., ORFans). We selected nine bacterial genera, each containing at least five pathogenic and five nonpathogenic genomes, to compare their ORFans in relation to pathogenicity-related genes. Pathogens in these genera are known to cause a number of common and devastating human diseases such as pneumonia, diphtheria, melioidosis, and tuberculosis. Thus, they are worthy of in-depth systems microbiology investigations, including the comparative study of ORFans between pathogens and nonpathogens. We provide direct evidence to suggest that ORFans shared by more pathogens are more associated with pathogenicity-related genes and thus are more important targets for development of new diagnostic markers or therapeutic drugs for bacterial infectious diseases.
Keywords: ORFan; horizontal gene transfer; orphan gene; pathogenic island; pathogenicity; prophage; virulence factor.
Figures



Similar articles
-
[Plasticity of bacterial genomes: pathogenicity islands and the locus of enterocyte effacement (LEE)].Berl Munch Tierarztl Wochenschr. 2004 Mar-Apr;117(3-4):116-29. Berl Munch Tierarztl Wochenschr. 2004. PMID: 15046458 Review. German.
-
Analysis of singleton ORFans in fully sequenced microbial genomes.Proteins. 2003 Nov 1;53(2):241-51. doi: 10.1002/prot.10423. Proteins. 2003. PMID: 14517975
-
Population diversity of ORFan genes in Escherichia coli.Genome Biol Evol. 2012;4(11):1176-87. doi: 10.1093/gbe/evs081. Genome Biol Evol. 2012. PMID: 23034216 Free PMC article.
-
Common and pathogen-specific virulence factors are different in function and structure.Virulence. 2013 Aug 15;4(6):473-82. doi: 10.4161/viru.25730. Epub 2013 Jul 15. Virulence. 2013. PMID: 23863604 Free PMC article.
-
Pathogenicity islands and the evolution of microbes.Annu Rev Microbiol. 2000;54:641-79. doi: 10.1146/annurev.micro.54.1.641. Annu Rev Microbiol. 2000. PMID: 11018140 Review.
Cited by
-
Categorization of Orthologous Gene Clusters in 92 Ascomycota Genomes Reveals Functions Important for Phytopathogenicity.J Fungi (Basel). 2021 Apr 27;7(5):337. doi: 10.3390/jof7050337. J Fungi (Basel). 2021. PMID: 33925458 Free PMC article.
-
The Lost and Found: Unraveling the Functions of Orphan Genes.J Dev Biol. 2023 Jun 13;11(2):27. doi: 10.3390/jdb11020027. J Dev Biol. 2023. PMID: 37367481 Free PMC article. Review.
-
Validation of predicted anonymous proteins simply using Fisher's exact test.Bioinform Adv. 2021 Nov 15;1(1):vbab034. doi: 10.1093/bioadv/vbab034. eCollection 2021. Bioinform Adv. 2021. PMID: 36700095 Free PMC article.
-
Cadmium stress triggers significant metabolic reprogramming in Enterococcus faecium CX 2-6.Comput Struct Biotechnol J. 2021 Oct 18;19:5678-5687. doi: 10.1016/j.csbj.2021.10.021. eCollection 2021. Comput Struct Biotechnol J. 2021. PMID: 34765088 Free PMC article.
-
Distinct but Intertwined Evolutionary Histories of Multiple Salmonella enterica Subspecies.mSystems. 2020 Jan 14;5(1):e00515-19. doi: 10.1128/mSystems.00515-19. mSystems. 2020. PMID: 31937675 Free PMC article.
References
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous