Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jun 11;104(9):105428.
doi: 10.1016/j.psj.2025.105428. Online ahead of print.

Comparative analysis of codon usage bias and host adaptation across avian metapneumovirus genotypes

Affiliations

Comparative analysis of codon usage bias and host adaptation across avian metapneumovirus genotypes

Shuiqin Shi et al. Poult Sci. .

Abstract

Avian metapneumovirus (aMPV) is a significant pathogen affecting poultry worldwide, causing respiratory disease and economic losses. This study investigated the genetic and evolutionary differences among aMPV genotypes through codon usage bias analysis. Using whole-genome and F gene sequences, we assessed phylogenetic relationships, codon usage patterns, evolutionary pressures, and host adaptation. Our results indicate clear genotype differentiation in the phylogenetic tree, with Group C identified as the earliest diverging lineage of aMPV. The F gene exhibits independent evolutionary trajectories, reflecting distinct selective pressures. Codon usage bias varies across genotypes and is primarily driven by selection pressure, with Groups B and C experiencing stronger selective constraints. The F gene, crucial for viral entry and adaptation, undergoes intense selection, optimising codon usage for host adaptation. Host adaptation analysis reveals that aMPV is most suited to chickens. Additionally, Group B exhibits the largest population size; however, recent declines, particularly in this genotype, suggest that vaccine-driven selection pressure may be influencing aMPV population dynamics. These findings provide critical insights into aMPV evolution, highlighting the role of codon usage bias and selection pressure in shaping viral adaptation. Understanding these evolutionary mechanisms may aid in vaccine development and disease control strategies.

Keywords: Avian metapneumovirus; Codon usage bias; Genotype; Host adaptation; Phylogenetic analysis.

PubMed Disclaimer

Conflict of interest statement

Declaration of competing interest The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: Shuiqin Shi reports financial support was provided by Anqing Normal University. If there are other authors, they declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Fig 1
Fig. 1
Phylogenetic and temporal evolutionary trajectory of the F gene in aMPV. (a) Phylogenetic analysis based on the whole genome. (b) Phylogenetic analysis based on the F gene. Sequences were aligned using multiple alignment with fast Fourier transform, and ambiguous regions were removed with trimAL. The best-fit model was selected via ModelFinder. Maximum likelihood phylogenetic trees were inferred using IQ-TREE with ultrafast bootstraps in PhyloSuite. (c) Temporal evolutionary trajectory. Bayesian molecular clock analysis of aMPV was performed using F gene sequences. The BEST Model Test package selected the optimal nucleotide substitution model. An optimised relaxed clock model accounted for substitution rate variation, while the coalescent Skygrid model inferred population size changes.
Fig 2
Fig. 2
Relative synonymous codon usage, correspondence analysis, and linear discriminant analysis. (a) Relative synonymous codon usage (RSCU) analysis based on the whole genome. RSCU values of >1.6 indicate codon preference, while values of <0.6 indicate underrepresentation. (b) Principal component analysis (PCA) based on the whole genome. PCA reduced the high-dimensional RSCU dataset into two principal components (Axis 1 and 2) for clustering visualisation. (c) RSCU analysis based on the F gene. (d) PCA based on the F gene.
Fig 3
Fig. 3
Differences in codon usage characteristics between genotypes. (a) Analysis based on the whole genome. (b) Analysis based on the F gene. A: adenine, G: guanine, C: cytosine, T: thymine, GC %: percentage of G and C in a gene or genome, GC3 %: GC content at the third codon position, GC3s: GC content at the third position of synonymous codons. ENC: effective number of codons, which quantifies codon usage bias, with lower values indicating stronger codon usage bias. Dinucleotide abundance was assessed using O/E ratios, with values of >1.25 indicating overrepresentation and <0.78 indicating underrepresentation. *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001.
Fig 4
Fig. 4
Evolutionary forces of codon usage bias in aMPV genotypes. (a) Whole genome. (b) F gene. PR2: Parity Rule 2, ENC: effective number of codons, F: Fusion. The regression line was generated using the least squares method, plotting GC content at the first and second codon positions (GC12 %) against GC content at the third codon position (GC3) to assess the balance between selection and mutational pressure. The selection pressure acting on each genotype was estimated using (1 − slope of regression line) × 100 %.
Fig 5
Fig. 5
Host adaptation and codon usage optimisation of aMPV genotypes. (a) Whole genome. (b) F gene. GG: chicken (Gallus gallus), MG: turkey (Meleagris gallopavo), AP: duck (Anas platyrhynchos), and AA: goose (Anser anser). CAI: codon adaptation index, RCDI: relative codon deoptimisation index, SiD: similarity index. Higher CAI and lower RCDI/ SiD indicate stronger adaptation to host codon usage and higher translational efficiency. *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001.
Fig 6
Fig. 6
Population expansion of each genotype in the context of codon usage. The shaded intervals represent the 95 % highest posterior density of the product of generation time and effective population size, while the middle line represents the inferred median effective population size. Bayesian skyline analysis was performed using BEAST 2.7 to infer historical population dynamics. The BEST Model Test selected the nucleotide substitution model, an optimised relaxed clock model accounted for rate variation, and a coalescent Bayesian skyline model was used as the tree prior. Markov chain Monte Carlo sampling (100M iterations) was conducted, with convergence assessed in Tracer (effective sample size > 200).

Similar articles

References

    1. Bahir I., Fromer M., Prat Y., Linial M. Viral adaptation to host: a proteome-based analysis of codon usage and amino acid preferences. Mol. Syst. Biol. 2009;5:311. doi: 10.1038/msb.2009.71. - DOI - PMC - PubMed
    1. Bao Y., Yu M., Liu P., Hou F., Muhammad F., Wang Z., Li X., Zhang Z., Wang S., Chen Y., Cui H., Liu A., Qi X., Pan Q., Zhang Y., Gao L., Li K., Liu C., He X., Wang X., Gao Y. Novel inactivated subtype B avian metapneumovirus vaccine induced humoral and cellular immune responses. Vaccines (Basel) 2020;8 doi: 10.3390/vaccines8040762. - DOI - PMC - PubMed
    1. Bayon-Auboyer M.H., Arnauld C., Toquin D., Eterradossi N. Nucleotide sequences of the F, L and G protein genes of two non-A/non-B avian pneumoviruses (APV) reveal a novel APV subgroup. J. Gen. Virol. 2000;81:2723–2733. doi: 10.1099/0022-1317-81-11-2723. - DOI - PubMed
    1. Bouckaert R.R., Drummond A.J. bModelTest: bayesian phylogenetic site model averaging and model comparison. BMC Evol. Biol. 2017;17:42. doi: 10.1186/s12862-017-0890-6. - DOI - PMC - PubMed
    1. Brown P.A., Lemaitre E., Briand F.X., Courtillon C., Guionie O., Allee C., Toquin D., Bayon-Auboyer M.H., Jestin V., Eterradossi N. Molecular comparisons of full length metapneumovirus (MPV) genomes, including newly determined French AMPV-C and -D isolates, further supports possible subclassification within the MPV Genus. PLoS One. 2014;9 doi: 10.1371/journal.pone.0102740. - DOI - PMC - PubMed