Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 May 4;23(9):5121.
doi: 10.3390/ijms23095121.

Deciphering Pleiotropic Signatures of Regulatory SNPs in Zea mays L. Using Multi-Omics Data and Machine Learning Algorithms

Affiliations

Deciphering Pleiotropic Signatures of Regulatory SNPs in Zea mays L. Using Multi-Omics Data and Machine Learning Algorithms

Ataul Haleem et al. Int J Mol Sci. .

Abstract

Maize is one of the most widely grown cereals in the world. However, to address the challenges in maize breeding arising from climatic anomalies, there is a need for developing novel strategies to harness the power of multi-omics technologies. In this regard, pleiotropy is an important genetic phenomenon that can be utilized to simultaneously enhance multiple agronomic phenotypes in maize. In addition to pleiotropy, another aspect is the consideration of the regulatory SNPs (rSNPs) that are likely to have causal effects in phenotypic development. By incorporating both aspects in our study, we performed a systematic analysis based on multi-omics data to reveal the novel pleiotropic signatures of rSNPs in a global maize population. For this purpose, we first applied Random Forests and then Markov clustering algorithms to decipher the pleiotropic signatures of rSNPs, based on which hierarchical network models are constructed to elucidate the complex interplay among transcription factors, rSNPs, and phenotypes. The results obtained in our study could help to understand the genetic programs orchestrating multiple phenotypes and thus could provide novel breeding targets for the simultaneous improvement of several agronomic traits.

Keywords: gene expression profiles; hierarchical network model; incremental feature selection; markov clustering; multi-omics; random forest; regulatory SNPs.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Overview of the analyses pipeline highlighting key machine learning algorithms for the identification of pleiotropic signatures of regulatory SNPs (rSNPs) to establish complex interplay of transcription factors (TFs), rSNPs and multiple phenotypes. The genotypic data (A1), consisting of 1.03 m SNP markers was filtered for MAF (<0.05) and 31,000 SNPs found within promoter regions of 37,407 maize genes were considered for association analysis with 20 quantitative agronomic traits (A2). RNA-seq (A3) dataset was utilized for the validation of pleiotropic rSNPs on the underlying gene expression. As of first step in the data analysis, rSNPs were identified (B) for their impact on the gain or loss of TFBSs, after which their association with multiple phenotypes was determined using random forest (RF) using the Boruta algorithm and incremental feature selection (IFS) technique (C). Pleiotropic signatures of rSNPs were then established by pruning weaker connections in the overall network into smaller non-overlapping fully connected clusters, using Markov clustering (MCL) algorithm (D) which provided the basis for the construction of hierarchical network models with three distinct layers modelling the complex interplay of TFs, rSNPs and multiple phenotype (B). Further, the boxplots show the impact of pleiotropic rSNPs at gene expression level as a function of gain or loss of TFBSs (E).
Figure 2
Figure 2
A plot to show the change of R2values versus the number of rSNPs in association with the phenotype pollen shed. The incremental feature selection (IFS) curves were drawn using the ranking of rSNPs. The R2 value reached a peak when considering the first 90 rSNPs. These rSNPs were used for the further analysis of this phenotype.
Figure 3
Figure 3
Number of associated rSNPs determined by the incremental feature selection (IFS) procedure for each phenotype and their overlap represented in matrix layouts using the UpSet technique [67]. Black circles in the matrix layout are related to the phenotypes that are part of the intersection. For the sake of clarity, not all intersections are displayed.
Figure 4
Figure 4
Hierarchical network model constructed using Cluster-7 to elucidate the complex interplay among TFs−rSNPs(genes)−Phenotypes. (AC) show the significant changes in the gene expression values resulting from the consequences of pleiotropic rSNPs. (D) Hierarchical network model with three layers.

Similar articles

Cited by

References

    1. Shiferaw B., Prasanna B.M., Hellin J., Bänziger M. Crops that feed the world 6. Past successes and future challenges to the role played by maize in global food security. Food Secur. 2011;3:307–327. doi: 10.1007/s12571-011-0140-5. - DOI
    1. Prasanna B.M., Palacios-Rojas N., Hossain F., Muthusamy V., Menkir A., Dhliwayo T., Ndhlela T., San Vicente F., Nair S.K., Vivek B.S., et al. Molecular breeding for nutritionally enriched maize: Status and prospects. Front. Genet. 2020;10:1392. doi: 10.3389/fgene.2019.01392. - DOI - PMC - PubMed
    1. Ortiz-Monasterio J.I., Palacios-Rojas N., Meng E., Pixley K., Trethowan R., Pena R. Enhancing the mineral and vitamin content of wheat and maize through plant breeding. J. Cereal Sci. 2007;46:293–307. doi: 10.1016/j.jcs.2007.06.005. - DOI
    1. Bänziger M., Betrán F., Lafitte H. Efficiency of high-nitrogen selection environments for improving maize for low-nitrogen target environments. Crop. Sci. 1997;37:1103–1109. doi: 10.2135/cropsci1997.0011183X003700040012x. - DOI
    1. Suwarno W.B., Pixley K.V., Palacios-Rojas N., Kaeppler S.M., Babu R. Genome-wide association analysis reveals new targets for carotenoid biofortification in maize. Theor. Appl. Genet. 2015;128:851–864. doi: 10.1007/s00122-015-2475-3. - DOI - PMC - PubMed

LinkOut - more resources