Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation

Automated quantitative trait locus analysis (AutoQTL)

Philip J Freda et al. bioRxiv. .

Update in

  • Automated quantitative trait locus analysis (AutoQTL).
    Freda PJ, Ghosh A, Zhang E, Luo T, Chitre AS, Polesskaya O, St Pierre CL, Gao J, Martin CD, Chen H, Garcia-Martinez AG, Wang T, Han W, Ishiwari K, Meyer P, Lamparelli A, King CP, Palmer AA, Li R, Moore JH. Freda PJ, et al. BioData Min. 2023 Apr 10;16(1):14. doi: 10.1186/s13040-023-00331-3. BioData Min. 2023. PMID: 37038201 Free PMC article.

Abstract

Background: Quantitative Trait Locus (QTL) analysis and Genome-Wide Association Studies (GWAS) have the power to identify variants that capture significant levels of phenotypic variance in complex traits. However, effort and time are required to select the best methods and optimize parameters and pre-processing steps. Although machine learning approaches have been shown to greatly assist in optimization and data processing, applying them to QTL analysis and GWAS is challenging due to the complexity of large, heterogenous datasets. Here, we describe proof-of-concept for an automated machine learning approach, AutoQTL, with the ability to automate many complex decisions related to analysis of complex traits and generate diverse solutions to describe relationships that exist in genetic data.

Results: Using a dataset of 18 putative QTL from a large-scale GWAS of body mass index in the laboratory rat, Rattus norvegicus , AutoQTL captures the phenotypic variance explained under a standard additive model while also providing evidence of non-additive effects including deviations from additivity and 2-way epistatic interactions from simulated data via multiple optimal solutions. Additionally, feature importance metrics provide different insights into the inheritance models and predictive power of multiple GWAS-derived putative QTL.

Conclusions: This proof-of-concept illustrates that automated machine learning techniques can be applied to genetic data and has the potential to detect both additive and non-additive effects via various optimal solutions and feature importance metrics. In the future, we aim to expand AutoQTL to accommodate omics-level datasets with intelligent feature selection strategies.

PubMed Disclaimer

Publication types

LinkOut - more resources