Gigascience. 2017 Oct 1;6(10):1-7.
doi: 10.1093/gigascience/gix084.

Combining semi-automated image analysis techniques with machine learning algorithms to accelerate large-scale genetic studies

Jonathan A Atkinson et al. Gigascience.

Erratum in

Abstract

Genetic analyses of plant root systems require large datasets of extracted architectural traits. To quantify such traits from images of root systems, researchers often have to choose between automated tools (which are prone to error and extract only a limited number of architectural traits) and semi-automated ones (which are highly time-consuming). We trained a Random Forest algorithm to infer architectural traits from automatically extracted image descriptors. The model was trained on a subset of the dataset, then applied to the entire dataset. This strategy allowed us to (i) decrease the image analysis time by 73% and (ii) extract meaningful architectural traits based on image descriptors. We also show that these traits are sufficient to identify the quantitative trait loci that had previously been discovered using a semi-automated method. We have shown that combining semi-automated image analysis with machine learning algorithms has the power to increase the throughput of large-scale root studies. We expect that such an approach will enable the quantification of more complex root systems for genetic studies. We also believe that our approach could be extended to other areas of plant phenotyping.

Keywords: QTL analysis; machine learning; plant phenotyping; root.
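
The strategy described in the abstract (train on a subset of images, then apply the model to the full set of image descriptors) can be illustrated with a minimal sketch. It assumes scikit-learn and NumPy, which the abstract does not name, and it uses synthetic stand-in data: the number of images, the descriptors, and the trait formula are all hypothetical and are not taken from the study.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)

# Synthetic stand-in data (hypothetical, NOT from the study):
# 300 "images", each summarised by 5 automatically extracted descriptors.
n_images, n_descriptors = 300, 5
descriptors = rng.uniform(0.0, 1.0, size=(n_images, n_descriptors))

# Hypothetical ground-truth trait, standing in for a semi-automated
# measurement: a nonlinear function of two descriptors plus noise.
ground_truth = (
    10.0 * descriptors[:, 0]
    + 5.0 * descriptors[:, 1] ** 2
    + rng.normal(0.0, 0.3, size=n_images)
)

# Only a subset of images receives the costly ground-truth measurement.
train_idx = rng.choice(n_images, size=100, replace=False)
held_out = np.ones(n_images, dtype=bool)
held_out[train_idx] = False

# Train the Random Forest on the subset (descriptors -> trait).
model = RandomForestRegressor(n_estimators=200, random_state=0)
model.fit(descriptors[train_idx], ground_truth[train_idx])

# Apply the trained model to the descriptors of every image.
estimates = model.predict(descriptors)

# Check agreement on images the model never saw during training.
r2 = np.corrcoef(estimates[held_out], ground_truth[held_out])[0, 1] ** 2
print(f"R^2 on held-out images: {r2:.3f}")
```

In a real analysis the descriptor table would come from the automated tool and the training targets from the semi-automated tool; only the training subset would need the slower semi-automated measurement.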

Figures

Figure 1:
Overview of the analysis pipeline used in this study. (A) We divided the full dataset (2614 images) into two subsets: a training set (100 to 900 images) and a test set (1645 images). (B) For each dataset, all the images were analysed using a semi-automated root image analysis tool (RootNav) to extract the ground-truth, as well as with a fully automated root image analysis tool (RIA-J) to extract image descriptors (see the text for details). (C) We trained a Random Forest model on the image descriptors and the ground-truth from the training dataset. (D) We applied the Random Forest model to the image descriptors from the test dataset. (E) We compared the image descriptors and the Random Forest estimators from the test dataset with their corresponding ground-truth. (F) Comparison of biologically relevant metrics extracted with the automated analysis and the Random Forest analysis. (G) QTL were identified and compared using both Random Forest estimators and the ground-truth data.
Figure 2:
Accuracy of the Random Forest estimators. The R² values of the linear regression between the Random Forest estimators and the ground-truths were computed for each size and repetition of training datasets. The dotted line represents the R² value between the most closely related image descriptors and the ground-truth.
Figure 3:
Screenshot of PRIMAL. (A) Variable to evaluate with the Random Forest algorithm. (B) Random Forest algorithm parameters. (C) Visualization of the accuracy of the Random Forest estimators. (D) Accuracy metrics for the different descriptors.
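
The accuracy measure named in the Figure 2 caption, the R² of a linear regression between estimators and ground-truth values, can be sketched in plain Python. For a simple linear regression with an intercept, R² equals the squared Pearson correlation, which is what this sketch computes. The function name and the sample values are hypothetical and serve only as an illustration; they are not data from the study.

```python
from statistics import fmean


def linear_regression_r2(estimates, ground_truth):
    """R^2 of a simple linear regression (with intercept) between two series.

    Computed as the squared Pearson correlation coefficient.
    """
    if len(estimates) != len(ground_truth):
        raise ValueError("both series must have the same length")
    if len(estimates) < 2:
        raise ValueError("at least two paired values are required")

    mean_x = fmean(estimates)
    mean_y = fmean(ground_truth)
    sxx = sum((x - mean_x) ** 2 for x in estimates)
    syy = sum((y - mean_y) ** 2 for y in ground_truth)
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(estimates, ground_truth))

    if sxx == 0 or syy == 0:
        raise ValueError("R^2 is undefined when a series is constant")
    return (sxy * sxy) / (sxx * syy)


# Hypothetical paired values (illustration only).
example_estimates = [12.0, 15.5, 9.8, 20.1, 17.3]
example_truth = [11.4, 16.0, 10.5, 19.2, 18.0]
print(round(linear_regression_r2(example_estimates, example_truth), 3))
```

The same function could be applied to a single image descriptor and the ground-truth, which corresponds to the dotted reference line described in the caption.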

