Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Feb 5;15(2):jkae281.
doi: 10.1093/g3journal/jkae281.

A unified VCF dataset from nearly 1,500 diverse maize accessions and resources to explore the genomic landscape of maize

Affiliations

A unified VCF dataset from nearly 1,500 diverse maize accessions and resources to explore the genomic landscape of maize

Carson M Andorf et al. G3 (Bethesda). .

Abstract

Efforts to capture and analyze maize nucleotide diversity have ranged widely in scope, but differences in reference genome version and software algorithms used in these efforts inhibit comparison, and these data are generally not available in an easy-to-use visualization platform for quick access and analysis. To address these issues, The Maize Genetics and Genomics Database has collaborated with maize researchers to offer variant data from a diverse set of 1,498 inbred lines, traditional varieties, and teosintes through a standardized variant-calling pipeline against version 5 of the B73 reference genome. The output was filtered for mapping quality, completeness, and linkage disequilibrium, and annotated based on variant effects relative to the B73 RefGen_v5 gene annotations. MaizeGDB has also updated a web tool, SNPversity 2.0, to filter, visualize, and download genotype sets based on genomic locations and accessions of interest, and added external datasets to demonstrate SNPversity 2.0's broad usage. MaizeGDB plans to host annual updates of these resources as additional resequencing data become available, with plans to expand to all publicly available sequence data.

Keywords: diversity; genome; maize; tools; variants.

PubMed Disclaimer

Conflict of interest statement

Conflicts of interest: The author(s) declare no conflict of interest.

Figures

Fig. 1.
Fig. 1.
Overview of SNPversity 2.0. a) The “Select options” tab. b) An example of one of the projects, opened to view the accessions within and the option selections. c) The “Table view” output tab featuring a subset of accessions for gene model Zm00001eb56510. d) The “Tree view” output tab from c for gene model Zm00001eb56510.
Fig. 2.
Fig. 2.
Closer look at the “Select options” tab. a) Overview. b) “Select genomic interval” options. c) “Choose a dataset” options.
Fig. 3.
Fig. 3.
Example data for the SNPversity 2.0 “Table view” tab. a) Twenty-five NAM founder accessions plus Ab10 for the gene model Zm00001eb407010. b) JBrowse instance of the variant highlighted with the arrow from a. Featured are the reference B73 gene model Zm00001eb407010 at the top, followed by V18 ear RNA-seq track, and the lifted over gene model annotations to genomes corresponding to the NAM founder accessions from a. The accessions from a of CML277 and Tzi8 that have the A−>G stop lost variant are represented by the lifted over gene models from their respective genomes. The A−>G variant from a is represented by the vertical line (arrow). The stop lost variant is associated with extended gene models in CML277 and Tzi8 relative to the other gene models. c) The Tzi8 gene model in its home JBrowse instance. V18 ear RNA-seq expression validates the extended gene model structure relative to B73 and B97.
Fig. 4.
Fig. 4.
Connecting SNPversity 2.0 to the MaizeGDB PanEffect tool. a) Select the “High-Coverage” dataset from the SNPversity 2.0 “Selection options” tab. b) High-coverage output for gene model Zm00001eb257910. Click on the “missense variant” link in the “Effect type” column to go to the predicted codon missense variant effect prediction in PanEffect. c) The PanEffect visualization platform. By selecting the “Variant effects in B73” tab and the “MaizeGDB 2024 High Coverage variant effects” radio button, the missense variants from the High-Coverage WGS data are shown, with color-coded effect predictions for each missense variant, from blue (benign effect) to red (strong effect). Circled is the P−>L missense variant at codon 296 (P296L) for Zm00001eb257910 from SNPversity 2.0 shown in Fig. 3b.

References

    1. Andorf C, Beavis WD, Hufford M, Smith S, Suza WP, Wang K, Woodhouse M, Yu J, Lübberstedt T. 2019. Technological advances in maize breeding: past, present and future. Theor Appl Genet. 132(3):817–849. doi: 10.1007/s00122-019-03306-3. - DOI - PubMed
    1. Andorf CM, Olivia C, Haley RK, Hayford JL, Portwood JL II, Sen S, Cannon EK, Gardiner JM, Woodhouse MR. 2024. PanEffect: a pan-genome visualization tool for variant effects in maize. Bioinformatics. 40(2):btae073. doi: 10.1093/bioinformatics/btae073. - DOI - PMC - PubMed
    1. Buckler ES, Gaut BS, McMullen MD. 2006. Molecular and functional diversity of maize. Curr Opin Plant Biol. 9(2):172–176. doi: 10.1016/j.pbi.2006.01.013. - DOI - PubMed
    1. Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, Whitwham A, Keane T, McCarthy SA, Davies RM, et al. 2021. Twelve years of SAMtools and BCFtools. GigaScience. 10(2):giab008. doi: 10.1093/gigascience/giab008. - DOI - PMC - PubMed
    1. Freed D, Aldana R, Weber JA, Edwards JS. 2017. The Sentieon Genomics Tools—a fast and accurate solution to variant calling from next-generation sequence data. bioRxiv 115717. 10.1101/115717, preprint: not peer reviewed. - DOI