Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Dec 18;10(12):2805.
doi: 10.3390/plants10122805.

CottonGen: The Community Database for Cotton Genomics, Genetics, and Breeding Research

Affiliations

CottonGen: The Community Database for Cotton Genomics, Genetics, and Breeding Research

Jing Yu et al. Plants (Basel). .

Abstract

Over the last eight years, the volume of whole genome, gene expression, SNP genotyping, and phenotype data generated by the cotton research community has exponentially increased. The efficient utilization/re-utilization of these complex and large datasets for knowledge discovery, translation, and application in crop improvement requires them to be curated, integrated with other types of data, and made available for access and analysis through efficient online search tools. Initiated in 2012, CottonGen is an online community database providing access to integrated peer-reviewed cotton genomic, genetic, and breeding data, and analysis tools. Used by cotton researchers worldwide, and managed by experts with crop-specific knowledge, it continuous to be the logical choice to integrate new data and provide necessary interfaces for information retrieval. The repository in CottonGen contains colleague, gene, genome, genotype, germplasm, map, marker, metabolite, phenotype, publication, QTL, species, transcriptome, and trait data curated by the CottonGen team. The number of data entries housed in CottonGen has increased dramatically, for example, since 2014 there has been an 18-fold increase in genes/mRNAs, a 23-fold increase in whole genomes, and a 372-fold increase in genotype data. New tools include a genetic map viewer, a genome browser, a synteny viewer, a metabolite pathways browser, sequence retrieval, BLAST, and a breeding information management system (BIMS), as well as various search pages for new data types. CottonGen serves as the home to the International Cotton Genome Initiative, managing its elections and serving as a communication and coordination hub for the community. With its extensive curation and integration of data and online tools, CottonGen will continue to facilitate utilization of its critical resources to empower research for cotton crop improvement.

Keywords: analysis; big data; bioinformatics; crop improvement; genotype; whole genome sequence.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Synteny Viewer in CottonGen. (A) Home page of Synteny Viewer allows researchers to choose a chromosome of agenome and multiple genomes for comparison. Researchers can also choose a synteny block ID. (B) A circular diagram and a table shows the synteny blocks between a chromosome of a reference genome and all chromosomes of another genome being compared. (C) A bar diagram and a table that shows all the genes in a syntenic block. The table displays E-value between the matching genes and the gene names have hyperlinks to the gene detail page. (D) A gene detail page with a resource side bar and the hyperlink to JBrowse. (E) JBrowse around the mRNA of interest with tracks such as gene, mRNA, SNP and SSR markers.
Figure 2
Figure 2
SNP Genotype search page in CottonGen. (A. left) Researchers can search SNP genotype data by dataset name, species, germplasm name, SNP name, genome location, and/or gene name. Researchers can also upload a file with a list of germplasm names. (B. right) Search result table that shows SNP name, genome location, allele, and the genotype data of all the germplasm chosen in the order of SNP location in the genome. The red square highlights the options to download the genotype for all the markers displayed in the result page or the genotype data that are polymorphic in the germplasm set chosen.
Figure 3
Figure 3
CottonGen BIMS. (A) ‘Template List’ subsection in ‘Data Import’ section provides downloadable templates for researchers to enter various breeding data. (B) ‘Search’ section allows researchers to search and save the list of accessions using any combination of properties and trait cut of values: accessions name, trial, location, cross, data year, and trait values. The middle section shows the statistical information on the filtered dataset for the trait chosen and the right section shows the number of accessions filtered so far. (C) A page with the search result table. Researchers can add more columns in the table using ‘Column options’ and save/download the result table. (D) ‘Data Analysis’ section that allows researchers to choose multiple datasets, using the categories or saved accession lists and compare the trait statistics between the datasets.

References

    1. Yu J., Main D. Role of Bioinformatics Tools and Databases in Cotton Research. In: Fang D.D., Percy R.G., editors. Cotton. 2nd ed. John Wiley & Sons; New York, NY, USA: 2015. pp. 303–338. Agronomy Monograph 57.
    1. Yu J., Jung S., Cheng C.-H., Ficklin S.P., Lee T., Zheng P., Jones D., Percy R.G., Main D. CottonGen: A genomics, genetics and breeding database for cotton research. Nucleic Acids Res. 2014;42:D1229–D1236. doi: 10.1093/nar/gkt1064. - DOI - PMC - PubMed
    1. Yu J., Hinze L.L., Yu J.Z., Kohel R.J. CottonDB.org—New website for cotton genome database; Proceedings of the International Cotton Genome Initiative Research Conference; Brasilia, Brazil. 18–20 September 2006; [(accessed on 17 November 2021)]. Available online: https://www.ars.usda.gov/research/publications/publication/?seqNo115=197886.
    1. Yu J., Kohel R., Hinze L., Yu J.Z., Frelichowski J., Ficklin S., Main D., Percy R.G. CottonDB; Proceedings of the International Plant and Animal Genome Conference XX; San Diego, CA, USA. 14–18 January 2012; [(accessed on 17 November 2021)]. Available online: https://pag.confex.com/pag/xx/webprogram/Paper1715.html.
    1. Blenda A., Scheffler J., Scheffler B., Palmer M., Lacape J.-M., Yu J.Z., Jesudurai C., Jung S., Muthukumar S., Yellambalase P., et al. CMD: A Cotton Microsatellite Database resource for Gossypium genomics. BMC Genom. 2006;7:132. doi: 10.1186/1471-2164-7-132. - DOI - PMC - PubMed