Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Apr 20:13:842387.
doi: 10.3389/fgene.2022.842387. eCollection 2022.

PolyReco: A Method to Automatically Label Collinear Regions and Recognize Polyploidy Events Based on the K S Dotplot

Affiliations

PolyReco: A Method to Automatically Label Collinear Regions and Recognize Polyploidy Events Based on the K S Dotplot

Fushun Wang et al. Front Genet. .

Abstract

Polyploidization plays a critical role in producing new gene functions and promoting species evolution. Effective identification of polyploid types can be helpful in exploring the evolutionary mechanism. However, current methods for detecting polyploid types have some major limitations, such as being time-consuming and strong subjectivity, etc. In order to objectively and scientifically recognize collinearity fragments and polyploid types, we developed PolyReco method, which can automatically label collinear regions and recognize polyploidy events based on the K S dotplot. Combining with whole-genome collinearity analysis, PolyReco uses DBSCAN clustering method to cluster K S dots. According to the distance information in the x-axis and y-axis directions between the categories, the clustering results are merged based on certain rules to obtain the collinear regions, automatically recognize and label collinear fragments. According to the information of the labeled collinear regions on the y-axis, the polyploidization recognition algorithm is used to exhaustively combine and obtain the genetic collinearity evaluation index of each combination, and then draw the genetic collinearity evaluation index graph. Based on the inflection point on the graph, polyploid types and related chromosomes with polyploidy signal can be detected. The validation experiments showed that the conclusions of PolyReco were consistent with the previous study, which verified the effectiveness of this method. It is expected that this approach can become a reference architecture for other polyploid types classification methods.

Keywords: DBSCAN; chromosome; clustering; collinearity fragment; polyploidy.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
(A) K S dotplot between Salix sinopurpurea and Vitis vinifera genome homologous genes (B) DBSCAN cluster recognition effect figure (C) Automatic label result of collinearity fragments based on DBSCAN.
FIGURE 2
FIGURE 2
(A) Collinearity evaluation index line chart of Vitis vinifera Chr.4, Chr.13, and Chr.14 (B) Combination figure of Salix sinopurpurea polyploidy. (A,B) correspond to each other. (1) The grape chromosome 4 (2) The grape chromosome 13 (3) The grape chromosome 14.
FIGURE 3
FIGURE 3
(A) Collinearity evaluation index line chart of Arabidopsis thaliana Chr.1, Chr.3 and Chr.5 (B) Combination figure of Brassica rapa polyploidy. (A,B) correspond to each other. (1) The Arabidopsis thaliana chromosome 1 (2) The Arabidopsis thaliana chromosome 3 (3) The Arabidopsis thaliana chromosome 5

Similar articles

Cited by

References

    1. Cheng S., Xian W., Fu Y., Marin B., Keller J., Wu T., et al. (2019). Genomes of Subaerial Zygnematophyceae Provide Insights into Land Plant Evolution. Cell 179 (5), 1057–1067. 10.1016/j.cell.2019.10.019 - DOI - PubMed
    1. Dong S., Liu M., Liu Y., Chen F., Yang T., Chen L., et al. (2021). The Genome of Magnolia Biondii Pamp. Provides Insights into the Evolution of Magnoliales and Biosynthesis of Terpenoids. Hortic. Res. 8 (1), 38. 10.1038/s41438-021-00471-9 - DOI - PMC - PubMed
    1. Edgar R. C. (2004). MUSCLE: a Multiple Sequence Alignment Method with Reduced Time and Space Complexity. BMC Bioinformatics 5, 113. 10.1186/1471-2105-5-113 - DOI - PMC - PubMed
    1. Hao M., Li A., Shi T., Luo J., Zhang L., Zhang X., et al. (2017). The Abundance of Homoeologue Transcripts Is Disrupted by Hybridization and Is Partially Restored by Genome Doubling in Synthetic Hexaploid Wheat. Bmc Genomics 18, 149. 10.1186/s12864-017-3558-0 - DOI - PMC - PubMed
    1. Li Z., Defoort J., Tasdighian S., Maere S., Van de Peer Y., De Smet R. (2016). Gene Duplicability of Core Genes Is Highly Consistent across All Angiosperms. Plant Cell 28 (2), 326–344. 10.1105/tpc.15.00877 - DOI - PMC - PubMed

LinkOut - more resources