TIPPo: A User-Friendly Tool for De Novo Assembly of Organellar Genomes with High-Fidelity Data
- PMID: 39800935
- PMCID: PMC11725521
- DOI: 10.1093/molbev/msae247
TIPPo: A User-Friendly Tool for De Novo Assembly of Organellar Genomes with High-Fidelity Data
Abstract
Plant cells have two major organelles with their own genomes: chloroplasts and mitochondria. While chloroplast genomes tend to be structurally conserved, the mitochondrial genomes of plants, which are much larger than those of animals, are characterized by complex structural variation. We introduce TIPPo, a user-friendly, reference-free assembly tool that uses PacBio high-fidelity long-read data and that does not rely on genomes from related species or nuclear genome information for the assembly of organellar genomes. TIPPo employs a deep learning model for initial read classification and leverages k-mer counting for further refinement, significantly reducing the impact of nuclear insertions of organellar DNA on the assembly process. We used TIPPo to completely assemble a set of 54 complete chloroplast genomes. No other tool was able to completely assemble this set. TIPPo is comparable with PMAT in assembling mitochondrial genomes from most species but does achieve even higher completeness for several species. We also used the assembled organelle genomes to identify instances of nuclear plastid DNA (NUPTs) and nuclear mitochondrial DNA (NUMTs) insertions. The cumulative length of NUPTs/NUMTs positively correlates with the size of the nuclear genome, suggesting that insertions occur stochastically. NUPTs/NUMTs show predominantly C:G to T:A changes, with the mutated cytosines typically found in CG and CHG contexts, suggesting that degradation of NUPT and NUMT sequences is driven by the known elevated mutation rate of methylated cytosines. Small interfering RNA loci are enriched in NUPTs and NUMTs, consistent with the RdDM pathway mediating DNA methylation in these sequences.
Keywords: PacBio HiFi reads; chloroplast genome; genome assembly; mitochondrial genome; nuclear insertions of organellar genomes.
© The Author(s) 2025. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution.
Conflict of interest statement
Conflict of Interest: D.W. holds equity in Computomics, which advises plant breeders. D.W. also consults for KWS SE, a plant breeder and seed producer with activities throughout the world. All other authors declare no conflicts.
Figures







Similar articles
-
Intergenomic gene transfer in diploid and allopolyploid Gossypium.BMC Plant Biol. 2019 Nov 12;19(1):492. doi: 10.1186/s12870-019-2041-2. BMC Plant Biol. 2019. PMID: 31718541 Free PMC article.
-
Recent and frequent insertions of chloroplast DNA into maize nuclear chromosomes.Cytogenet Genome Res. 2010 Jul;129(1-3):17-23. doi: 10.1159/000312724. Epub 2010 Jul 13. Cytogenet Genome Res. 2010. PMID: 20628248
-
Quantifying the Number of Independent Organelle DNA Insertions in Genome Evolution and Human Health.Genome Biol Evol. 2017 May 1;9(5):1190-1203. doi: 10.1093/gbe/evx078. Genome Biol Evol. 2017. PMID: 28444372 Free PMC article.
-
Nuclear Integrants of Organellar DNA Contribute to Genome Structure and Evolution in Plants.Int J Mol Sci. 2020 Jan 21;21(3):707. doi: 10.3390/ijms21030707. Int J Mol Sci. 2020. PMID: 31973163 Free PMC article. Review.
-
Genome Editing of Plant Mitochondrial and Chloroplast Genomes.Plant Cell Physiol. 2024 May 14;65(4):477-483. doi: 10.1093/pcp/pcad162. Plant Cell Physiol. 2024. PMID: 38113380 Free PMC article. Review.
Cited by
-
The evolutionary dynamics of organellar pan-genomes in Arabidopsis thaliana.Genome Biol. 2025 Aug 11;26(1):240. doi: 10.1186/s13059-025-03717-0. Genome Biol. 2025. PMID: 40790212 Free PMC article.
-
Oatk: a de novo assembly tool for complex plant organelle genomes.Genome Biol. 2025 Aug 7;26(1):235. doi: 10.1186/s13059-025-03676-6. Genome Biol. 2025. PMID: 40775726 Free PMC article.
-
Minimizing detection bias of somatic mutations in a highly heterozygous oak genome.G3 (Bethesda). 2025 Aug 6;15(8):jkaf143. doi: 10.1093/g3journal/jkaf143. G3 (Bethesda). 2025. PMID: 40543050 Free PMC article.
-
PMAT2: An efficient graphical assembly toolkit for comprehensive organellar genomes.Imeta. 2025 Jul 1;4(4):e70064. doi: 10.1002/imt2.70064. eCollection 2025 Aug. Imeta. 2025. PMID: 40860429 Free PMC article.
-
PlastidHub: An integrated analysis platform for plastid phylogenomics and comparative genomics.Plant Divers. 2025 May 22;47(4):544-560. doi: 10.1016/j.pld.2025.05.005. eCollection 2025 Jul. Plant Divers. 2025. PMID: 40734827 Free PMC article.
References
-
- Ankenbrand MJ, Hohlfeld S, Hackl T, Förster F. AliTV—interactive visualization of whole genome comparisons. PeerJ Comput Sci. 2017:3:e116. 10.7717/peerj-cs.116. - DOI
MeSH terms
LinkOut - more resources
Full Text Sources
Miscellaneous