Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 1;41(7):btaf388.
doi: 10.1093/bioinformatics/btaf388.

Agptools: a utility suite for editing genome assemblies

Affiliations

Agptools: a utility suite for editing genome assemblies

Edward S Ricemeyer et al. Bioinformatics. .

Abstract

Summary: The AGP format is a tab-separated table format describing how components of a genome assembly fit together. A standard submission format for genome assemblies is a fasta file giving the sequence of contigs along with an AGP file showing how these components are assembled into larger pieces like scaffolds or chromosomes. For this reason, many scaffolding software pipelines output assemblies in this format. However, although many programs for assembling and scaffolding genomes read and write this format, there is currently no published software for making edits to AGP files when performing assembly curation. We present agptools, a suite of command-line programs that can perform common operations on AGP files, such as breaking and joining sequences, inverting pieces of assembly components, assembling contigs into larger sequences based on an AGP file, and transforming between coordinate systems of different assembly layouts. Additionally, agptools includes an API that writers of other software packages can use to read, write, and manipulate AGP files within their own programs.

Availability and implementation: Source code and binaries freely available for download at https://github.com/WarrenLab/agptools, implemented in Python and supported on all operating systems.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Subcommands of agptools. (a) The join command joins two or more separate sequences into a single sequence. (b) The split command splits a sequence at a given location into two sequences. (c) The flip command inverts part of a sequence. (d) The assemble command produces an assembled fasta file from a fasta file of sequence components and an AGP file giving their order and orientation. (e) The transform command translates coordinates in a bed file into the coordinate system of assembled sequences.

References

    1. AGP Specification v2.1. screed: a simple read-only sequence database, designed for short reads. https://www.ncbi.nlm.nih.gov/genbank/genome\_agp\_specification/ (3 July 2025, date last accessed).
    1. Alonge M, Lebeigle L, Kirsche M et al. Automated assembly scaffolding using RagTag elevates a new tomato system for high-throughput genome editing. Genome Biol 2022;23:258. - PMC - PubMed
    1. Belser C, Baurens F-C, Noel B et al. Telomere-to-telomere gapless chromosomes of banana using nanopore sequencing. Commun Biol 2021;4:1047. - PMC - PubMed
    1. Bliznina A, Masunaga A, Mansfield MJ et al. Telomere-to-telomere assembly of the genome of an individual oikopleura dioica from Okinawa using nanopore-based sequencing. BMC Genomics 2021;22:222. - PMC - PubMed
    1. Carroll RA, Rice ES, Murphy WJ et al. A chromosome-scale fishing cat reference genome for the evaluation of potential germline risk variants. Sci Rep 2024;14:8073. - PMC - PubMed