Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Mar 28;38(7):1788-1793.
doi: 10.1093/bioinformatics/btac005.

Telogator: a method for reporting chromosome-specific telomere lengths from long reads

Affiliations

Telogator: a method for reporting chromosome-specific telomere lengths from long reads

Zachary Stephens et al. Bioinformatics. .

Abstract

Motivation: Telomeres are the repetitive sequences found at the ends of eukaryotic chromosomes and are often thought of as a 'biological clock,' with their average length shortening during division in most cells. In addition to their association with senescence, abnormal telomere lengths are well known to be associated with multiple cancers, short telomere syndromes and as risk factors for a broad range of diseases. While a majority of methods for measuring telomere length will report average lengths across all chromosomes, it is known that aberrations in specific chromosome arms are biomarkers for certain diseases. Due to their repetitive nature, characterizing telomeres at this resolution is prohibitive for short read sequencing approaches, and is challenging still even with longer reads.

Results: We present Telogator: a method for reporting chromosome-specific telomere length from long read sequencing data. We demonstrate Telogator's sensitivity in detecting chromosome-specific telomere length in simulated data across a range of read lengths and error rates. Telogator is then applied to 10 germline samples, yielding a high correlation with short read methods in reporting average telomere length. In addition, we investigate common subtelomere rearrangements and identify the minimum read length required to anchor telomere/subtelomere boundaries in samples with these haplotypes.

Availability and implementation: Telogator is written in Python3 and is available at github.com/zstephens/telogator.

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Overview of Telogator. ‘Tel’ and ‘subtel’ are used as abbreviations for telomere and subtelomere, respectively. ‘T2T + alt’ indicates the telomere-to-telomere reference genome augmented with alternate subtelomere assemblies
Fig. 2.
Fig. 2.
An example read from HG002 showing the kmer density and telomere scoring functions for a telomere in chr2p. The red highlighted section indicates read coordinates identified as telomere sequence
Fig. 3.
Fig. 3.
An example violin plot produced by Telogator on simulated data (read length 20k, error rate 10%). Black lines indicate the ground truth telomere length
Fig. 4.
Fig. 4.
Telomere length estimation error for different sequencing error rates and read lengths
Fig. 5.
Fig. 5.
Percentage of telomeres identified by Telogator at different average coverage rates
Fig. 6.
Fig. 6.
Chromosome-specific telomere length for HG002 (NA24385). The lines indicate the mean TL at each chromosome arm
Fig. 7.
Fig. 7.
Comparison of average TLs reported by Telogator versus TelomereHunter
Fig. 8.
Fig. 8.
Correlation of TL as reported from PacBio reads versus Nanopore
Fig. 9.
Fig. 9.
Very long Nanopore reads were found to span a 100 kb section of subtelomere 20q between telomere sequence and subtelomere 6p in HG002

References

    1. Armanios M. (2009) Syndromes of telomere shortening. Annu. Rev. Genomics Hum. Genet., 10, 45–61. - PMC - PubMed
    1. Barbaro P. et al. (2016) The wide-ranging clinical implications of the short telomere syndromes. Int. Med. J., 46, 393–403. - PubMed
    1. Barkovskaya M.S. et al. (2019) Telomere length distribution on individual chromosome arms in patients with bronchial asthma. Bull. Siberian Med., 18, 164–174.
    1. Blinova E. et al. (2016) Telomere length of individual chromosomes in patients with rheumatoid arthritis. Bull. Exp. Biol. Med., 160, 779–782. - PubMed
    1. Bojesen S.E. et al. ; Genetic Modifiers of Cancer Risk in BRCA1/2 Mutation Carriers (GEMO). (2013) Multiple independent variants at the tert locus are associated with telomere length and risks of breast and ovarian cancer. Nat. Genet., 45, 371–384. - PMC - PubMed

Publication types