BEDTools: a flexible suite of utilities for comparing genomic features
- PMID: 20110278
- PMCID: PMC2832824
- DOI: 10.1093/bioinformatics/btq033
Abstract
Motivation: Testing for correlations between different sets of genomic features is a fundamental task in genomics research. However, searching for overlaps between features with existing web-based methods is complicated by the massive datasets that are routinely produced with current sequencing technologies. Fast and flexible tools are therefore required to ask complex questions of these data in an efficient manner.
Results: This article introduces a new software suite for the comparison, manipulation and annotation of genomic features in the Browser Extensible Data (BED) and General Feature Format (GFF) formats. BEDTools also supports the comparison of sequence alignments in BAM format to both BED and GFF features. The tools are extremely efficient and allow the user to compare large datasets (e.g. next-generation sequencing data) with both public and custom genome annotation tracks. The individual BEDTools utilities can be combined with one another as well as with standard UNIX commands, thus facilitating routine genomics tasks as well as pipelines that can quickly answer intricate questions of large genomic datasets.
Availability and implementation: BEDTools was written in C++. Source code and a comprehensive user manual are freely available at http://code.google.com/p/bedtools
Contact: aaronquinlan@gmail.com; imh4y@virginia.edu
Supplementary information: Supplementary data are available at Bioinformatics online.
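To make the feature comparison described in the Results concrete, the following is a minimal Python sketch of an overlap test between two sets of BED features. It is an illustration only and does not reflect how BEDTools itself is implemented: the names `Feature`, `parse_bed` and `find_overlaps` are hypothetical, the sample coordinates are invented, and the scan is a naive pairwise comparison that would not scale to large datasets. It assumes the standard BED convention of 0-based, half-open intervals.

```python
from typing import List, NamedTuple, Tuple


class Feature(NamedTuple):
    """One BED record (hypothetical helper type, not part of BEDTools)."""
    chrom: str
    start: int  # 0-based, inclusive (BED convention)
    end: int    # exclusive (BED convention)
    name: str


def parse_bed(text: str) -> List[Feature]:
    """Parse tab-delimited BED text, skipping blank and header lines."""
    features = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith(("#", "track", "browser")):
            continue
        fields = line.split("\t")
        name = fields[3] if len(fields) > 3 else "."
        features.append(Feature(fields[0], int(fields[1]), int(fields[2]), name))
    return features


def find_overlaps(a: List[Feature], b: List[Feature]) -> List[Tuple[Feature, Feature, int]]:
    """Return (feature_a, feature_b, overlapping_bases) for every overlapping pair.

    Naive O(len(a) * len(b)) scan, chosen for clarity rather than speed.
    """
    hits = []
    for fa in a:
        for fb in b:
            if fa.chrom != fb.chrom:
                continue
            shared = min(fa.end, fb.end) - max(fa.start, fb.start)
            if shared > 0:  # half-open intervals: touching ends do not overlap
                hits.append((fa, fb, shared))
    return hits


# Invented sample data for illustration.
READS = parse_bed("chr1\t100\t200\tread1\nchr1\t500\t600\tread2\nchr2\t100\t200\tread3\n")
GENES = parse_bed("chr1\t150\t550\tgeneA\nchr2\t300\t400\tgeneB\n")
HITS = find_overlaps(READS, GENES)

for read, gene, shared in HITS:
    print(f"{read.name}\t{gene.name}\t{shared}")
```

With the sample data, `read1` and `read2` each share 50 bases with `geneA`, while `read3` overlaps nothing on `chr2`.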