Bioinformatics. 2016 Jan 15;32(2):289-91.
doi: 10.1093/bioinformatics/btv562. Epub 2015 Sep 30.

regioneR: an R/Bioconductor package for the association analysis of genomic regions based on permutation tests

Bernat Gel et al. Bioinformatics.

Abstract

Motivation: Statistically assessing the relation between a set of genomic regions and other genomic features is a common and challenging task in genomic and epigenomic analyses. Randomization-based approaches implicitly take into account the complexity of the genome without needing to assume an underlying statistical model.

Summary: regioneR is an R package that implements a permutation test framework specifically designed to work with genomic regions. In addition to the predefined randomization and evaluation strategies, regioneR is fully customizable, allowing the use of custom strategies to adapt it to specific questions. Finally, it also implements a novel function to evaluate the local specificity of the detected association.
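
To make the idea of a permutation test on genomic regions concrete, here is a minimal Python sketch. It is not regioneR's code or API: the function names (`num_overlaps`, `randomize_regions`, `perm_test`), the single-chromosome toy genome, the half-open `(start, end)` interval convention, and the +1 correction in the empirical p-value are all assumptions made for illustration. The randomization and evaluation steps are passed around as plain functions, loosely mirroring the customizable strategies the Summary describes.

```python
import random
from statistics import mean, pstdev


def num_overlaps(a, b):
    """Evaluation function: count regions in `a` overlapping any region in `b`.

    Regions are half-open (start, end) tuples on one hypothetical chromosome.
    """
    return sum(any(s < e2 and s2 < e for s2, e2 in b) for s, e in a)


def randomize_regions(regions, chrom_len, rng):
    """Randomization function: move each region to a uniformly random
    position on the chromosome, preserving its width."""
    out = []
    for s, e in regions:
        width = e - s
        new_start = rng.randrange(0, chrom_len - width + 1)
        out.append((new_start, new_start + width))
    return out


def perm_test(a, b, chrom_len, ntimes=1000, seed=0):
    """Compare the observed evaluation of (a, b) with its distribution
    over `ntimes` randomizations of `a`."""
    rng = random.Random(seed)
    observed = num_overlaps(a, b)
    permuted = [
        num_overlaps(randomize_regions(a, chrom_len, rng), b)
        for _ in range(ntimes)
    ]
    # One-sided empirical p-value (assumed convention: +1 in numerator
    # and denominator so the p-value is never exactly zero).
    pval = (sum(x >= observed for x in permuted) + 1) / (ntimes + 1)
    sd = pstdev(permuted)
    zscore = (observed - mean(permuted)) / sd if sd > 0 else float("nan")
    return {"observed": observed, "pval": pval, "zscore": zscore,
            "permuted": permuted}


# Toy usage: three regions in `a`, each overlapping a region in `b`.
a = [(100, 200), (1000, 1100), (5000, 5100)]
b = [(150, 250), (1050, 1150), (5050, 5150)]
result = perm_test(a, b, chrom_len=1_000_000, ntimes=200)
print(result["observed"], result["pval"])
```

Swapping in a different evaluation (for example, mean distance to the nearest feature) or a different randomization (for example, one that excludes masked regions) only requires replacing the corresponding function, which is the design point the sketch is meant to show.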

Availability and implementation: regioneR is an R package released under the Artistic-2.0 License. The source code and documents are freely available through Bioconductor (http://www.bioconductor.org/packages/regioneR).

Contact: rmalinverni@carrerasresearch.org.

Figures

Fig. 1.
(A) Plot of the results of a permutation test assessing the association between a subset of 1000 HepG2 CTCF narrow peaks (ENCODE/Broad Institute) and CpG islands (Wu et al., 2010), using a per-chromosome randomization of CTCF peaks, the number of overlaps as the evaluation function and 5000 permutations. The association is highly significant, with the observed value far from the limit of significance of the random distribution. (B) Plot of the local z-score of the permutation test in A. The association is strongly related to the exact position of the CTCF peaks, since the z-score drops sharply as soon as the regions are shifted by a few hundred bases.
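
The local z-score idea in panel B can be sketched in a few lines of Python. This is an illustrative approximation, not regioneR's implementation: it assumes a single toy chromosome, half-open `(start, end)` intervals, and that the z-score at each shift is computed against the same null distribution obtained from randomizing the unshifted regions. The names `count_overlaps`, `null_distribution` and `local_z_scores` are hypothetical.

```python
import random
from statistics import mean, pstdev


def count_overlaps(a, b):
    """Count regions in `a` that overlap at least one region in `b`."""
    return sum(any(s < e2 and s2 < e for s2, e2 in b) for s, e in a)


def null_distribution(a, b, chrom_len, ntimes, rng):
    """Overlap counts after placing each region of `a` at a random position."""
    values = []
    for _ in range(ntimes):
        shuffled = []
        for s, e in a:
            width = e - s
            new_start = rng.randrange(0, chrom_len - width + 1)
            shuffled.append((new_start, new_start + width))
        values.append(count_overlaps(shuffled, b))
    return values


def local_z_scores(a, b, chrom_len, shifts, ntimes=500, seed=1):
    """Z-score of the overlap count when `a` is shifted by each offset.

    Shifted regions are not clipped to the chromosome bounds in this sketch.
    """
    rng = random.Random(seed)
    null = null_distribution(a, b, chrom_len, ntimes, rng)
    mu, sd = mean(null), pstdev(null)
    scores = {}
    for shift in shifts:
        shifted = [(s + shift, e + shift) for s, e in a]
        value = count_overlaps(shifted, b)
        scores[shift] = (value - mu) / sd if sd > 0 else float("nan")
    return scores


# Toy usage: every region in `a` sits inside a region in `b`,
# so the association should depend strongly on exact position.
b = [(i * 2000, i * 2000 + 200) for i in range(50)]
a = [(i * 2000 + 50, i * 2000 + 150) for i in range(50)]
for shift, z in local_z_scores(a, b, 100_000, [-1000, -100, 0, 100, 1000]).items():
    print(shift, round(z, 2))
```

A profile that peaks at shift 0 and falls away quickly, as in this toy data, corresponds to the position-dependent association described for the CTCF peaks; a flat profile would instead suggest a regional rather than a local association.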
