Exploring massive, genome scale datasets with the GenometriCorr package
- PMID: 22693437
- PMCID: PMC3364938
- DOI: 10.1371/journal.pcbi.1002529
Exploring massive, genome scale datasets with the GenometriCorr package
Abstract
We have created a statistically grounded tool for determining the correlation of genomewide data with other datasets or known biological features, intended to guide biological exploration of high-dimensional datasets, rather than providing immediate answers. The software enables several biologically motivated approaches to these data and here we describe the rationale and implementation for each approach. Our models and statistics are implemented in an R package that efficiently calculates the spatial correlation between two sets of genomic intervals (data and/or annotated features), for use as a metric of functional interaction. The software handles any type of pointwise or interval data and instead of running analyses with predefined metrics, it computes the significance and direction of several types of spatial association; this is intended to suggest potentially relevant relationships between the datasets.
Availability and implementation: The package, GenometriCorr, can be freely downloaded at http://genometricorr.sourceforge.net/. Installation guidelines and examples are available from the sourceforge repository. The package is pending submission to Bioconductor.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures
References
-
- Bird AP. CpG-rich islands and the function of DNA methylation. Nature. 1986;321:209–213. - PubMed
-
- Bickel PJ, Brown JB, Huang H, Li Q. An overview of recent developments in genomics and associated statistical methods. Philos Transact A Math Phys Eng Sci. 2009;367:4313–4337. - PubMed
-
- Bickel PJ, Boley N, Brown JB, Huang H, Zhang NR. Subsampling methods for genomic inference. Ann Appl Stat. 2010;4:1660–1660–1697.
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
