ParaSAM: a parallelized version of the significance analysis of microarrays algorithm
- PMID: 20400455
- PMCID: PMC2872005
- DOI: 10.1093/bioinformatics/btq161
ParaSAM: a parallelized version of the significance analysis of microarrays algorithm
Abstract
Motivation: Significance analysis of microarrays (SAM) is a widely used permutation-based approach to identifying differentially expressed genes in microarray datasets. While SAM is freely available as an Excel plug-in and as an R-package, analyses are often limited for large datasets due to very high memory requirements.
Summary: We have developed a parallelized version of the SAM algorithm called ParaSAM to overcome the memory limitations. This high performance multithreaded application provides the scientific community with an easy and manageable client-server Windows application with graphical user interface and does not require programming experience to run. The parallel nature of the application comes from the use of web services to perform the permutations. Our results indicate that ParaSAM is not only faster than the serial version, but also can analyze extremely large datasets that cannot be performed using existing implementations.
Availability: A web version open to the public is available at http://bioanalysis.genomics.mcg.edu/parasam. For local installations, both the windows and web implementations of ParaSAM are available for free at http://www.amdcc.org/bioinformatics/software/parasam.aspx.
Figures
Similar articles
-
ParaKMeans: Implementation of a parallelized K-means algorithm suitable for general laboratory use.BMC Bioinformatics. 2008 Apr 16;9:200. doi: 10.1186/1471-2105-9-200. BMC Bioinformatics. 2008. PMID: 18416829 Free PMC article.
-
A modified hyperplane clustering algorithm allows for efficient and accurate clustering of extremely large datasets.Bioinformatics. 2009 May 1;25(9):1152-7. doi: 10.1093/bioinformatics/btp123. Epub 2009 Mar 4. Bioinformatics. 2009. PMID: 19261720 Free PMC article.
-
Interactively optimizing signal-to-noise ratios in expression profiling: project-specific algorithm selection and detection p-value weighting in Affymetrix microarrays.Bioinformatics. 2004 Nov 1;20(16):2534-44. doi: 10.1093/bioinformatics/bth280. Epub 2004 Apr 29. Bioinformatics. 2004. PMID: 15117752
-
MADGE: scalable distributed data management software for cDNA microarrays.Bioinformatics. 2003 Jan;19(1):87-9. doi: 10.1093/bioinformatics/19.1.87. Bioinformatics. 2003. PMID: 12499297
-
Open source software for the analysis of microarray data.Biotechniques. 2003 Mar;Suppl:45-51. Biotechniques. 2003. PMID: 12664684 Review.
Cited by
-
Combining small-volume metabolomic and transcriptomic approaches for assessing brain chemistry.Anal Chem. 2013 Mar 19;85(6):3136-43. doi: 10.1021/ac3032959. Epub 2013 Mar 1. Anal Chem. 2013. PMID: 23409944 Free PMC article.
References
-
- Benjamini Y, Hochberg Y. Controlling the false discovery rate - a practical and powerful approach to multiple testing. J. Royal Stat. Soc. Series B Methodol. 1995;57:289–300.
-
- Kaizer EC, et al. Gene expression in peripheral blood mononuclear cells from children with diabetes. J. Clin. Endocrinol. Metab. 2007;92:3705–3711. - PubMed