High-throughput SELEX SAGE method for quantitative modeling of transcription-factor binding sites
- PMID: 12101405
- DOI: 10.1038/nbt718
Abstract
The ability to determine the location and relative strength of all transcription-factor binding sites in a genome is important both for a comprehensive understanding of gene regulation and for effective promoter engineering in biotechnological applications. Here we present a bioinformatically driven experimental method to accurately define the DNA-binding sequence specificity of transcription factors. A generalized profile was used as a predictive quantitative model for binding sites, and its parameters were estimated from in vitro-selected ligands using standard hidden Markov model training algorithms. Computer simulations showed that several thousand low- to medium-affinity sequences are required to generate a profile of desired accuracy. To produce data on this scale, we applied high-throughput genomics methods to the biochemical problem addressed here. A method combining systematic evolution of ligands by exponential enrichment (SELEX) and serial analysis of gene expression (SAGE) protocols was coupled to an automated quality-controlled sequence extraction procedure based on Phred quality scores. This allowed the sequencing of a database of more than 10,000 potential DNA ligands for the CTF/NFI transcription factor. The resulting binding-site model defines the sequence specificity of this protein with a high degree of accuracy not achieved earlier and thereby makes it possible to identify previously unknown regulatory sequences in genomic DNA. A covariance analysis of the selected sites revealed non-independent base preferences at different nucleotide positions, providing insight into the binding mechanism.
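The quality-controlled extraction step mentioned in the abstract relies on Phred quality scores, which relate a score Q to an estimated base-calling error probability p by Q = -10 log10(p). The sketch below illustrates one plausible way to accept a candidate ligand only when every base in it is reliably called. It is not the authors' procedure: the abstract does not state the extraction criteria, so the function names, the window coordinates, and the cutoff of Q >= 20 are all assumptions made for illustration.

```python
from typing import Optional, Sequence


def phred_to_error_prob(q: float) -> float:
    """Convert a Phred quality score to an estimated base-call error probability."""
    return 10 ** (-q / 10)


def extract_if_reliable(
    seq: str,
    quals: Sequence[int],
    start: int,
    end: int,
    min_q: int = 20,  # assumed cutoff, not taken from the paper
) -> Optional[str]:
    """Return seq[start:end] if every base in that window has quality >= min_q.

    Returns None when the window is out of range or contains a low-quality base.
    """
    if len(seq) != len(quals):
        raise ValueError("sequence and quality lists must have the same length")
    if not (0 <= start < end <= len(seq)):
        return None
    if min(quals[start:end]) < min_q:
        return None
    return seq[start:end]


if __name__ == "__main__":
    read = "ACGTTGGCA"
    good = [30] * len(read)
    bad = [30, 30, 30, 8, 30, 30, 30, 30, 30]  # one unreliable base call
    print(extract_if_reliable(read, good, 2, 6))
    print(extract_if_reliable(read, bad, 2, 6))
    print(phred_to_error_prob(20))
```

Rejecting a whole candidate on a single low-quality base is a deliberately conservative choice for this sketch; a real pipeline might instead trim the read or use an averaged quality criterion.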
Similar articles
- Experimental analysis and computer prediction of CTF/NFI transcription factor DNA binding sites. J Mol Biol. 2000 Apr 7;297(4):833-48. doi: 10.1006/jmbi.2000.3614. PMID: 10736221
- Better estimation of protein-DNA interaction parameters improve prediction of functional sites. BMC Biotechnol. 2008 Dec 23;8:94. doi: 10.1186/1472-6750-8-94. PMID: 19105805 Free PMC article.
- Quantitative modeling and data analysis of SELEX experiments. Phys Biol. 2005 Dec 16;3(1):13-28. doi: 10.1088/1478-3975/3/1/002. PMID: 16582458
- In vitro DNA-binding profile of transcription factors: methods and new insights. J Endocrinol. 2011 Jul;210(1):15-27. doi: 10.1530/JOE-11-0010. Epub 2011 Mar 9. PMID: 21389103 Review.
- SELEX experiments: new prospects, applications and data analysis in inferring regulatory pathways. Biomol Eng. 2007 Jun;24(2):179-89. doi: 10.1016/j.bioeng.2007.03.001. Epub 2007 Mar 12. PMID: 17428731 Review.
Cited by
- Inferring direct DNA binding from ChIP-seq. Nucleic Acids Res. 2012 Sep 1;40(17):e128. doi: 10.1093/nar/gks433. Epub 2012 May 18. PMID: 22610855 Free PMC article.
- Guiding the design of synthetic DNA-binding molecules with massively parallel sequencing. J Am Chem Soc. 2012 Oct 24;134(42):17814-22. doi: 10.1021/ja308888c. Epub 2012 Oct 10. PMID: 23013524 Free PMC article.
- UniPROBE: an online database of protein binding microarray data on protein-DNA interactions. Nucleic Acids Res. 2009 Jan;37(Database issue):D77-82. doi: 10.1093/nar/gkn660. Epub 2008 Oct 8. PMID: 18842628 Free PMC article.
- Systematic Evolution of Ligands by Exponential Enrichment Technologies and Aptamer-Based Applications: Recent Progress and Challenges in Precision Medicine of Infectious Diseases. Front Bioeng Biotechnol. 2021 Aug 10;9:704077. doi: 10.3389/fbioe.2021.704077. eCollection 2021. PMID: 34447741 Free PMC article. Review.
- Scoring Targets of Transcription in Bacteria Rather than Focusing on Individual Binding Sites. Front Microbiol. 2017 Nov 22;8:2314. doi: 10.3389/fmicb.2017.02314. eCollection 2017. PMID: 29213263 Free PMC article.