Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2006 May;2(5):e52.
doi: 10.1371/journal.pcbi.0020052. Epub 2006 May 19.

Iterative reconstruction of transcriptional regulatory networks: an algorithmic approach

Affiliations

Iterative reconstruction of transcriptional regulatory networks: an algorithmic approach

Christian L Barrett et al. PLoS Comput Biol. 2006 May.

Abstract

The number of complete, publicly available genome sequences is now greater than 200, and this number is expected to rapidly grow in the near future as metagenomic and environmental sequencing efforts escalate and the cost of sequencing drops. In order to make use of this data for understanding particular organisms and for discerning general principles about how organisms function, it will be necessary to reconstruct their various biochemical reaction networks. Principal among these will be transcriptional regulatory networks. Given the physical and logical complexity of these networks, the various sources of (often noisy) data that can be utilized for their elucidation, the monetary costs involved, and the huge number of potential experiments approximately 10(12)) that can be performed, experiment design algorithms will be necessary for synthesizing the various computational and experimental data to maximize the efficiency of regulatory network reconstruction. This paper presents an algorithm for experimental design to systematically and efficiently reconstruct transcriptional regulatory networks. It is meant to be applied iteratively in conjunction with an experimental laboratory component. The algorithm is presented here in the context of reconstructing transcriptional regulation for metabolism in Escherichia coli, and, through a retrospective analysis with previously performed experiments, we show that the produced experiment designs conform to how a human would design experiments. The algorithm is able to utilize probability estimates based on a wide range of computational and experimental sources to suggest experiments with the highest potential of discovering the greatest amount of new regulatory knowledge.

PubMed Disclaimer

Conflict of interest statement

Competing interests. The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. An Overview of the Combined Computational and Experimental Iterative Procedure
The computational algorithm utilizes a dynamic simulation of the current integrated transcriptional regulatory and metabolic network reconstruction to design experiments. The new regulatory logic rules discovered by the experiments are then added to the reconstruction. TFi, transcription factor i; TFj, transcription factor j.
Figure 2
Figure 2. A Graphical Depiction of the Computational Algorithm
The algorithm ultimately produces experiment designs ranked by their potential for producing the most new regulatory rules. TFi, transcription factor i ; TFj, transcription factor j.
Figure 3
Figure 3. A Graphical Depiction of How an Activity Profile Is Created for a Growth Simulation
The example model M in (A) contains genes for three transcription factors and two enzymes and shows the 0/1 (“off/on”) state of each gene in each time step tn of a simulation for a defined growth environment E. The dashed line in the model indicates that a regulatory connection between the two genes is not explicitly modelled, but is suspected to exist. (B) shows how a “basic unit” is defined and shows the general formula for the parameterization of each cell. One basic unit exists for every known or suspected TF–target gene pair. As shown in (C), for such regulatory connections explicitly modelled, each cell of the basic unit gets either a “0” or a “1,” depending on whether its associated transcription factor and target gene were observed to be in the indicated 0/1 combination in any simulation time step. (D) illustrates how the inferred TF–target gene regulatory connections and logic derived from experimental and/or computational data are incorporated in a basic unit. In (E) the basic units from (C) and (D) are concatenated to form the activity profile for the simulation.

References

    1. Peterson JD, Umayam LA, Dickinson T, Hickey EK, White O. The Comprehensive Microbial Resource. Nucleic Acids Res. 2001;29:123–125. - PMC - PubMed
    1. Bernal A, Ear U, Kyrpides N. Genomes OnLine Database (GOLD): A monitor of genome projects world-wide. Nucleic Acids Res. 2001;29:126–127. - PMC - PubMed
    1. Handelsman J. Metagenomics or megagenomics. Nat Rev Microbiol. 2005;3:457–458.
    1. Venter JC, Remington K, Heidelberg JF, Halpern AL, Rusch D, et al. Environmental genome shotgun sequencing of the Sargasso Sea. Science. 2004;304:66–74. - PubMed
    1. Price ND, Reed JL, Palsson BO. Genome-scale models of microbial cells: evaluating the consequences of constraints. Nat Rev Microbiol. 2004;2:886–897. - PubMed

Publication types