Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 Feb 10:3:93-114.

Data mining for gene networks relevant to poor prognosis in lung cancer via backward-chaining rule induction

Affiliations

Data mining for gene networks relevant to poor prognosis in lung cancer via backward-chaining rule induction

Mary E Edgerton et al. Cancer Inform. .

Abstract

We use Backward Chaining Rule Induction (BCRI), a novel data mining method for hypothesizing causative mechanisms, to mine lung cancer gene expression array data for mechanisms that could impact survival. Initially, a supervised learning system is used to generate a prediction model in the form of "IF <conditions> THEN <outcome>" style rules. Next, each antecedent (i.e. an IF condition) of a previously discovered rule becomes the outcome class for subsequent application of supervised rule induction. This step is repeated until a termination condition is satisfied. "Chains" of rules are created by working backward from an initial condition (e.g. survival status). Through this iterative process of "backward chaining," BCRI searches for rules that describe plausible gene interactions for subsequent validation. Thus, BCRI is a semi-supervised approach that constrains the search through the vast space of plausible causal mechanisms by using a top-level outcome to kick-start the process. We demonstrate the general BCRI task sequence, how to implement it, the validation process, and how BCRI-rules discovered from lung cancer microarray data can be combined with prior knowledge to generate hypotheses about functional genomics.

Keywords: C4.5; class discovery; data analysis; decision trees; microarray; molecular mechanisms; non-small cell lung cancer; semi-supervised methods; systems biology.

PubMed Disclaimer

Figures

Figure 6.
Figure 6.
A decision tree.
Figure 7.
Figure 7.
Joint distribution of attribute A and Class
Figure 8.
Figure 8.
Possible threshold values for a continuous attribute, C
Figure 1.
Figure 1.
candidate split points.
Figure 2.
Figure 2.
C45W-BCRI rules expressed as an AND/OR graph. *The rule to predict MRPL19 < 161.4 in the Low Risk trace will also predict MRPL19 < 161.4 in the High Risk trace. It is not shown again in the High Risk trace.
Figure 3.
Figure 3.
Pathway Assist™ diagram showing SERPINA1 and ELA2 relationships of Example 1. The protein products are indicated by the large ovals, a binding interaction is indicated by the purple dot relationship between the ovals, and gene expression regulation is indicated by a square along a dotted line.
Figure 4.
Figure 4.
Pathway Assist™ diagram illustrating FXN and EIF2S1 relationships of Example 2
Figure 5.
Figure 5.
Pathway Assist™ diagram of KRT13 and DDX5 relationships of Example 3

Similar articles

Cited by

References

    1. Anzick SL, Trent JM. Role of genomics in identifying new targets for cancer therapy. Oncology., (Huntingt) 2002;16(5 Suppl 4):7–13. - PubMed
    1. Bai C, Connolly B, Metzker ML, et al. Overexpression of M68/DcR3 in human gastrointestinal tract tumors independent of gene amplification and its location in a four-gene cluster. Proc. Natl. Acad. Sci. U.S.A. 2000;97(3):1230–5. - PMC - PubMed
    1. Beer D, Kardia S, Huang C, et al. Gene-expression profiles predict survival of patients with lung adenocarcinoma. Nature Medicine. 2002;8:816–824. - PubMed
    1. Bhattacharjee A, Richards WG, Staunton J, et al. Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses. Proc. Natl. Acad. Sci. U.S.A. 2001;98:13790–13795. - PMC - PubMed
    1. Bjorkhem-Bergman L, Jonsson-Videsater K, Paul C, et al. Mammalian thioredoxin reductase alters cytolytic activity of an antibacterial peptide. Peptides. 2004;25(11):1849–55. - PubMed

LinkOut - more resources