Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Oct;7(10):2048-60.
doi: 10.1074/mcp.M700550-MCP200. Epub 2008 May 18.

Targeting the human cancer pathway protein interaction network by structural genomics

Affiliations

Targeting the human cancer pathway protein interaction network by structural genomics

Yuanpeng Janet Huang et al. Mol Cell Proteomics. 2008 Oct.

Abstract

Structural genomics provides an important approach for characterizing and understanding systems biology. As a step toward better integrating protein three-dimensional (3D) structural information in cancer systems biology, we have constructed a Human Cancer Pathway Protein Interaction Network (HCPIN) by analysis of several classical cancer-associated signaling pathways and their physical protein-protein interactions. Many well known cancer-associated proteins play central roles as "hubs" or "bottlenecks" in the HCPIN. At least half of HCPIN proteins are either directly associated with or interact with multiple signaling pathways. Although some 45% of residues in these proteins are in sequence segments that meet criteria sufficient for approximate homology modeling (Basic Local Alignment Search Tool (BLAST) E-value <10(-6)), only approximately 20% of residues in these proteins are structurally covered using high accuracy homology modeling criteria (i.e. BLAST E-value <10(-6) and at least 80% sequence identity) or by actual experimental structures. The HCPIN Website provides a comprehensive description of this biomedically important multipathway network together with experimental and homology models of HCPIN proteins useful for cancer biology research. To complement and enrich cancer systems biology, the Northeast Structural Genomics Consortium is targeting >1000 human proteins and protein domains from the HCPIN for sample production and 3D structure determination. The long range goal of this effort is to provide a comprehensive 3D structure-function database for human cancer-associated proteins and protein complexes in the context of their interaction networks. The network-based target selection (BioNet) approach described here is an example of a general strategy for targeting co-functioning proteins by structural genomics projects.

PubMed Disclaimer

Figures

F<sc>ig</sc>. 1.
Fig. 1.
The HCPIN is a Web-accessible database. It is designed for use by cancer biologists interested in assessing 3D protein structural information in the context of the protein interaction network. A, HCPIN home page. B, a snapshot of Networks view, visualizing protein-protein interactions with structure annotations. The outside ring represents the percentage of structural coverage. Green ring, experimental model is available with >99% sequence identities; yellow ring, homology model is available with >80% sequence identities. The Website provides tools for interactive analysis of the HCPIN. C, a snapshot of Proteins view, listing sequence information and PDB BLAST hits, summarizing all structural information available for the human HCPIN protein and its homologues and providing links to the corresponding PDB entries and other structure-function annotation information. D, a snapshot of Icon gallery, a collection of ribbon diagrams for each of the known structures and the structural models in the HCPIN.
F<sc>ig</sc>. 2.
Fig. 2.
Scatter plot of degree and betweenness measures for HCPIN proteins. Black, HCPIN proteins; red, proteins also listed in the Cancer Gene Census Database (1). EGFR, EGF receptor; CREBBP, CREB-binding protein; DMPK, dystrophia myotonica protein kinase; LCK, Lymphocyte cell-specific protein-tyrosine kinase; MPL, myeloproliferative leukemia protein; HSPCA, heat shock protein HSP 90-alpha.
F<sc>ig</sc>. 3.
Fig. 3.
Cross-talk between pathways. A, frequency of observing one protein in one or more of the seven KEGG signaling pathways. ∼20% of HCPIN pathway proteins are associated with two or more pathways. B, frequency of observing one HCPIN protein in one or more of seven pathway interaction subnets. >50% of HCPIN proteins are associated with two or more interaction subnets. C, frequency of observing one pathway protein in one or more pathway interaction subnets. The frequencies (17) are also labeled on the side of these pie charts.
F<sc>ig</sc>. 4.
Fig. 4.
Ribbon diagram of some HCPIN proteins/domains solved by NESG. At the bottom of each ribbon diagram, we have listed Swiss-Prot (SW) name, NESG target ID, PDB ID, residue coverage, and method used for structure determination.
F<sc>ig</sc>. 5.
Fig. 5.
A, percent residue coverage distributions for HCPIN proteins. Intracellular, proteins inside the cell; s/m, proteins predicted to be secreted or having at least a segment that is integral or transmembrane. B, box plots of size distributions of HCPIN proteins and HCPIN proteins with single domain coverage. Intracellular, proteins inside the cell; s/m, as defined above; intracellular-SD, intracellular proteins with single domain coverage, s/m-SD, proteins predicted to be secreted or having at least a segment that is transmembrane with single domain coverage. C, box plots of size distributions of HCPIN proteins with residue coverage. Intracellular-residue and s/m-residue, residue coverage of intracellular proteins and predicted secreted/membrane-associated proteins, respectively. Single domain and residue coverages are shown at high accuracy level. A similar distribution is observed at medium accuracy level. D, box plots of size distributions of full-length and targeted subregions of proteins selected by the NESG structural genomics project.
F<sc>ig</sc>. 6.
Fig. 6.
HCPIN target selection process. SEG regions, low complexity regions predicted by the program SEG (37). SignalP region, signaling peptide predicted by SignalP (32). TM region, transmembrane region predicted by TMHMM (33). C/U-region, structure covered (C) or uncovered (U) region. T-region, targeted region. Disordered regions are predicted based on mean hydrophobicity and net charge (38). E, E-value; HTP, high throughput.

References

    1. Futreal, P. A., Coin, L., Marshall, M., Down, T., Hubbard, T., Wooster, R., Rahman, N., and Stratton, M. R. ( 2004) A census of human cancer genes. Nat. Rev. Cancer 4, 177–183 - PMC - PubMed
    1. Vogelstein, B., and Kinzler, K. W. ( 2004) Cancer genes and the pathways they control. Nat. Med. 10, 789–799 - PubMed
    1. Greenman, C., Stephens, P., Smith, R., Dalgliesh, G. L., Hunter, C., Bignell, G., Davies, H., Teague, J., Butler, A., Stevens, C., Edkins, S., O'Meara, S., Vastrik, I., Schmidt, E. E., Avis, T., Barthorpe, S., Bhamra, G., Buck, G., Choudhury, B., Clements, J., Cole, J., Dicks, E., Forbes, S., Gray, K., Halliday, K., Harrison, R., Hills, K., Hinton, J., Jenkinson, A., Jones, D., Menzies, A., Mironenko, T., Perry, J., Raine, K., Richardson, D., Shepherd, R., Small, A., Tofts, C., Varian, J., Webb, T., West, S., Widaa, S., Yates, A., Cahill, D. P., Louis, D. N., Goldstraw, P., Nicholson, A. G., Brasseur, F., Looijenga, L., Weber, B. L., Chiew, Y. E., DeFazio, A., Greaves, M. F., Green, A. R., Campbell, P., Birney, E., Easton, D. F., Chenevix-Trench, G., Tan, M. H., Khoo, S. K., Teh, B. T., Yuen, S. T., Leung, S. Y., Wooster, R., Futreal, P. A., and Stratton, M. R. ( 2007) Patterns of somatic mutation in human cancer genomes. Nature 446, 153–158 - PMC - PubMed
    1. Lander, E. S., Linton, L. M., Birren, B., Nusbaum, C., Zody, M. C., Baldwin, J., Devon, K., Dewar, K., Doyle, M., FitzHugh, W., Funke, R., Gage, D., Harris, K., Heaford, A., Howland, J., et al. ( 2001) Initial sequencing and analysis of the human genome. Nature 409, 860–921 - PubMed
    1. Venter, J. C., Adams, M. D., Myers, E. W., Li, P. W., Mural, R. J., Sutton, G. G., Smith, H. O., Yandell, M., Evans, C. A., Holt, R. A., Gocayne, J. D., Amanatides, P., Ballew, R. M., Huson, D. H., Wortman, J. R., et al. ( 2001) The sequence of the human genome. Science 291, 1304–1351 - PubMed

Publication types

Substances