Nucleic Acids Res. 2007 Jan;35(Database issue):D663-7.
doi: 10.1093/nar/gkl1017. Epub 2006 Dec 13.

The ENCODE Project at UC Santa Cruz

Daryl J Thomas et al. Nucleic Acids Res. 2007 Jan.

Abstract

The goal of the Encyclopedia Of DNA Elements (ENCODE) Project is to identify all functional elements in the human genome. The pilot phase is for comparison of existing methods and for the development of new methods to rigorously analyze a defined 1% of the human genome sequence. Experimental datasets are focused on the origin of replication, DNase I hypersensitivity, chromatin immunoprecipitation, promoter function, gene structure, pseudogenes, non-protein-coding RNAs, transcribed RNAs, multiple sequence alignment and evolutionarily constrained elements. The ENCODE project at UCSC website (http://genome.ucsc.edu/ENCODE) is the primary portal for the sequence-based data produced as part of the ENCODE project. In the pilot phase of the project, over 30 labs provided experimental results for a total of 56 browser tracks supported by 385 database tables. The site provides researchers with a number of tools that allow them to visualize and analyze the data as well as download data for local analyses. This paper describes the portal to the data, highlights the data that has been made available, and presents the tools that have been developed within the ENCODE project. Access to the data and types of interactive analysis that are possible are illustrated through supplemental examples.

Figures

Figure 1
Composite track control and display. (A) Controls for options that apply to all data in this track (top) with checkboxes to include or exclude individual sub-tracks as desired (bottom). (B) Example of a composite track display showing the IRF1 gene, repeats, Yale transcript maps and Yale transcriptionally active regions (6,7). The latter two are composite tracks, each containing multiple datasets. The Placenta RNA checkbox is deselected above, so that the data are not displayed in the image below.
Figure 2
Conservation display. (A) Conservation track at the base level shows details of a multiple sequence alignment, conservation scores and amino acid translations in coding regions. (‘.’: base is identical to human; ‘N’: missing sequence, ‘=’: sequence that does not align to reference is present in this species; orange numbers/lines: additional bases that are present in other species). (B) Conservation track zoomed out shows pairwise identity summary and conservation scores, highlighting non-coding elements in addition to exons.
Figure 3
Track correlation in the Table Browser. Correlation of the Boston University OH Radical Cleavage Intensity Database (ORChID) (–17) is shown with the CpG Island (left) and with the GC Percent (right) tracks. Statistical summaries (upper panels), scatter and residual plots (middle panels) and histograms (lower panels) are shown.
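
The kind of track-to-track comparison summarized in Figure 3 can be illustrated with a minimal sketch. It assumes two tracks have already been reduced to paired numeric scores over the same windows, and it computes a Pearson correlation, a least-squares line, and the residuals that a residual plot would show. This is not the Table Browser's own implementation: the function name `pearson_and_fit` and the score values are hypothetical and chosen only for illustration.

```python
from math import sqrt


def pearson_and_fit(x, y):
    """Return (r, slope, intercept, residuals) for paired score lists x and y."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length and at least two points")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of squares and cross-products around the means.
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when one track is constant")
    r = sxy / sqrt(sxx * syy)              # Pearson correlation coefficient
    slope = sxy / sxx                      # least-squares slope of y on x
    intercept = mean_y - slope * mean_x
    # Residuals: observed y minus the value predicted by the fitted line.
    residuals = [b - (slope * a + intercept) for a, b in zip(x, y)]
    return r, slope, intercept, residuals


# Hypothetical per-window scores (made-up values, not ENCODE data).
gc_percent = [35.0, 40.0, 45.0, 50.0, 55.0]
cleavage = [1.2, 1.0, 0.9, 0.7, 0.6]

r, slope, intercept, residuals = pearson_and_fit(gc_percent, cleavage)
print(f"r = {r:.3f}, slope = {slope:.4f}, intercept = {intercept:.3f}")
```

Plotting `residuals` against `gc_percent` would give a rough analogue of the residual panels, and a histogram of each input list would correspond to the lower panels.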

References

    1. The ENCODE Project Consortium. The ENCODE (ENCyclopedia Of DNA Elements) project. Science. 2004;306:636–640.
    2. Hinrichs A.S., Karolchik D., Baertsch R., Barber G.P., Bejerano G., Clawson H., Diekhans M., Furey T.S., Harte R.A., Hsu F., et al. The UCSC Genome Browser Database: update 2006. Nucleic Acids Res. 2006;34:D590–D598.
    3. Kuhn B., the UCSC Genome Bioinformatics Group. The UCSC Genome Browser Database: update 2007. Nucleic Acids Res. 2007 in press.
    4. Karolchik D., Hinrichs A.S., Furey T.S., Roskin K.M., Sugnet C.W., Haussler D., Kent W.J. The UCSC Table Browser data retrieval tool. Nucleic Acids Res. 2004;32:D493–D496.
    5. Kent W.J., Sugnet C.W., Furey T.S., Roskin K.M., Pringle T.H., Zahler A.M., Haussler D. The human genome browser at UCSC. Genome Res. 2002;12:996–1006.