Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Mar 8;6(3):e17844.
doi: 10.1371/journal.pone.0017844.

TargetMine, an integrated data warehouse for candidate gene prioritisation and target discovery

Affiliations

TargetMine, an integrated data warehouse for candidate gene prioritisation and target discovery

Yi-An Chen et al. PLoS One. .

Abstract

Prioritising candidate genes for further experimental characterisation is a non-trivial challenge in drug discovery and biomedical research in general. An integrated approach that combines results from multiple data types is best suited for optimal target selection. We developed TargetMine, a data warehouse for efficient target prioritisation. TargetMine utilises the InterMine framework, with new data models such as protein-DNA interactions integrated in a novel way. It enables complicated searches that are difficult to perform with existing tools and it also offers integration of custom annotations and in-house experimental data. We proposed an objective protocol for target prioritisation using TargetMine and set up a benchmarking procedure to evaluate its performance. The results show that the protocol can identify known disease-associated genes with high precision and coverage. A demonstration version of TargetMine is available at http://targetmine.nibio.go.jp/.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Schema for selected examples of newly created data sources in TargetMine.
The data model is depicted as a class diagram in the Unified Modeling Language (http://www.uml.org). Some details of the model are ignored to reduce the complexity of the diagram.
Figure 2
Figure 2. A schematic representation of the suggested objective protocol for candidate gene prioritisation with TargetMine.
Figure 3
Figure 3. Outline of the procedure for benchmarking candidate gene prioritisation on 19 sets of known disease-associated genes with TargetMine.
TP- True positive, FP- False positive (see text for details).
Figure 4
Figure 4. Benchmarking results for 19 sets of known disease-associated genes.
(The full disease names and their abbreviations are listed in Table 2.) Each line represents the F-score for a particular disease data set as a function of the threshold (the top N significant associations considered). The error bars show the standard deviation across ten benchmarking evaluations for each disease.

References

    1. Ge H, Walhout AJ, Vidal M. Integrating ‘omic’ information: a bridge between genomics and systems biology. Trends Genet. 2003;19:551–560. - PubMed
    1. Gerstein M, Lan N, Jansen R. Proteomics. Integrating interactomes. Science. 2002;295:284–287. - PubMed
    1. Burgun A, Bodenreider O. Accessing and integrating data and knowledge for biomedical research. Yearb Med Inform. 2008:91–101. - PMC - PubMed
    1. Chen LS, Emmert-Streib F, Storey JD. Harnessing naturally randomized transcription to infer regulatory relationships among genes. Genome Biol. 2007;8:R219. - PMC - PubMed
    1. Garcia Castro A, Chen YP, Ragan MA. Information integration in molecular bioscience. Appl Bioinformatics. 2005;4:157–173. - PubMed

Publication types

Substances

LinkOut - more resources