. 2011 Mar 8;6(3):e17844.

doi: 10.1371/journal.pone.0017844.

TargetMine, an integrated data warehouse for candidate gene prioritisation and target discovery

Yi-An Chen¹, Lokesh P Tripathi, Kenji Mizuguchi

Affiliations

PMID: 21408081
PMCID: PMC3050930
DOI: 10.1371/journal.pone.0017844

TargetMine, an integrated data warehouse for candidate gene prioritisation and target discovery

Yi-An Chen et al. PLoS One. 2011.

. 2011 Mar 8;6(3):e17844.

doi: 10.1371/journal.pone.0017844.

Authors

Yi-An Chen¹, Lokesh P Tripathi, Kenji Mizuguchi

Affiliation

¹ National Institute of Biomedical Innovation, Saito-Asagi, Ibaraki, Osaka, Japan.

PMID: 21408081
PMCID: PMC3050930
DOI: 10.1371/journal.pone.0017844

Abstract

Prioritising candidate genes for further experimental characterisation is a non-trivial challenge in drug discovery and biomedical research in general. An integrated approach that combines results from multiple data types is best suited for optimal target selection. We developed TargetMine, a data warehouse for efficient target prioritisation. TargetMine utilises the InterMine framework, with new data models such as protein-DNA interactions integrated in a novel way. It enables complicated searches that are difficult to perform with existing tools and it also offers integration of custom annotations and in-house experimental data. We proposed an objective protocol for target prioritisation using TargetMine and set up a benchmarking procedure to evaluate its performance. The results show that the protocol can identify known disease-associated genes with high precision and coverage. A demonstration version of TargetMine is available at http://targetmine.nibio.go.jp/.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

**Figure 1. Schema for selected examples of newly created data sources in TargetMine.**
The data model is depicted as a class diagram in the Unified Modeling Language (http://www.uml.org). Some details of the model are ignored to reduce the complexity of the diagram.

**Figure 2. A schematic representation of the suggested objective protocol for candidate gene prioritisation with TargetMine.**

**Figure 3. Outline of the procedure for benchmarking candidate gene prioritisation on 19 sets of known disease-associated genes with TargetMine.**
TP- True positive, FP- False positive (see text for details).

**Figure 4. Benchmarking results for 19 sets of known disease-associated genes.**
(The full disease names and their abbreviations are listed in Table 2.) Each line represents the F-score for a particular disease data set as a function of the threshold (the top N significant associations considered). The error bars show the standard deviation across ten benchmarking evaluations for each disease.

See this image and copyright information in PMC

References

1. Ge H, Walhout AJ, Vidal M. Integrating ‘omic’ information: a bridge between genomics and systems biology. Trends Genet. 2003;19:551–560. - PubMed
1. Gerstein M, Lan N, Jansen R. Proteomics. Integrating interactomes. Science. 2002;295:284–287. - PubMed
1. Burgun A, Bodenreider O. Accessing and integrating data and knowledge for biomedical research. Yearb Med Inform. 2008:91–101. - PMC - PubMed
1. Chen LS, Emmert-Streib F, Storey JD. Harnessing naturally randomized transcription to infer regulatory relationships among genes. Genome Biol. 2007;8:R219. - PMC - PubMed
1. Garcia Castro A, Chen YP, Ragan MA. Information integration in molecular bioscience. Appl Bioinformatics. 2005;4:157–173. - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

TargetMine, an integrated data warehouse for candidate gene prioritisation and target discovery

Affiliation

TargetMine, an integrated data warehouse for candidate gene prioritisation and target discovery

Authors

Affiliation

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

Substances

LinkOut - more resources

Full Text Sources

Other Literature Sources