Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Jul;37(Web Server issue):W147-52.
doi: 10.1093/nar/gkp384. Epub 2009 May 25.

PosMed (Positional Medline): prioritizing genes with an artificial neural network comprising medical documents to accelerate positional cloning

Affiliations

PosMed (Positional Medline): prioritizing genes with an artificial neural network comprising medical documents to accelerate positional cloning

Yuko Yoshida et al. Nucleic Acids Res. 2009 Jul.

Abstract

PosMed (http://omicspace.riken.jp/) prioritizes candidate genes for positional cloning by employing our original database search engine GRASE, which uses an inferential process similar to an artificial neural network comprising documental neurons (or 'documentrons') that represent each document contained in databases such as MEDLINE and OMIM. Given a user-specified query, PosMed initially performs a full-text search of each documentron in the first-layer artificial neurons and then calculates the statistical significance of the connections between the hit documentrons and the second-layer artificial neurons representing each gene. When a chromosomal interval(s) is specified, PosMed explores the second-layer and third-layer artificial neurons representing genes within the chromosomal interval by evaluating the combined significance of the connections from the hit documentrons to the genes. PosMed is, therefore, a powerful tool that immediately ranks the candidate genes by connecting phenotypic keywords to the genes through connections representing not only gene-gene interactions but also other biological interactions (e.g. metabolite-gene, mutant mouse-gene, drug-gene, disease-gene and protein-protein interactions) and ortholog data. By utilizing orthologous connections, PosMed facilitates the ranking of human genes based on evidence found in other model species such as mouse. Currently, PosMed, an artificial superbrain that has learned a vast amount of biological knowledge ranging from genomes to phenomes (or 'omic space'), supports the prioritization of positional candidate genes in humans, mouse, rat and Arabidopsis thaliana.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Neural network model for the PosMed gene search algorithm. As an example, the user's keyword ‘diabetes’ can be found in several documents, including MEDLINE (Input). These documents are mapped to genes that are supported by manual curation (Concept). Using biological knowledge (e.g. protein–protein interaction, co-expression and co-citation of document sets), PosMed can also suggest genes that do not have the user's keyword ‘diabetes’ in their associated documents (Association). PosMed then returns the candidate genes that are located within the user's specified genomic interval (Output). Thereafter, the user's keyword will be highlighted in the documents (Display).
Figure 2.
Figure 2.
Output display on PosMed Search. Example search result for mouse genes against the keyword ‘diabetes OR insulin’ between 90M and 140M bp on Chr 1 in the NCBIm 36 genome. Users can apply their queries at the top of the output display (A). To select genomic interval visually, PosMed cooperates with the Flash-based genomic browser OmicBrowse (12). The tab labeled ‘All Hits’ (B) shows a list of selectable document sets to be included in the search. As a default parameter, PosMed sets ‘Associate the keyword with entities co-cited within the same sentences’. If the total number of the candidate genes is below 20, PosMed will automatically change this parameter to ‘Associate the keyword with entities co-cited within the same document’ to show more candidates (B). PosMed search results are ranked in (C). Users can download at most 300 candidate genes and their annotations from (D).
Figure 3.
Figure 3.
Detail page showing supporting documents of the inference-type search. Adipor1 related genes are listed in (A). The supporting documents for Adipor1 and Adipoq are ranked in (B).

Similar articles

Cited by

References

    1. Toyoda T, Wada A. Omic space: coordinate-based integration and analysis of genomic phenomic interactions. Bioinformatics. 2004;20:1759–1765. - PubMed
    1. Kobayashi N, Toyoda T. Statistical search on the Semantic Web. Bioinformatics. 2008;24:1002–1010. - PubMed
    1. Prud'hommeaux E, Seaborne A. SPARQL Query Language for RDF, The World Wide Web Consortium, W3C Recommendation 15 January 2008. 2008. http://www.w3.org/TR/2008/REC-rdf-sparql-query-20080115/
    1. Berners-Lee T, Hendler J, Lassila O. The semantic web. Sci. Am. 2001;28:34–43.
    1. Van Driel M, Cuelenaere K, Kemmeren P, Leunissen J, Brunner H, Vriend G. GeneSeeker: extraction and integration of human disease-related information from web-based genetic databases. Nucleic Acids Res. 2005;33:W758–W761. - PMC - PubMed

Publication types