BMC Bioinformatics. 2007 Aug 6;8:287.
doi: 10.1186/1471-2105-8-287.

OReFiL: an online resource finder for life sciences


Yasunori Yamamoto et al. BMC Bioinformatics.

Abstract

Background: Many online resources for the life sciences have recently been developed and introduced in peer-reviewed papers, ranging from databases and web applications to data-analysis software. Some have been introduced in special journal issues or on websites with a search function, but others remain scattered throughout the Internet and the published literature. The searchable resources on these sites are collected and maintained manually and are therefore of higher quality than those on automatically updated sites, but they also require more time and effort to maintain.

Description: We developed an online resource search system called OReFiL to address these issues. We developed a crawler to gather all of the web pages whose URLs appear in MEDLINE abstracts and in full-text papers in the BioMed Central open-access journals. The URLs were extracted using regular expressions and rules based on our heuristic knowledge. We then indexed the online resources to facilitate their retrieval and comparison by researchers. Because every online resource has at least one PubMed ID, we can easily acquire its summary with Medical Subject Headings (MeSH) terms and confirm its credibility through reference to the corresponding PubMed entry. In addition, because OReFiL automatically extracts URLs and updates the index, minimal time and effort are needed to maintain the system.
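The extraction and indexing steps described above can be illustrated with a minimal Python sketch. The abstract does not give OReFiL's actual regular expressions or heuristic rules, so everything here is an assumption made for illustration: the pattern `URL_PATTERN`, the clean-up rules (trimming sentence punctuation and unbalanced closing brackets), and the helper names `extract_urls` and `index_by_url`. The PubMed IDs and URLs in the usage note are placeholders.

```python
import re

# Assumed pattern: http/https/ftp URLs plus bare "www." hosts.
# The real OReFiL expressions are not given in the abstract.
URL_PATTERN = re.compile(r"(?:(?:https?|ftp)://|www\.)[^\s<>\"']+", re.IGNORECASE)

TRAILING_PUNCTUATION = ".,;:!?"
BRACKET_PAIRS = {")": "(", "]": "[", "}": "{"}


def extract_urls(text):
    """Return the distinct URLs found in text, in order of first appearance."""
    urls = []
    for match in URL_PATTERN.finditer(text):
        url = match.group(0).rstrip(TRAILING_PUNCTUATION)
        # Heuristic: a closing bracket with no matching opener inside the
        # URL most likely belongs to the surrounding sentence.
        while url and url[-1] in BRACKET_PAIRS:
            closer = url[-1]
            if url.count(BRACKET_PAIRS[closer]) < url.count(closer):
                url = url[:-1].rstrip(TRAILING_PUNCTUATION)
            else:
                break
        if "." not in url:
            continue  # nothing host-like is left after clean-up
        if url.lower().startswith("www."):
            url = "http://" + url  # assume http for scheme-less hosts
        if url not in urls:
            urls.append(url)
    return urls


def index_by_url(abstracts):
    """Map each extracted URL to the sorted PubMed IDs whose text mentions it.

    abstracts is a dict of PubMed ID (str) to abstract text (str).
    """
    index = {}
    for pmid, text in abstracts.items():
        for url in extract_urls(text):
            index.setdefault(url, set()).add(pmid)
    return {url: sorted(pmids) for url, pmids in index.items()}
```

For example, calling `index_by_url({"10000001": "Available at http://example.org/tool/.", "10000002": "See (www.example.com/db)."})` associates each cleaned URL with the placeholder PubMed ID of the abstract that mentions it, which mirrors the property that every indexed resource has at least one PubMed ID.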

Conclusion: We developed OReFiL, a search system for online life science resources, which is freely available. The system's distinctive features include the ability to return up-to-date query-relevant online resources introduced in peer-reviewed papers; the ability to search using free words, MeSH terms, or author names; easy verification of each hit following links to the corresponding PubMed entry or to papers citing the URL through the search systems of BioMed Central, Scirus, HighWire Press, or Google Scholar; and quick confirmation of the existence of an online resource web page.


Figures

Figure 1
MeSH term distribution. Distribution of MeSH terms at the second level of the hierarchy among those annotated to all the retrievable MEDLINE abstracts. Note that the following categories were excluded: "L" (Information Science), "V" (Publication Components), and "Z" (Geographic Locations).
Figure 2
Screen image of OReFiL. This image shows the search result for the query "protein protein interaction". MeSH terms annotated to the MEDLINE abstracts in the hit list, together with their conceptual ancestors in the MeSH hierarchy, are displayed in alphabetical order in the MeSH term box (encircled by a dotted line), and the font size of each term reflects its frequency. MeSH terms can also be used to filter the result, narrowing it down to those entries that have a specified MeSH term. Clicking a MeSH term in the box adds it to the query and narrows the result; clicking the same MeSH term a second time removes it from the query.
Figure 3
Growth of online resources and MEDLINE. The number of URLs appearing in MEDLINE abstracts. The number of DNS-resolvable URLs is the number of URLs whose server name can be resolved. The number of page-accessible URLs is the number of URLs whose page can be accessed (the server returns the HTTP status code 200). MEDLINE growth is added for reference.
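The two URL categories counted in Figure 3 can be sketched as a small check in Python. This is only an illustration under stated assumptions, not the crawler used by OReFiL: the function names `resolves`, `fetch_status`, and `classify_url`, the 10-second timeout, and the label strings are all hypothetical. Note also that `urllib.request.urlopen` follows redirects, so a redirected page that ends in status 200 counts as accessible here, and that this sketch labels each URL with the strongest category it satisfies.

```python
import socket
import urllib.error
import urllib.request
from urllib.parse import urlsplit


def resolves(hostname):
    """Return True if the server name can be resolved through DNS."""
    try:
        socket.gethostbyname(hostname)
        return True
    except (OSError, UnicodeError):
        return False


def fetch_status(url, timeout=10):
    """Return the HTTP status code for url, or None if no response arrived."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as error:
        return error.code  # the server answered with an error status
    except (urllib.error.URLError, OSError, ValueError):
        return None


def classify_url(url, resolver=resolves, fetcher=fetch_status):
    """Label url as 'unresolvable', 'dns-resolvable', or 'page-accessible'.

    resolver and fetcher are injectable so the logic can be exercised
    without network access.
    """
    try:
        hostname = urlsplit(url).hostname
    except ValueError:
        hostname = None
    if not hostname or not resolver(hostname):
        return "unresolvable"
    if fetcher(url) == 200:
        return "page-accessible"
    return "dns-resolvable"
```

Passing stub functions for `resolver` and `fetcher` makes the classification testable offline; with the defaults, each call performs a live DNS lookup and an HTTP request.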

