Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Mar-Apr;15(2):138-49.
doi: 10.1197/jamia.M2522. Epub 2007 Dec 20.

caGrid 1.0: an enterprise Grid infrastructure for biomedical research

Affiliations

caGrid 1.0: an enterprise Grid infrastructure for biomedical research

Scott Oster et al. J Am Med Inform Assoc. 2008 Mar-Apr.

Abstract

Objective: To develop software infrastructure that will provide support for discovery, characterization, integrated access, and management of diverse and disparate collections of information sources, analysis methods, and applications in biomedical research.

Design: An enterprise Grid software infrastructure, called caGrid version 1.0 (caGrid 1.0), has been developed as the core Grid architecture of the NCI-sponsored cancer Biomedical Informatics Grid (caBIG) program. It is designed to support a wide range of use cases in basic, translational, and clinical research, including 1) discovery, 2) integrated and large-scale data analysis, and 3) coordinated study.

Measurements: The caGrid is built as a Grid software infrastructure and leverages Grid computing technologies and the Web Services Resource Framework standards. It provides a set of core services, toolkits for the development and deployment of new community provided services, and application programming interfaces for building client applications.

Results: The caGrid 1.0 was released to the caBIG community in December 2006. It is built on open source components and caGrid source code is publicly and freely available under a liberal open source license. The core software, associated tools, and documentation can be downloaded from the following URL: https://cabig.nci.nih.gov/workspaces/Architecture/caGrid.

Conclusions: While caGrid 1.0 is designed to address use cases in cancer research, the requirements associated with discovery, analysis and integration of large scale data, and coordinated studies are common in other biomedical fields. In this respect, caGrid 1.0 is the realization of a framework that can benefit the entire biomedical community.

PubMed Disclaimer

Figures

Figure 1
Figure 1
The caGrid 1.0 infrastructure and environment. The core caGrid services include the security services (Dorian, Grid Trust Service (GTS), and Grid Grouper), metadata services (Index Service, Global Model Exchange (GME), Enterprise Vocabulary Services (EVS), and cancer Data Standards Repository (caDSR)), and high level services such as the Federated Query Processing service (FQP) and the Workflow services. In addition to the core services, data services and analytical services (e.g., caArray and caBIO services in the figure), which are provided by research groups, institutions, individual researchers, can be discovered and securely accessed using the caGrid core services and protocols. Common invocation patterns are indicated by directional arrows.
Figure 2
Figure 2
The GAARDS security infrastructure and its deployment in a multi-institutional environment. It consists of Dorian for federation and management of grid credentials, the Grid Trust Services for management of a trust fabric in the environment, and Grid Grouper for facilitating authorization and access control. When a Grid client wants to access a secure caGrid data or analytical service, the client can use his/her local authentication mechanisms and Dorian to get a temporary grid credential (based on the long term certificate and private key of the client, which are managed by Dorian). The client can then interact with the secure caGrid service and invoke its methods with his/her grid credentials. The caGrid service may contact the Grid Trust Services infrastructure to check if the client’s credentials are valid (i.e., they are not revoked; Certificate Revocation Lists: CRLs are used to keep track of revoked certificates) and if the credentials are issued by a trusted identity provider. If the authentication of the client is successful, the caGrid service may contact the Grid Grouper service to obtain the client’s group information upon which access control policies may be defined. Depending on the authorization information, the caGrid service may deny the client’s access to some or all service methods or the data served by the service.

Similar articles

Cited by

References

    1. Saltz J, Oster S, Hastings S, et al. caGrid: Design and Implementation of the Core Architecture of the Cancer Biomedical Informatics Grid Bioinform 2006;22(15):1910-1916. - PubMed
    1. Foster I. Globus Toolkit Version 4: Software for Service-Oriented Systems J Comp Sci Technol 2006;21(4):523-530.
    1. Grethe JS, Baru C, Gupta A, James M, Ludaescher B, Martone ME, et al. Biomedical Informatics Research Network: Building a National Collaboratory to Hasten the Derivation of New Understanding and Treatment of Disease Stud Health Technol Inform 2005;112:100-109. - PubMed
    1. Peltier ST, Ellisman MH. The Biomedical Informatics Research NetworkIn: Foster I, Kesselman C, editors. The Grid 2, Blueprint for a New Computing Infrastructure. 2nd ed. Morgan Kaufmann; 2003.
    1. Foster I, Czajkowski K, Ferguson D, et al. Modeling and Managing State in Distributed Systems: The Role of OGSI and WSRF Proc IEEE 2005;93(3):604-612.

Publication types