Clin Pharmacol Ther. 2008 Sep;84(3):362-9.

doi: 10.1038/clpt.2008.89. Epub 2008 May 21.

Development of a large-scale de-identified DNA biobank to enable personalized medicine

D M Roden¹, J M Pulley, M A Basford, G R Bernard, E W Clayton, J R Balser, D R Masys

PMID: 18500243
PMCID: PMC3763939
DOI: 10.1038/clpt.2008.89

D M Roden et al. Clin Pharmacol Ther. 2008 Sep.

Affiliation

¹ Office of Personalized Medicine, Vanderbilt University School of Medicine, Nashville, Tennessee, USA. dan.roden@vanderbilt.edu

Abstract

Our objective was to develop a DNA biobank linked to phenotypic data derived from an electronic medical record (EMR) system. An "opt-out" model was implemented after significant review and revision. The plan included (i) development and maintenance of a de-identified mirror image of the EMR, namely, the "synthetic derivative" (SD) and (ii) DNA extracted from discarded blood samples and linked to the SD. Surveys of patients indicated general acceptance of the concept, with only a minority (approximately 5%) opposing it. As a result, mechanisms to facilitate opt-out included publicity and revision of a standard "consent to treatment" form. Algorithms for sample handling and procedures for de-identification were developed and validated in order to ensure acceptable error rates (<0.3 and <0.1%, respectively). The rate of sample accrual is 700-900 samples/week. The advantages of this approach are the rate of sample acquisition and the diversity of phenotypes based on EMRs.

Conflict of interest statement

The authors declared no conflict of interest.

Figures

**Figure 1**
Examples of under- and overmarking. The original text is shown on the left and the result of the scrubbing process is shown in the middle. The target text and the result of scrubbing are highlighted in red. (Lortab; Mikart, Atlanta, GA.)

**Figure 2**
A descriptive example of a record in the synthetic derivative (SD) described in the text. The arrows indicate examples of scrubbing: the medical record number has been removed (black), the social security and phone numbers have been masked (blue), names have been changed (purple), and dates have been shifted (red) as described in Methods.
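The masking and date shifting named in this caption can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' scrubbing procedure: the regular expressions, the `scrub` function, the mask strings, and the single per-record `shift_days` offset are all hypothetical, and name replacement is left out.

```python
import re
from datetime import datetime, timedelta

# Hypothetical patterns for US-style identifiers and MM/DD/YYYY dates.
SSN_RE = re.compile(r"\b\d{3}-\d{2}-\d{4}\b")
PHONE_RE = re.compile(r"\b\d{3}-\d{3}-\d{4}\b")
DATE_RE = re.compile(r"\b\d{2}/\d{2}/\d{4}\b")


def scrub(text: str, shift_days: int) -> str:
    """Mask SSN-like and phone-like numbers and shift dates by a fixed offset."""
    text = SSN_RE.sub("XXX-XX-XXXX", text)
    text = PHONE_RE.sub("XXX-XXX-XXXX", text)

    def shift(match: re.Match) -> str:
        try:
            original = datetime.strptime(match.group(0), "%m/%d/%Y")
        except ValueError:
            # Not a valid calendar date; leave the text unchanged.
            return match.group(0)
        return (original + timedelta(days=shift_days)).strftime("%m/%d/%Y")

    return DATE_RE.sub(shift, text)


note = "Seen 01/30/2008; SSN 123-45-6789; phone 615-555-0100."
print(scrub(note, shift_days=-10))
```

Applying the same offset to every date in one record keeps the intervals between events intact, which is presumably why shifting is preferred over deleting dates outright.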

**Figure 3**
Synthetic derivative interrogation tool. Search criteria are entered in the blue box, and entries and potential records are returned, with the “keywords in context” shown below. The user then has the option of including the record in the sample set to be analyzed. (Ciprofloxacin; Bayer HealthCare, West Haven, CT.)
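A "keywords in context" view of the kind this caption mentions can be approximated in a few lines. The sketch below is only an illustration of the general idea; the function name `keywords_in_context` and the `width` parameter are assumptions and do not describe the actual interrogation tool.

```python
import re


def keywords_in_context(text: str, keyword: str, width: int = 30) -> list[str]:
    """Return each case-insensitive match of `keyword` with up to `width`
    characters of surrounding text on either side."""
    snippets = []
    for match in re.finditer(re.escape(keyword), text, flags=re.IGNORECASE):
        start = max(0, match.start() - width)
        end = min(len(text), match.end() + width)
        snippets.append(text[start:end])
    return snippets


record = "Patient started ciprofloxacin for UTI."
for snippet in keywords_in_context(record, "Ciprofloxacin", width=8):
    print(snippet)
```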

**Figure 4**
Mechanism for linking DNA samples and patient-related information in a de-identified fashion. The approach depends on the use of a one-way hash, an algorithm that always generates the same 128-character code (the research unique identifier, RUI) when the same medical record number is used as input. The medical record number on barcoded blood samples that are about to be discarded is scanned, eligible samples are relabeled with the RUI, and DNA is extracted and stored. The medical record number in each patient’s record is replaced by the RUI, and the record is de-identified to create the synthetic derivative described in the text.
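The linking step in this caption can be illustrated with a short sketch. The caption does not name the hash algorithm; SHA-512 is used here purely as an assumption, because its hexadecimal digest is 128 characters long. The function name `research_unique_identifier` and the sample data are hypothetical.

```python
import hashlib


def research_unique_identifier(mrn: str) -> str:
    """Derive a 128-character code from a medical record number.

    Assumption: SHA-512 stands in for the unnamed one-way hash; its hex
    digest has 128 characters and is identical for identical input.
    """
    return hashlib.sha512(mrn.encode("utf-8")).hexdigest()


# The same medical record number is hashed on the sample side and the
# record side, so the two can be joined without storing the number itself.
sample_label = research_unique_identifier("00123456")
record_key = research_unique_identifier("00123456")

print(len(sample_label))           # 128
print(sample_label == record_key)  # True
```

An unkeyed hash over a small identifier space could in principle be reversed by hashing every possible record number, so a production system would likely need a secret key or comparable safeguard; that detail is outside what the caption states.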

**Figure 5**
Program review. The program plan was reviewed by the Office for Human Research Protections (OHRP) and the Institutional Review Board (IRB). The IRB recommended further review from the standpoint of ethics, and the Ethics Review recommendations included the formation of a Community Advisory Board. The IRB, Ethics, and Community reviews resulted in program revisions, and these are ongoing.
