Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation

The Lung Image Database Consortium (LIDC) and Image Database Resource Initiative (IDRI): a completed reference database of lung nodules on CT scans

Samuel G Armato 3rd et al. Med Phys. 2011 Feb.

Abstract

Purpose: The development of computer-aided diagnostic (CAD) methods for lung nodule detection, classification, and quantitative assessment can be facilitated through a well-characterized repository of computed tomography (CT) scans. The Lung Image Database Consortium (LIDC) and Image Database Resource Initiative (IDRI) completed such a database, establishing a publicly available reference for the medical imaging research community. Initiated by the National Cancer Institute (NCI), further advanced by the Foundation for the National Institutes of Health (FNIH), and accompanied by the Food and Drug Administration (FDA) through active participation, this public-private partnership demonstrates the success of a consortium founded on a consensus-based process.

Methods: Seven academic centers and eight medical imaging companies collaborated to identify, address, and resolve challenging organizational, technical, and clinical issues to provide a solid foundation for a robust database. The LIDC/IDRI Database contains 1018 cases, each of which includes images from a clinical thoracic CT scan and an associated XML file that records the results of a two-phase image annotation process performed by four experienced thoracic radiologists. In the initial blinded-read phase, each radiologist independently reviewed each CT scan and marked lesions belonging to one of three categories ("nodule > or =3 mm," "nodule <3 mm," and "non-nodule > or =3 mm"). In the subsequent unblinded-read phase, each radiologist independently reviewed their own marks along with the anonymized marks of the three other radiologists to render a final opinion. The goal of this process was to identify as completely as possible all lung nodules in each CT scan without requiring forced consensus.

Results: The Database contains 7371 lesions marked "nodule" by at least one radiologist. 2669 of these lesions were marked "nodule > or =3 mm" by at least one radiologist, of which 928 (34.7%) received such marks from all four radiologists. These 2669 lesions include nodule outlines and subjective nodule characteristic ratings.

Conclusions: The LIDC/IDRI Database is expected to provide an essential medical imaging research resource to spur CAD development, validation, and dissemination in clinical practice.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Examples of lesions considered to satisfy the LIDC∕IDRI definition of (a) a nodule≥3 mm, (b) a nodule<3 mm, and (c) a non-nodule≥3 mm (reprinted with permission from Ref. 29).
Figure 2
Figure 2
(a) A lesion considered to be a nodule≥3 mm by all four LIDC∕IDRI radiologists. (b) The nested outline of one radiologist reflects the radiologist’s opinion that a region of exclusion (a dilated bronchus) exists within the nodule. The inner outline is explicitly noted as an exclusion in the XML file. Each outline is an “outer border” so that neither outline is meant to overlap pixels interpreted as belonging to the nodule.
Figure 3
Figure 3
(a) A lesion considered to be a nodule≥3 mm by two LIDC∕IDRI radiologists and a nodule<3 mm or non-nodule≥3 mm by the other two radiologists. (b) A lesion identified as a nodule≥3 mm (arrow) by three LIDC∕IDRI radiologists but assigned no mark at all by the fourth radiologist (reprinted with permission from Ref. 36).
Figure 4
Figure 4
Distributions depicting the proportions of the 7371 nodules that were (1) marked as a nodule by different numbers of radiologists (gray) or (2) assigned any mark at all (including non-nodule≥3 mm) by different numbers of radiologists (black).
Figure 5
Figure 5
Distributions depicting the proportions of the 2669 lesions marked by at least one radiologist as a nodule≥3 mm that were marked as such by different numbers of radiologists.
Figure 6
Figure 6
Examples of lesions marked as a nodule≥3 mm (a) by only a single radiologist (the other three radiologists identified this lesion as a non-nodule≥3 mm) and (b) by all four radiologists.
Figure 7
Figure 7
Distributions depicting the proportions of the 2669 lesions marked by at least one radiologist as a nodule≥3 mm that were marked as either a nodule≥3 mm or a nodule<3 mm by different numbers of radiologists.
Figure 8
Figure 8
(a) A lesion identified by three radiologists as a single nodule≥3 mm that was considered to be two separate nodules≥3 mm by the fourth radiologist. [(b) and (c)] The outlines constructed on this section by two of the radiologists.
Figure 9
Figure 9
A lesion identified by one radiologist as a single nodule≥3 mm that was considered to be a nodule≥3 mm (arrowhead) and a separate nodule<3 mm (arrow) by another radiologist and a non-nodule≥3 mm (arrowhead) and a separate nodule<3 mm (arrow) by two other radiologists.
Figure 10
Figure 10
Examples of differences in radiologists’ interpretation of nodule≥3 mm boundaries. (a) In-plane outlines differ between two radiologists in a single CT section. (b) A lesion depicted in two adjacent CT sections that is outlined by all four radiologists in the more superior section (left) but only by two radiologists in the more inferior section (right) (outlines not shown). (c) A nodule outline for which a portion (arrow) encloses no nodule pixels based on the outer border definition.

References

    1. Nishikawa R. M., “Design of a common database for research in mammogram image analysis,” Proc. SPIE PSISDG 1905, 548–549 (1993).10.1117/12.148620 - DOI
    1. Kallergi M., Clark R. A., and Clarke L. P., “Medical image databases for CAD applications in digital mammography: Design issues,” Stud. Health Technol. Inform. ZZZZZZ 43, 601–605 (1997). - PubMed
    1. Nishikawa R. M. and Yarusso L. M., “Variations in measured performance of CAD schemes due to database composition and scoring protocol,” Proc. SPIE PSISDG 3338, 840–844 (1998).10.1117/12.310894 - DOI
    1. Nishikawa R. M. et al., “Effect of case selection on the performance of computer-aided detection schemes,” Med. Phys. MPHYA6 21, 265–269 (1999).10.1118/1.597287 - DOI - PubMed
    1. Bowyer K.et al., in “The digital database for screening mammography,” Digital Mammography ‘96: Proceedings of the Third International Workshop on Digital Mammography, edited by Doi K., Giger M. L., Nishikawa R. M., and Schmidt R. A. (Elsevier, Amsterdam, 1996), pp. 431–434.

Publication types

MeSH terms