Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Multicenter Study
. 2020 Jan 1;77(1):49-57.
doi: 10.1001/jamaneurol.2019.3531.

Interrater Reliability of Experts in Identifying Interictal Epileptiform Discharges in Electroencephalograms

Affiliations
Multicenter Study

Interrater Reliability of Experts in Identifying Interictal Epileptiform Discharges in Electroencephalograms

Jin Jing et al. JAMA Neurol. .

Erratum in

Abstract

Importance: The validity of using electroencephalograms (EEGs) to diagnose epilepsy requires reliable detection of interictal epileptiform discharges (IEDs). Prior interrater reliability (IRR) studies are limited by small samples and selection bias.

Objective: To assess the reliability of experts in detecting IEDs in routine EEGs.

Design, setting, and participants: This prospective analysis conducted in 2 phases included as participants physicians with at least 1 year of subspecialty training in clinical neurophysiology. In phase 1, 9 experts independently identified candidate IEDs in 991 EEGs (1 expert per EEG) reported in the medical record to contain at least 1 IED, yielding 87 636 candidate IEDs. In phase 2, the candidate IEDs were clustered into groups with distinct morphological features, yielding 12 602 clusters, and a representative candidate IED was selected from each cluster. We added 660 waveforms (11 random samples each from 60 randomly selected EEGs reported as being free of IEDs) as negative controls. Eight experts independently scored all 13 262 candidates as IEDs or non-IEDs. The 1051 EEGs in the study were recorded at the Massachusetts General Hospital between 2012 and 2016.

Main outcomes and measures: Primary outcome measures were percentage of agreement (PA) and beyond-chance agreement (Gwet κ) for individual IEDs (IED-wise IRR) and for whether an EEG contained any IEDs (EEG-wise IRR). Secondary outcomes were the correlations between numbers of IEDs marked by experts across cases, calibration of expert scoring to group consensus, and receiver operating characteristic analysis of how well multivariate logistic regression models may account for differences in the IED scoring behavior between experts.

Results: Among the 1051 EEGs assessed in the study, 540 (51.4%) were those of females and 511 (48.6%) were those of males. In phase 1, 9 experts each marked potential IEDs in a median of 65 (interquartile range [IQR], 28-332) EEGs. The total number of IED candidates marked was 87 636. Expert IRR for the 13 262 individually annotated IED candidates was fair, with the mean PA being 72.4% (95% CI, 67.0%-77.8%) and mean κ being 48.7% (95% CI, 37.3%-60.1%). The EEG-wise IRR was substantial, with the mean PA being 80.9% (95% CI, 76.2%-85.7%) and mean κ being 69.4% (95% CI, 60.3%-78.5%). A statistical model based on waveform morphological features, when provided with individualized thresholds, explained the median binary scores of all experts with a high degree of accuracy of 80% (range, 73%-88%).

Conclusions and relevance: This study's findings suggest that experts can identify whether EEGs contain IEDs with substantial reliability. Lower reliability regarding individual IEDs may be largely explained by various experts applying different thresholds to a common underlying statistical model.

PubMed Disclaimer

Conflict of interest statement

Conflict of Interest Disclosures: Dr Muniz reported being issued US patent 10,349,888. Dr Chu reported receiving grants from the National Institutes of Health (NIH) and being a paid consultant to Alliance Family of Companies, Biogen, and SleepMed. Dr Westover reported receiving grants from the NIH. No other disclosures were reported.

Figures

Figure 1.
Figure 1.. Flow Diagram of the Study
EEG indicates electroencephalogram; IEDs, interictal epileptiform discharges.
Figure 2.
Figure 2.. Interrater Reliability (IRR) for Interictal Epileptiform Discharges (IEDs) at the Level of Individual IEDs and Entire Electroencephalograms (EEGs)
A, The 8 rows each contain 10 randomly selected samples scored by 8 experts as being IEDs. Qualitatively, expert IRR increases in proportion to the degree that candidate waves exhibit the morphological features of IEDs as defined by IFSECN (International Federation of Societies for EEG and Clinical Neurophysiology) criteria. B, The IED-wise percentage of agreement between pairs of experts. C, The EEG-wise percentage of agreement among pairs of experts. D, The EEGs are arranged in order of the mean number of IEDs marked by all experts, and experts are arranged from top to bottom in order of the total number of IEDs they marked across all EEGs. E and F, The IEDs were binned according to the number of votes (0 through 8) that they received (termed the reference standard). The 8 calibration curves, 1 for each expert, indicate the probability of that expert marking events within a given bin as IEDs. These curves allow assessment of the variation among experts relative to the group consensus.
Figure 3.
Figure 3.. Morphological Characteristics of the Interictal Epileptiform Discharges (IEDs)
A, For each candidate IED, 5 fiducial points are identified (triangles) corresponding to the IED peak, troughs preceding and following the peak, peak of the after-going slow wave, and trough following the slow-wave peak. These feature values are used to construct a single multivariate logistic regression (MLR) model to evaluate binary IED scores for all experts combined (combined model) and individualized MLR models to evaluate the IED scores of individual experts (individualized models). B, Receiver operating characteristic curve for the MLR model fit to the scores of all experts (universal model). The operating point (false-positive rate = 1 − specificity and sensitivity) of each expert, corresponding to the threshold in the combined model that best evaluates that expert’s binary scores, is indicated by a solid circle. The number by each circle is the threshold that best evaluates that expert’s binary scores. A indicates the area under the IED curve; Ap, area under the peak; As, area under the slow wave; AUC, area under the curve; D, duration of the IED candidate wave; Dp, duration of the peak; Dpf, duration of the falling half-wave of peak; Dpr, duration of the rising half-wave of peak; Ds, duration of the slow wave; Dsf, duration of the falling half-wave of the slow wave; Dsr, duration of the rising half-wave of the slow wave; Ref, reference; Spf, slope of the falling half-wave of peak; Ssf, slope of the falling half-wave of the slow wave; Spr, slope of the rising half-wave of peak; Ssr, slope of the rising half-wave of the slow wave; Vp0, peak voltage with respect to baseline 0 defined by V = 0; Vp1, peak voltage with respect to baseline 1 defined by Von; Vp2, peak voltage with respect to baseline 2 defined by trough Vtr; Vs0, slow-wave peak voltage with respect to baseline 0 defined by V = 0; Vs1, slow-wave peak voltage with respect to baseline 1 defined by onset Von; Vs2, slow-wave peak voltage with respect to baseline 2 defined by trough Vtr; Vt0, trough voltage with respect to baseline 0 defined by V = 0; and Vt1, trough voltage with respect to baseline 1 defined by onset Von.

Comment in

References

    1. Seidel S, Pablik E, Aull-Watschinger S, Seidl B, Pataraia E. Incidental epileptiform discharges in patients of a tertiary centre. Clin Neurophysiol. 2016;127(1):102-107. doi:10.1016/j.clinph.2015.02.056 - DOI - PubMed
    1. van Donselaar CA, Schimsheimer RJ, Geerts AT, Declerck AC. Value of the electroencephalogram in adult patients with untreated idiopathic first seizures. Arch Neurol. 1992;49(3):231-237. doi:10.1001/archneur.1992.00530270045017 - DOI - PubMed
    1. Fountain NB, Freeman JM. EEG is an essential clinical tool: pro and con. Epilepsia. 2006;47(suppl 1):23-25. doi:10.1111/j.1528-1167.2006.00655.x - DOI - PubMed
    1. Smith SJM. EEG in the diagnosis, classification, and management of patients with epilepsy. J Neurol Neurosurg Psychiatry. 2005;76(suppl 2):ii2-ii7. doi:10.1136/jnnp.2005.069245 - DOI - PMC - PubMed
    1. Pellegrino G, Hedrich T, Chowdhury R, et al. . Source localization of the seizure onset zone from ictal EEG/MEG data. Hum Brain Mapp. 2016;37(7):2528-2546. doi:10.1002/hbm.23191 - DOI - PMC - PubMed

Publication types