Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 May 26;10(6):649.
doi: 10.3390/bioengineering10060649.

Imagined Speech Classification Using EEG and Deep Learning

Affiliations

Imagined Speech Classification Using EEG and Deep Learning

Mokhles M Abdulghani et al. Bioengineering (Basel). .

Abstract

In this paper, we propose an imagined speech-based brain wave pattern recognition using deep learning. Multiple features were extracted concurrently from eight-channel electroencephalography (EEG) signals. To obtain classifiable EEG data with fewer sensors, we placed the EEG sensors on carefully selected spots on the scalp. To decrease the dimensions and complexity of the EEG dataset and to avoid overfitting during the deep learning algorithm, we utilized the wavelet scattering transformation. A low-cost 8-channel EEG headset was used with MATLAB 2023a to acquire the EEG data. The long-short term memory recurrent neural network (LSTM-RNN) was used to decode the identified EEG signals into four audio commands: up, down, left, and right. Wavelet scattering transformation was applied to extract the most stable features by passing the EEG dataset through a series of filtration processes. Filtration was implemented for each individual command in the EEG datasets. The proposed imagined speech-based brain wave pattern recognition approach achieved a 92.50% overall classification accuracy. This accuracy is promising for designing a trustworthy imagined speech-based brain-computer interface (BCI) future real-time systems. For better evaluation of the classification performance, other metrics were considered, and we obtained 92.74%, 92.50%, and 92.62% for precision, recall, and F1-score, respectively.

Keywords: EEG decoding; LSTM; brain–computer interface (BCI); imagined speech; inner speech; wavelet scattering transformation (WST).

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
(A) Broca’s and Wernicke’s regions, (B) The electrode positions of the system. Ground and reference are fixed on the back of ears (mastoids) with a disposable sticker, (C) 8-channel EEG headset.
Figure 2
Figure 2
The recording procedure.
Figure 3
Figure 3
Sample of the recorded 8−channel raw EEG dataset at 250 Hz (250 samples per second).
Figure 4
Figure 4
Eight−channel normalized EEG dataset at 250 Hz (250 samples per second).
Figure 5
Figure 5
Eight−extracted features using wavelet scattering transformation at 250 Hz (250 samples per second).
Figure 6
Figure 6
The architecture of the LSTM model.
Figure 7
Figure 7
The data validation and loss curves during the training process of the LSTM model using MATLAB.
Figure 8
Figure 8
The performance of the designed LSTM model. The wrong predicted commands (red bars) were only 6 out of 80 (5 recordings per participant) for all participants, which leads to 92.50% accuracy in the overall prediction of the designed LSTM model.
Figure 9
Figure 9
The confusion matrix for the classification of the four imagined speech commands. The rows represent the predicted class and the columns represent the true class. The diagonal (green) cells correspond to observations that are correctly classified. The off-diagonal (red) cells correspond to incorrectly classified observations. Both the number of observations and the percentage of the total number of observations are shown in each cell. The column on the far right of the plot shows the percentages of all the samples predicted to belong to each class that are correctly and incorrectly classified. The row at the bottom of the plot shows the percentages of all the samples belonging to each class that are correctly and incorrectly classified.

References

    1. Panachakel J.T., Vinayak N.N., Nunna M., Ramakrishnan A.G., Sharma K. Advances in Communication Systems and Networks. Springer; Berlin/Heidelberg, Germany: 2020. An improved EEG acquisition protocol facilitates localized neural activation; pp. 267–281.
    1. Abdulkader S.N., Atia A., Mostafa M.-S.M. Brain-computer interfacing: Applications and challenges. Egypt. Inform. J. 2015;16:213–230. doi: 10.1016/j.eij.2015.06.002. - DOI
    1. Abdulghani M.M., Walters W.L., Abed K.H. EEG Classifier Using Wavelet Scattering Transform-Based Features and Deep Learning for Wheelchair Steering; Proceedings of the 2022 International Conference on Computational Science and Computational Intelligence—Artificial Intelligence (CSCI'22–AI), IEEE Conference Publishing Services (CPS); Las Vegas, NV, USA. 14–16 December 2022.
    1. Al-Aubidy K.M., Abdulghani M.M. Wheelchair Neuro Fuzzy Control Using Brain Computer Interface; Proceedings of the 12th International Conference on Developments in eSystems Engineering (DeSE); Kazan, Russia. 7–10 October 2019; pp. 640–645.
    1. Al-Aubidy K.M., Abdulghani M.M. Advanced Systems for Biomedical Applications. Springer; Cham, Switzerland: 2021. Towards Intelligent Control of Electric Wheelchairs for Physically Challenged People; p. 225. - DOI

LinkOut - more resources