Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Aug 31;18(9):2892.
doi: 10.3390/s18092892.

Feature Representation and Data Augmentation for Human Activity Classification Based on Wearable IMU Sensor Data Using a Deep LSTM Neural Network

Affiliations

Feature Representation and Data Augmentation for Human Activity Classification Based on Wearable IMU Sensor Data Using a Deep LSTM Neural Network

Odongo Steven Eyobu et al. Sensors (Basel). .

Abstract

Wearable inertial measurement unit (IMU) sensors are powerful enablers for acquisition of motion data. Specifically, in human activity recognition (HAR), IMU sensor data collected from human motion are categorically combined to formulate datasets that can be used for learning human activities. However, successful learning of human activities from motion data involves the design and use of proper feature representations of IMU sensor data and suitable classifiers. Furthermore, the scarcity of labelled data is an impeding factor in the process of understanding the performance capabilities of data-driven learning models. To tackle these challenges, two primary contributions are in this article: first; by using raw IMU sensor data, a spectrogram-based feature extraction approach is proposed. Second, an ensemble of data augmentations in feature space is proposed to take care of the data scarcity problem. Performance tests were conducted on a deep long term short term memory (LSTM) neural network architecture to explore the influence of feature representations and the augmentations on activity recognition accuracy. The proposed feature extraction approach combined with the data augmentation ensemble produces state-of-the-art accuracy results in HAR. A performance evaluation of each augmentation approach is performed to show the influence on classification accuracy. Finally, in addition to using our own dataset, the proposed data augmentation technique is evaluated against the University of California, Irvine (UCI) public online HAR dataset and yields state-of-the-art accuracy results at various learning rates.

Keywords: data augmentation; deep learning; feature representation; human activity recognition; inertial measurement unit sensor; long short term memory.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Human activity recognition system workflow.
Figure 2
Figure 2
Data collection architecture.
Figure 3
Figure 3
An example of 3D raw data for (a) sitting and (b) walking based on 1IMU sensor tied on the left-hand wrist.
Figure 4
Figure 4
(a) An example of time domain data and their spectrogram representing walking data; (b) Proposed feature extraction algorithm.
Figure 5
Figure 5
Data augmentation workflow.
Figure 6
Figure 6
Set 1: OR dataset.
Figure 7
Figure 7
Set 1: Generating local averages.
Figure 8
Figure 8
Set 2: OR + LA1 dataset.
Figure 9
Figure 9
Randomly shuffled (OR + LA1 + SH) feature set 2.
Figure 10
Figure 10
Generating local averages of the shuffled feature set.
Figure 11
Figure 11
Set 3: OR + LA1 + SH + LA2 feature set with local averages.
Figure 12
Figure 12
Visualizing the variance of data to check the influence of each data augmentation block. (a,b,c) represent the variance of the unaugmented dataset, augmented dataset after the first local averaging and that of the augmented dataset after the first local averaging, shuffling and the second local averaging procedure respectively for the standing activity. (d,e,(f) represent the variance of the unaugmented dataset, augmented dataset after the first local averaging and that of the augmented dataset after the first local averaging, shuffling and the second local averaging procedure respectively for the sitting activity. (g,h,i) represent the variance of the unaugmented dataset, augmented dataset after the first local averaging and that of the augmented dataset after the first local averaging, shuffling and the second local averaging procedure respectively for the walking activity.
Figure 13
Figure 13
LSTM cell.
Figure 14
Figure 14
(a) Accuracy versus learning rate based on only the OR dataset without augmentation, (b) precision versus learning rate based on only the OR dataset without augmentation, (c) recall versus learning rate based on only the OR dataset without augmentation and (d) f1_score versus learning rate based on only the OR dataset without augmentation.
Figure 15
Figure 15
100-feature vector dataset: (a) Accuracy versus learning rate, (b) precision versus learning rate, (c) recall versus learning rate and (d) f1_score versus learning rate.
Figure 16
Figure 16
200-feature vector dataset: (a) Accuracy versus learning rate, (b) precision versus learning rate, (c) recall versus learning rate and (d) f1_score versus learning rate.
Figure 17
Figure 17
UCI’s 128-feature vector dataset: (a) Accuracy versus learning rate, (b) precision versus learning rate, (c) recall versus learning rate and (d) f1_score versus learning rate.
Figure 17
Figure 17
UCI’s 128-feature vector dataset: (a) Accuracy versus learning rate, (b) precision versus learning rate, (c) recall versus learning rate and (d) f1_score versus learning rate.
Figure 18
Figure 18
Comparing the performance of OR and OR + LA1 using the UCI dataset and our dataset at various learning rates of: (a) 0.0002 (b) 0.003 (c) 0.006 and (d) 0.01.

Similar articles

Cited by

References

    1. Debes C., Merentitis A., Sukhanov S., Niessen M., Frangiadakis N., Bauer A. Monitoring activities of daily living in smart homes: Understanding human behavior. IEEE Signal Process. Mag. 2016;33:81–94. doi: 10.1109/MSP.2015.2503881. - DOI
    1. Reiss A., Stricker D. Introducing a new benchmarked dataset for activity monitoring; Proceedings of the 2012 16th International Symposium on Wearable Computers (ISWC); Newcastle, UK. 18–22 June 2012; pp. 108–109.
    1. Sukkarieh S., Nebot E.M., Durrant-Whyte H.F. A high integrity IMU/GPS navigation loop for autonomous land vehicle applications. IEEE Trans. Robot. Autom. 1999;15:572–578. doi: 10.1109/70.768189. - DOI
    1. Jiménez A.R., Seco F., Prieto J.C., Guevara J. Indoor pedestrian navigation using an INS/EKF framework for yaw drift reduction and a foot-mounted IMU; Proceedings of the 2010 7th Workshop on Positioning Navigation and Communication (WPNC); Dresden, Germany. 11–12 March 2010; pp. 135–143.
    1. Ojeda L., Borenstein J. Non-GPS navigation for security personnel and first responders. J. Navig. 2007;60:391–407. doi: 10.1017/S0373463307004286. - DOI