Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Dec 21;19(1):29.
doi: 10.3390/s19010029.

An Efficient Incremental Mining Algorithm for Discovering Sequential Pattern in Wireless Sensor Network Environments

Affiliations

An Efficient Incremental Mining Algorithm for Discovering Sequential Pattern in Wireless Sensor Network Environments

Xin Lyu et al. Sensors (Basel). .

Abstract

Wireless sensor networks (WSNs) are an important type of network for sensing the environment and collecting information. It can be deployed in almost every type of environment in the real world, providing a reliable and low-cost solution for management. Huge amounts of data are produced from WSNs all the time, and it is significant to process and analyze data effectively to support intelligent decision and management. However, the new characteristics of sensor data, such as rapid growth and frequent updates, bring new challenges to the mining algorithms, especially given the time constraints for intelligent decision-making. In this work, an efficient incremental mining algorithm for discovering sequential pattern (novel incremental algorithm, NIA) is proposed, in order to enhance the efficiency of the whole mining process. First, a reasoned proof is given to demonstrate how to update the frequent sequences incrementally, and the mining space is greatly narrowed based on the proof. Second, an improvement is made on PrefixSpan, which is a classic sequential pattern mining algorithm with a high-complexity recursive process. The improved algorithm, named PrefixSpan+, utilizes a mapping structure to extend the prefixes to sequential patterns, making the mining step more efficient. Third, a fast support number-counting algorithm is presented to choose frequent sequences from the potential frequent sequences. A reticular tree is constructed to store all the potential frequent sequences according to subordinate relations between them, and then the support degree can be efficiently calculated without scanning the original database repeatedly. NIA is compared with various kinds of mining algorithms via intensive experiments on the real monitoring datasets, benchmarking datasets and synthetic datasets from aspects including time cost, sensitivity of factors, and space cost. The results show that NIA performs better than the existed methods.

Keywords: WSNs; big data; incremental mining; prefix projection database; reticular sequence tree.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
The updated database UD which is the combination of SDB and sdb.
Figure 2
Figure 2
Reticular Sequence Tree.
Figure 3
Figure 3
The Construction Process.
Figure 4
Figure 4
The Counted Results.
Figure 5
Figure 5
Diamond Structure.
Figure 6
Figure 6
The Monitoring Data Flow.
Figure 7
Figure 7
The Runtimes on the Dataset of |UD|=100K with N=100K.
Figure 8
Figure 8
The Runtimes on the Dataset of |UD|=100K with N=10K.
Figure 9
Figure 9
The Runtimes on the Dataset of |UD|=200K with N=10K.
Figure 10
Figure 10
The Runtime Comparison with Three Novel SPM Algorithms.
Figure 11
Figure 11
The Runtime Comparison with the Practical SPM Algorithms.
Figure 12
Figure 12
The Runtimes on BMSWebView-1 with TIR = 0.1% and PIR = 0.01%.
Figure 13
Figure 13
The Runtimes on Kosarak with TIR = 0.1% and PIR = 0.01%.
Figure 14
Figure 14
The Runtimes on the Synthetic Dataset 1 with Rsdb=5%,Rnew=50%.
Figure 15
Figure 15
The Runtimes on the Synthetic Dataset 2 with Rsdb=5%,Rnew=50%.
Figure 16
Figure 16
The Runtimes with the Different Settings of Rsdb.
Figure 17
Figure 17
The Runtimes with the Different Settings of Rnew.
Figure 18
Figure 18
The Amounts of Potential Frequent Sequences on the Different minsups.
Figure 19
Figure 19
The Memory Usage on the Dataset of |UD|=100K with N=100K.
Figure 20
Figure 20
The Memory Usage Comparison with Three Novel SPM Algorithms.

References

    1. Yang D., Xu B., Rao K.Y., Sheng W.H. Passive infrared (PIR)-based indoor position tracking for smart homes using accessibility maps and A-Star algorithm. Sensors. 2018;18:332. doi: 10.3390/s18020332. - DOI - PMC - PubMed
    1. Cruz-Piris L., Rivera D., Fernandez S., Marsa-Maestre I. Optimized sensor network and multi-agent decision support for smart traffic light management. Sensors. 2018;18:435. doi: 10.3390/s18020435. - DOI - PMC - PubMed
    1. Gu Y.L., Hsu L.T., Kamijo S. Passive sensor integration for vehicle self-localization in urban traffic environment. Sensors. 2015;15:30199–30220. doi: 10.3390/s151229795. - DOI - PMC - PubMed
    1. Collier-Oxandale A., Coffey E., Thorson J., Johnston J., Hannigan M. Comparing building and neighborhood-scale variability of CO2 and O3 to inform deployment considerations for low-cost sensor system use. Sensors. 2018;18:1349. doi: 10.3390/s18051349. - DOI - PMC - PubMed
    1. Meng X.L., Nguyen D.T., Xie Y.L., Owen J.S., Psimoulis P., Ince S., Chen Q., Ye J., Bhatia P. Design and implementation of a new system for large bridge monitoring-GeoSHM. Sensors. 2018;18:775. doi: 10.3390/s18030775. - DOI - PMC - PubMed

LinkOut - more resources