Time series big data: a survey on data stream frameworks, analysis and algorithms
- PMID: 37274443
- PMCID: PMC10225118
- DOI: 10.1186/s40537-023-00760-1
Time series big data: a survey on data stream frameworks, analysis and algorithms
Abstract
Big data has a substantial role nowadays, and its importance has significantly increased over the last decade. Big data's biggest advantages are providing knowledge, supporting the decision-making process, and improving the use of resources, services, and infrastructures. The potential of big data increases when we apply it in real-time by providing real-time analysis, predictions, and forecasts, among many other applications. Our goal with this article is to provide a viewpoint on how to build a system capable of processing big data in real-time, performing analysis, and applying algorithms. A system should be designed to handle vast amounts of data and provide valuable knowledge through analysis and algorithms. This article explores the current approaches and how they can be used for the real-time operations and predictions.
Keywords: Anomaly detection; Big data; Forecasting; Machine learning; Stream processing engines; Time series.
© The Author(s) 2023.
Conflict of interest statement
Competing interestsThe authors declare that they have no competing interests.
Figures















Similar articles
-
A new Apache Spark-based framework for big data streaming forecasting in IoT networks.J Supercomput. 2023;79(10):11078-11100. doi: 10.1007/s11227-023-05100-x. Epub 2023 Feb 21. J Supercomput. 2023. PMID: 36845222 Free PMC article.
-
Design of a Spark Big Data Framework for PM2.5 Air Pollution Forecasting.Int J Environ Res Public Health. 2021 Jul 2;18(13):7087. doi: 10.3390/ijerph18137087. Int J Environ Res Public Health. 2021. PMID: 34281023 Free PMC article.
-
A Distributed Stream Processing Middleware Framework for Real-Time Analysis of Heterogeneous Data on Big Data Platform: Case of Environmental Monitoring.Sensors (Basel). 2020 Jun 3;20(11):3166. doi: 10.3390/s20113166. Sensors (Basel). 2020. PMID: 32503145 Free PMC article.
-
A Survey of Biological Data in a Big Data Perspective.Big Data. 2022 Aug;10(4):279-297. doi: 10.1089/big.2020.0383. Epub 2022 Apr 7. Big Data. 2022. PMID: 35394342 Review.
-
Discovering anomalies in big data: a review focused on the application of metaheuristics and machine learning techniques.Front Big Data. 2023 Aug 17;6:1179625. doi: 10.3389/fdata.2023.1179625. eCollection 2023. Front Big Data. 2023. PMID: 37663272 Free PMC article. Review.
Cited by
-
Solutions for Lithium Battery Materials Data Issues in Machine Learning: Overview and Future Outlook.Adv Sci (Weinh). 2024 Dec;11(48):e2410065. doi: 10.1002/advs.202410065. Epub 2024 Nov 18. Adv Sci (Weinh). 2024. PMID: 39556707 Free PMC article. Review.
-
EntroLLM: Leveraging Entropy and Large Language Model Embeddings for Enhanced Risk Prediction with Wearable Device Data.AMIA Jt Summits Transl Sci Proc. 2025 Jun 10;2025:225-234. eCollection 2025. AMIA Jt Summits Transl Sci Proc. 2025. PMID: 40502232 Free PMC article.
References
-
- Cox M, Ellsworth D. Application-controlled demand paging for out-of-core visualization. In: Proceedings of the 8th Conference on Visualization ’97. VIS ’97, pp. 235–244. IEEE Computer Society Press, Washington, DC, USA, 1997. 10.1109/VISUAL.1997.663888
-
- Gomes EHA, Plentz PDM, Rolt CRD, Dantas MAR. A survey on data stream, big data and real-time. Int J Netw Virtual Organ. 2019;20(2):143–167. doi: 10.1504/IJNVO.2019.097631. - DOI
-
- Zhou B, Li J, Wang X, Gu Y, Xu L, Hu Y, Zhu L. Online internet traffic monitoring system using spark streaming. Big Data Mining Anal. 2018;1(1):47–56. doi: 10.26599/BDMA.2018.9020005. - DOI
-
- Thudumu S, Branch P, Jin J, Singh J. A comprehensive survey of anomaly detection techniques for high dimensional big data. J Big Data. 2020 doi: 10.1186/s40537-020-00320-x. - DOI
LinkOut - more resources
Full Text Sources