Survey of feature selection and extraction techniques for stock market prediction
- PMID: 36687795
- PMCID: PMC9834034
- DOI: 10.1186/s40854-022-00441-7
Survey of feature selection and extraction techniques for stock market prediction
Abstract
In stock market forecasting, the identification of critical features that affect the performance of machine learning (ML) models is crucial to achieve accurate stock price predictions. Several review papers in the literature have focused on various ML, statistical, and deep learning-based methods used in stock market forecasting. However, no survey study has explored feature selection and extraction techniques for stock market forecasting. This survey presents a detailed analysis of 32 research works that use a combination of feature study and ML approaches in various stock market applications. We conduct a systematic search for articles in the Scopus and Web of Science databases for the years 2011-2022. We review a variety of feature selection and feature extraction approaches that have been successfully applied in the stock market analyses presented in the articles. We also describe the combination of feature analysis techniques and ML methods and evaluate their performance. Moreover, we present other survey articles, stock market input and output data, and analyses based on various factors. We find that correlation criteria, random forest, principal component analysis, and autoencoder are the most widely used feature selection and extraction techniques with the best prediction accuracy for various stock market applications.
Keywords: Dimensionality reduction; Feature extraction; Feature selection; Machine learning; Stock market forecasting.
© The Author(s) 2023.
Conflict of interest statement
Competing interestsThe authors declare that they have no competing interests.
Figures
Similar articles
-
Fusion in stock market prediction: A decade survey on the necessity, recent developments, and potential future directions.Inf Fusion. 2021 Jan;65:95-107. doi: 10.1016/j.inffus.2020.08.019. Epub 2020 Aug 26. Inf Fusion. 2021. PMID: 32868979 Free PMC article.
-
News sensitive stock market prediction: literature review and suggestions.PeerJ Comput Sci. 2021 May 4;7:e490. doi: 10.7717/peerj-cs.490. eCollection 2021. PeerJ Comput Sci. 2021. PMID: 34013029 Free PMC article.
-
Analyzing the critical steps in deep learning-based stock forecasting: a literature review.PeerJ Comput Sci. 2024 Sep 23;10:e2312. doi: 10.7717/peerj-cs.2312. eCollection 2024. PeerJ Comput Sci. 2024. PMID: 39650437 Free PMC article.
-
A bibliometric literature review of stock price forecasting: From statistical model to deep learning approach.Sci Prog. 2024 Jan-Mar;107(1):368504241236557. doi: 10.1177/00368504241236557. Sci Prog. 2024. PMID: 38490223 Free PMC article. Review.
-
Systematic literature review of the performance characteristics of Chebyshev polynomials in machine learning applications for economic forecasting in low-income communities in sub-Saharan Africa.SN Bus Econ. 2022;2(12):184. doi: 10.1007/s43546-022-00328-w. Epub 2022 Nov 10. SN Bus Econ. 2022. PMID: 36407751 Free PMC article. Review.
Cited by
-
The contagion effect of heterogeneous investor groups.PLoS One. 2023 Oct 18;18(10):e0292795. doi: 10.1371/journal.pone.0292795. eCollection 2023. PLoS One. 2023. PMID: 37851630 Free PMC article.
-
Developing an Early Warning System for Financial Networks: An Explainable Machine Learning Approach.Entropy (Basel). 2024 Sep 17;26(9):796. doi: 10.3390/e26090796. Entropy (Basel). 2024. PMID: 39330129 Free PMC article.
-
Prediction of stock market using sentiment analysis and ensemble learning.MethodsX. 2025 Mar 12;14:103260. doi: 10.1016/j.mex.2025.103260. eCollection 2025 Jun. MethodsX. 2025. PMID: 40207066 Free PMC article.
-
A Novel Improvement of Feature Selection for Dynamic Hand Gesture Identification Based on Double Machine Learning.Sensors (Basel). 2025 Feb 13;25(4):1126. doi: 10.3390/s25041126. Sensors (Basel). 2025. PMID: 40006355 Free PMC article.
-
Enhanced stock market forecasting using dandelion optimization-driven 3D-CNN-GRU classification.Sci Rep. 2024 Sep 8;14(1):20908. doi: 10.1038/s41598-024-71873-7. Sci Rep. 2024. PMID: 39245700 Free PMC article.
References
-
- AIhamery E, Ahamery AA. Enhancing prediction of NASDAQ stock market based on technical indicators. J Eng Appl Sci. 2018;13:4630–4636.
-
- Aloraini A. Penalized ensemble feature selection methods for hidden associations in time series environments case study: equities companies in Saudi stock exchange market. Evol Syst. 2015;6:93–100. doi: 10.1007/s12530-014-9124-y. - DOI
-
- Alsubaie Y, Hindi KE, Alsalman H. Cost-sensitive prediction of stock price direction: selection of technical indicators. IEEE Access. 2019;7:146876–146892. doi: 10.1109/ACCESS.2019.2945907. - DOI
-
- Ampomah EK, Qin Z, Nyame G. Evaluation of tree-based ensemble machine learning models in predicting stock price direction of movement. Information. 2020;11:332. doi: 10.3390/info11060332. - DOI
-
- Ampomah EK, Nyame G, Qin Z, et al. Stock market prediction with Gaussian Naive Bayes machine learning algorithm. Informatica. 2021;45:243–256. doi: 10.31449/inf.v45i2.3407. - DOI
Publication types
LinkOut - more resources
Full Text Sources