Survey of feature selection and extraction techniques for stock market prediction
- PMID: 36687795
- PMCID: PMC9834034
- DOI: 10.1186/s40854-022-00441-7
Survey of feature selection and extraction techniques for stock market prediction
Abstract
In stock market forecasting, the identification of critical features that affect the performance of machine learning (ML) models is crucial to achieve accurate stock price predictions. Several review papers in the literature have focused on various ML, statistical, and deep learning-based methods used in stock market forecasting. However, no survey study has explored feature selection and extraction techniques for stock market forecasting. This survey presents a detailed analysis of 32 research works that use a combination of feature study and ML approaches in various stock market applications. We conduct a systematic search for articles in the Scopus and Web of Science databases for the years 2011-2022. We review a variety of feature selection and feature extraction approaches that have been successfully applied in the stock market analyses presented in the articles. We also describe the combination of feature analysis techniques and ML methods and evaluate their performance. Moreover, we present other survey articles, stock market input and output data, and analyses based on various factors. We find that correlation criteria, random forest, principal component analysis, and autoencoder are the most widely used feature selection and extraction techniques with the best prediction accuracy for various stock market applications.
Keywords: Dimensionality reduction; Feature extraction; Feature selection; Machine learning; Stock market forecasting.
© The Author(s) 2023.
Conflict of interest statement
Competing interestsThe authors declare that they have no competing interests.
Figures
References
-
- AIhamery E, Ahamery AA. Enhancing prediction of NASDAQ stock market based on technical indicators. J Eng Appl Sci. 2018;13:4630–4636.
-
- Aloraini A. Penalized ensemble feature selection methods for hidden associations in time series environments case study: equities companies in Saudi stock exchange market. Evol Syst. 2015;6:93–100. doi: 10.1007/s12530-014-9124-y. - DOI
-
- Alsubaie Y, Hindi KE, Alsalman H. Cost-sensitive prediction of stock price direction: selection of technical indicators. IEEE Access. 2019;7:146876–146892. doi: 10.1109/ACCESS.2019.2945907. - DOI
-
- Ampomah EK, Qin Z, Nyame G. Evaluation of tree-based ensemble machine learning models in predicting stock price direction of movement. Information. 2020;11:332. doi: 10.3390/info11060332. - DOI
-
- Ampomah EK, Nyame G, Qin Z, et al. Stock market prediction with Gaussian Naive Bayes machine learning algorithm. Informatica. 2021;45:243–256. doi: 10.31449/inf.v45i2.3407. - DOI
Publication types
LinkOut - more resources
Full Text Sources