The Role of Ensemble Learning in Stock Market Classification Model Accuracy Enhancement Based on Naive Bayes Classifiers


  •  Ghaith Abdulsattar A.Jabbar Alkubaisi    

Abstract

Over the last years, methods of hybrid and ensemble have attracted the attention of the data mining community. Moreover, in the computational intelligence area such as machine learning, constructing and adaptive hybrid models have become essential to achieve good performance. However, the accuracy of stock market classification models is still low, and this has negatively affected the stock market indicators. Furthermore, there are many factors that have a direct effect on the classification models’ accuracies which were not addressed by previous research such as the automatic labelling technique which results in low classification accuracy due to the absence of specific lexicon, and the suitability of the classifiers to the data features and domain. In this research, a proposed model is designed to enhance the classification accuracy by the incorporation of stock market domain expert labelling technique and the construction of an ensemble Naïve Bayes classifiers to classify the stock market sentiments. The methodology for this research consists of five phases. The first phase is data collection, and the second phase is labelling, in which polarity of data is specified and negative, positive or neutral values are assigned. The third phase involves data pre-processing. The fourth phase is the classification phase in which suitable patterns of the stock market are identified by Ensemble Naïve Bayes classifiers, and the final is the performance and evaluation. The classification method has produced a significant result; it has achieved accuracy of more than 89%.



This work is licensed under a Creative Commons Attribution 4.0 License.
  • ISSN(Print): 1927-7032
  • ISSN(Online): 1927-7040
  • Started: 2012
  • Frequency: bimonthly

Journal Metrics

  • h-index (December 2019): 15
  • i10-index (December 2019): 24
  • h5-index (December 2019): N/A
  • h5-median(December 2019): N/A

( The data was calculated based on Google Scholar Citations. Click Here to Learn More. )

Contact