Naive Bayes Classifier for Continuous Variables using Novel Method (NBC4D) and Distributions


Yıldırım P., Birant D.

IEEE International Symposium on Innovations in Intelligent Systems and Applications (INISTA), Alberobello, Italy, 23 - 25 June 2014, pp.110-115 identifier identifier

  • Publication Type: Conference Paper / Full Text
  • Volume:
  • Doi Number: 10.1109/inista.2014.6873605
  • City: Alberobello
  • Country: Italy
  • Page Numbers: pp.110-115
  • Keywords: Naive Bayes, continuous probability distributions, classification, data mining
  • Dokuz Eylül University Affiliated: Yes

Abstract

In data mining, when using Naive Bayes classification technique, it is necessary to overcome the problem of how to deal with continuous attributes. Most previous work has solved the problem either by using discretization, normal method or kernel method. This study proposes the usage of different continuous probability distribution techniques for Naive Bayes classification. It explores various probability density functions of distributions. The experimental results show that the proposed probability distributions also classify continuous data with potentially high accuracy. In addition, this paper introduces a novel method, named NBC4D, which offers a new approach for classification by applying different distribution types on different attributes. The results (obtained classification accuracy rates) show that our proposed method (the usage of more than one distribution types) has success on real-world datasets when compared with the usage of only one well known distribution type.