Lazy fine-tuning algorithms for naive Bayesian text classification

Khalil M. El Hindi; Reem R. Aljulaidan; Hussien AlSalman

doi:10.1016/j.asoc.2020.106652

Back

Lazy fine-tuning algorithms for naive Bayesian text classification

Journal article

Peer reviewed

Lazy fine-tuning algorithms for naive Bayesian text classification

Khalil M. El Hindi, Reem R. Aljulaidan and Hussien AlSalman

Applied soft computing, Vol.96, p.106652

01/11/2020

DOI: https://doi.org/10.1016/j.asoc.2020.106652

Abstract

Computer Science

Computer Science, Artificial Intelligence

Computer Science, Interdisciplinary Applications

Science & Technology

Technology

The naive Bayes (NB) learning algorithm is widely applied in many fields, particularly in text classification. However, its performance decreases when it is used in domains where its naive assumption is violated or when the training set is too small to find accurate estimations of the probabilities. In this study, we propose a lazy fine-tuning naive Bayes (LFTNB) method to address both problems. We propose a local fine-tuning algorithm that uses the nearest neighbors of a query instance to fine-tune the probability terms used by NB. Applying the nearest neighbors only makes the independence assumption more likely to be valid, whereas the fine-tuning algorithm is used to find more accurate estimations of the probability terms. The performance of the LFTNB approach was evaluated using 47 UCI datasets. The results show that the LFTNB method achieves superior performance than classical NB, eager FTNB, and k-nearest neighbor algorithms. We also propose eager and lazy fine-tuning versions of powerful NB-based text classification algorithms, namely, multinomial NB, complement NB, and one-versus-all NB. The empirical results using 18 UCI text classification datasets show that the proposed methods outperform untuned versions of these algorithms. (C) 2020 Elsevier B.V. All rights reserved.

Metrics

1 Record Views

Details

Title: Lazy fine-tuning algorithms for naive Bayesian text classification
Creators - without role: Khalil M. El Hindi - King Saud University
Reem R. Aljulaidan - King Saud University
Hussien AlSalman - King Saud University
Publication Details: Applied soft computing, Vol.96, p.106652
Publisher: Elsevier
Number of pages: 13
Grant note: RG-1439-035 / Deanship of Scientific Research at King Saud University; King Saud University
Identifiers: 9947236908331
Academic Unit: King Saud University
Language: English
Resource Type: Journal article