Improved Single-Label Text Categorization by Instance Filtration

Kashif Ullah Khan; Usman Qamar; IEEE

doi:10.1109/CISIS.2015.10

Back

Conference proceeding

Improved Single-Label Text Categorization by Instance Filtration

Kashif Ullah Khan, Usman Qamar and IEEE

2015 Ninth International Conference on Complex, Intelligent, and Software Intensive Systems, pp.28-35

01/07/2015

DOI: https://doi.org/10.1109/CISIS.2015.10

Abstract

Accuracy

Computational modeling

Filtration

KNN

Naïve Bayes

Standards

Support vector machines

SVM

Text categorization

text classification

Training

Machine learning classifiers are widely used for text categorization however a classifier misclassifies some of the instances into a category that is relevant to their actual category. The categorization ability of a classifier can be improved by filtering dataset with better classifier and removing such category for misclassified instances. In this paper we proposed a two level approach where level-1 filters instances according to their likelihood in each category and reduce training dataset to top ranked 't' categories and their instances whereas level-2 classifier is used to classify instances with filtered training set. We employed Naïve Bayes, SVM and KNN as machine learning classifiers. Experimental evaluations on standard reuters-21578, cade12 and 20 Newsgroups datasets showed improved categorization effectiveness as measured by accuracy, precision, recall and f-measure protocols.

Metrics

1 Record Views

Details

Title: Improved Single-Label Text Categorization by Instance Filtration
Creators - without role: Kashif Ullah Khan - Dept. of Comput. Eng., Nat. Univ. of Sci. & Technol. (NUST), Islamabad, Pakistan
Usman Qamar - National University of Sciences and Technology
IEEE
Publication Details: 2015 Ninth International Conference on Complex, Intelligent, and Software Intensive Systems, pp.28-35
Publisher: IEEE
Identifiers: 9932460108331
Academic Unit: University Ha'il
Language: English
Resource Type: Conference proceeding