Approaches of Dimensionality Reduction for Telugu Document Classification

P. Vijayapal Reddy; B. Sasidhar; B. Harinatha Reddy; B. Vishnu Vardhan; L. Pratap Reddy; A. Govardhan

doi:10.1109/IALP.2009.82

Conference proceeding

2009 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, pp.259-264

International Conference on Asian Language Processing

01/01/2009

DOI: https://doi.org/10.1109/IALP.2009.82

Abstract

Document classification is a prominent area of research that has evolved as a result of the exponential growth in the use of electronic documents. Classification of documents demands an understanding of document units by removing insignificant data and improving computational efficiency. This paper deals with approaches to Dimensionality Reduction (DR) in document units for Telugu. Bag of words is a generic model for English document classification, but adapting this model to Indic scripts has been found to yield meager performance. Two approaches are presented in this paper. The first is a language-specific, corpus-based dimensionality reduction, termed validity-based DR. The second is a category- and document-specific approach, termed category-based DR. The performance of the two approaches is evaluated using accuracy as the measure.

Subjects

Computer Science

Computer Science, Artificial Intelligence

Computer Science, Information Systems

Science & Technology

Technology

Details

Title: Approaches of Dimensionality Reduction for Telugu Document Classification
Creators - without role: P. Vijayapal Reddy - Rajamahendra Coll Engn, Dept CSE, Ibrahimpatnam, Andhra Pradesh, India
B. Sasidhar - CMR University
B. Harinatha Reddy - TRR Coll Engn, Hyderabad, Andhra Pradesh, India
B. Vishnu Vardhan - Indur Inst Engn & Technol, Dept IT, Siddipet, India
L. Pratap Reddy - Jawaharlal Nehru Technological University, Kakinada
A. Govardhan - Jawaharlal Nehru Technological University, Hyderabad
Contributors - without role: M Zhang
H Z Li
K T Lua
M H Dong
Publication Details: 2009 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, pp.259-264
Series: International Conference on Asian Language Processing
Publisher: IEEE
Number of pages: 6
Identifiers: 9949711908331
Academic Unit: King Saud University
Language: English
Resource Type: Conference proceeding