Abstract
Conference Title: 2013 International Conference on IT Convergence and Security (ICITCS)
Conference Start Date: 2013, Dec. 16
Conference End Date: 2013, Dec. 18
Conference Location: Macao

Webpage text classification is an important problem that has been studied through different approaches and algorithms. It aims to assign a predefined category to a Webpage based on its content and linguistic features. It has many applications, such as word sense disambiguation, document indexing, text filtering, hierarchical categorization of Webpages, and document organization. This study is part of a work in progress in which we aim to develop a bilingual algorithm that classifies Arabic and English Webpage text accurately and efficiently in both languages. It provides a simple overview of the many approaches that have been constructed for classifying Arabic and English Webpage documents. In this survey, the widely used algorithms for text classification are presented together with a comparison of recent research on classification for Arabic and English, in order to conclude which algorithm is best suited to both languages.