A novel term weighting scheme MIDF for text categorization

Citation

Deisy, C. and Sonai Muthu Anbananthen, Kalaiarasi and Baskar, M. and Gowri, M. and Ramaraj, N. (2010) A novel term weighting scheme MIDF for text categorization. Journal of Engineering Science and Technology., 5 (1). pp. 94-107. ISSN 1823-4690

[img] Text
15.pdf
Restricted to Repository staff only

Download (329kB)

Abstract

Text categorization is a task of automatically assigning documents to a set of predefined categories. Usually it involves a document representation method and term weighting scheme. This paper proposes a new term weighting scheme called Modified Inverse Document Frequency (MIDF) to improve the performance of text categorization. The document represented in MIDF is trained using the support vector machines classifier with radial basis function kernel. The experiments are carried out in Reuters-21578 corpora. The performance measures taken for text categorization are F1–measure and cost measure. The proposed term weighting scheme performs better than the existing term weighting schemes.

Item Type: Article
Subjects: T Technology > TA Engineering (General). Civil engineering (General)
Divisions: Faculty of Information Science and Technology (FIST)
Depositing User: Ms Rosnani Abd Wahab
Date Deposited: 21 Jan 2014 08:12
Last Modified: 27 Apr 2023 13:32
URII: http://shdl.mmu.edu.my/id/eprint/4919

Downloads

Downloads per month over past year

View ItemEdit (login required)