Citation
Deisy, C. and Sonai Muthu Anbananthen, Kalaiarasi and Baskar, M. and Gowri, M. and Ramaraj, N. (2010) A novel term weighting scheme MIDF for text categorization. Journal of Engineering Science and Technology, 5 (1). pp. 94-107. ISSN 1823-4690
Official URL: http://jestec.taylors.edu.my/V5Issue1.html
Abstract
Text categorization is the task of automatically assigning documents to a set of predefined categories. It usually involves a document representation method and a term weighting scheme. This paper proposes a new term weighting scheme, Modified Inverse Document Frequency (MIDF), to improve the performance of text categorization. Documents represented with MIDF are used to train a support vector machine classifier with a radial basis function kernel. The experiments are carried out on the Reuters-21578 corpus. Performance is measured using the F1-measure and a cost measure. In these experiments, the proposed scheme performs better than existing term weighting schemes.
Item Type: | Article |
---|---|
Subjects: | T Technology > TA Engineering (General). Civil engineering (General) |
Divisions: | Faculty of Information Science and Technology (FIST) |
Depositing User: | Ms Rosnani Abd Wahab |
Date Deposited: | 21 Jan 2014 08:12 |
Last Modified: | 27 Apr 2023 13:32 |
URI: | http://shdl.mmu.edu.my/id/eprint/4919 |