Improving Gloss Vector Semantic Relatedness Measure by Integrating pointwise mutual information optimizing second-order co-occurrence vectors computed from biomedical corpus and UMLS

Pesaranghader, Ahmad and Pesaranghader, Ali and Muthaiyah, Saravanan (2013) Improving Gloss Vector Semantic Relatedness Measure by Integrating pointwise mutual information optimizing second-order co-occurrence vectors computed from biomedical corpus and UMLS. In: 2013 International Conference on Informatics and Creative Multimedia (ICICM). IEEE, pp. 196-201. ISBN 978-0-7695-5133-3

[img] Text
Improving Gloss Vector Semantic Relatedness Measure by Integrating Pointwise Mutual Information....pdf
Restricted to Repository staff only

Download (571kB)
Official URL: http://ieeexplore.ieee.org/xpl/articleDetails.jsp?...

Abstract

Methods of semantic relatedness are essential for wide range of tasks such as information retrieval and text mining. This paper, concerned with these automated methods, attempts to improve Gloss Vector semantic relatedness measure for more reliable estimation of relatedness between two input concepts. Generally, this measure by considering frequency cut-off for big rams tries to remove low and high frequency words which usually do not end up being significant features. However, this naive cutting approach can lead to loss of valuable information. By employing point wise mutual information (PMI) as a measure of association between features, we will try to enforce the foregoing elimination step in a statistical fashion. Applying both approaches to the biomedical domain, using MEDLINE as corpus, MeSH as thesaurus, and available reference standard of 311 concept pairs manually rated for semantic relatedness, we will show that PMI for removing insignificant features is more effective approach than frequency cut-off.

Item Type: Book Section
Subjects: H Social Sciences > HD Industries. Land use. Labor > HD28-70 Management. Industrial Management
Q Science > QA Mathematics > QA75.5-76.95 Electronic computers. Computer science
T Technology > T Technology (General)
Divisions: Faculty of Creative Multimedia (FCM)
Faculty of Management (FOM)
Depositing User: Ms Nurul Iqtiani Ahmad
Date Deposited: 17 Dec 2014 09:05
Last Modified: 17 Dec 2014 09:05
URI: http://shdl.mmu.edu.my/id/eprint/5851

Actions (login required)

View Item View Item