Citation
Lim, Lian Tze and Soon, Lay Ki and Lim, Tek Yong and Tang, Enya Kong and Ranaivo-Malancon, Bali (2014) Lexicon plus TX: Rapid construction of a multilingual lexicon with under-resourced languages. Language Resources and Evaluation, 48 (3). pp. 479-492. ISSN 1574-0218 Full text not available from this repository.Abstract
Most efforts at automatically creating multilingual lexicons require input lexical resources with rich content (e.g. semantic networks, domain codes, semantic categories) or large corpora. Such material is often unavailable and difficult to construct for under-resourced languages. In some cases, particularly for some ethnic languages, even unannotated corpora are still in the process of collection. We show how multilingual lexicons with under-resourced languages can be constructed using simple bilingual translation lists, which are more readily available. The prototype multilingual lexicon developed comprise six member languages: English, Malay, Chinese, French, Thai and Iban, the last of which is an under-resourced language in Borneo. Quick evaluations showed that 91.2 % of 500 random multilingual entries in the generated lexicon require minimal or no human correction.
Item Type: | Article |
---|---|
Subjects: | Q Science > QA Mathematics > QA71-90 Instruments and machines > QA75.5-76.95 Electronic computers. Computer science |
Divisions: | Faculty of Computing and Informatics (FCI) |
Depositing User: | Ms Nurul Iqtiani Ahmad |
Date Deposited: | 23 Sep 2014 02:56 |
Last Modified: | 23 Sep 2014 02:56 |
URII: | http://shdl.mmu.edu.my/id/eprint/5752 |
Downloads
Downloads per month over past year
Edit (login required) |