Dual indexing and mutual summation based keyword search method for XML databases

Sethuramalingam, Selvaganesan (2014) Dual indexing and mutual summation based keyword search method for XML databases. PhD thesis, Multimedia University.

Full text not available from this repository.
Official URL: http://library.mmu.edu.my/diglib/onlinedb/dig_lib....

Abstract

XML has become the most common means for publication, storage and exchange of data over the Internet. As a result, a huge amount of information is stored and represented in XML, and research on keyword search in XML documents is on the increase as it allows users to find information they are interested in without having to know the underlying database schema or complex query language. In XML keyword search, the accurate identification of user search intention and ranking of the result in the presence of keyword ambiguities have been challenging problems. In this thesis, we propose a XML keyword search using Dual indexing and Mutual summation based Algorithm (XDMA) to address the problems in XReal and other XML keyword search approaches. Our proposed approach builds dual indices, namely, tag information table and data node information table for structural node and data node in XML database respectively. Moreover, we propose a keyword search technique to select all possible T-typed nodes for a given query using the two-level matching between the two indices. Using this search technique, keywords in a given query can be identified and distinguished as tags or data while searching for an input query. Furthermore, another new keyword ambiguity, Ambiguity 4, i.e., A keyword can exist as the name of a tag for node types having different data (text) values and vice versa, is identified and addressed in this thesis. Subsequently, a new concept called mutual summation is proposed for a pair of random variables. By incorporating the concept of dependence of two indices and the concept of mutual summation, we define the Mutual Score (MScore) between selected tags and query keywords to find the desired node of type T.

Item Type: Thesis (PhD)
Additional Information: Call No.: QA76.76.H94 S48 2014
Uncontrolled Keywords: XML (Document markup language)
Subjects: Q Science > QA Mathematics > QA75.5-76.95 Electronic computers. Computer science > QA76.75-76.765 Computer software
Divisions: Faculty of Computing and Informatics (FCI)
Depositing User: Ms Nurul Iqtiani Ahmad
Date Deposited: 12 Jan 2016 11:06
Last Modified: 12 Jan 2016 11:06
URI: http://shdl.mmu.edu.my/id/eprint/6268

Actions (login required)

View Item View Item