Citation
Tengku Mohd Amin, Tengku Aisyah Asyikin (2020) Hybrid-based labeling scheme for mapping extensible markup language (XML) to relational database. Masters thesis, Multimedia University.
Full text not available from this repository.
Abstract
eXtensible Markup Language (XML) is the de facto standard for data exchange over the World Wide Web in many application domains such as document repositories, digital libraries and business transactions. However, these application data are subject to frequent changes. To make XML a full-featured data exchange format, it is essential to support not only queries but also dynamic updates (insert, update and delete operations) over XML content. On the other hand, some labeling schemes must relabel the whole XML tree when the document is updated. As a result, the XML database size increases. A persistent, robust and durable labeling scheme that avoids relabeling is therefore highly desirable. The first part of this research concentrates on designing a robust and persistent labeling scheme that supports dynamic updates in XML databases. A Relational Database (RDB) is used as the repository because RDBs are still the most popular back-end storage in most organizations. Since XML and RDB use different formats, an efficient mapping technique is required. In the second part of the research, the goal is therefore to implement a mapping algorithm between XML and RDB.
The contributions of the thesis can be summarized as follows. Firstly, a robust labeling scheme known as ORD-GAP is proposed. It is a range-based labeling scheme that leaves a gap between the labels of neighbouring nodes to support future insertions. Secondly, a mapping scheme built upon the ORD-GAP labeling scheme is proposed in order to transform XML into RDB. Finally, to demonstrate that ORD-GAP is robust enough for dynamic updates, three use cases are implemented for the evaluation: (i) left-most insertion, (ii) in-between insertion and (iii) right-most insertion. The ORD-GAP mapping scheme is adapted from the ORDPath insertion technique and uses a model-mapping approach to store the XML document structure. It uses two tables to store the data from XML documents: an internal table and a text table.
Experimental evaluations demonstrated that ORD-GAP outperformed some existing approaches such as ORDPath and ME Labeling in terms of data loading time, query retrieval time and database storage size. On average, ORD-GAP has the best storing and query retrieval time. It was observed, however, that ORD-GAP takes longer for data loading because it needs additional time for some initial calculations. Nevertheless, in most organizations data loading is usually executed only once, whereas query retrieval is performed repeatedly.
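The idea of leaving gaps between labels so that insertions do not force relabeling can be illustrated with a minimal Python sketch. This is not the ORD-GAP algorithm from the thesis, whose label format is not given in the abstract; it is a generic gap-based sibling labeling under stated assumptions. The gap size of 100, the integer labels, and the names `initial_labels` and `insert_label` are all hypothetical.

```python
from typing import List

GAP = 100  # hypothetical gap size; the abstract does not state the value ORD-GAP uses


def initial_labels(n: int, gap: int = GAP) -> List[int]:
    """Label n sibling nodes in document order, leaving unused values between them."""
    return [gap * (i + 1) for i in range(n)]


def insert_label(labels: List[int], position: int, gap: int = GAP) -> int:
    """Insert a new sibling at index `position` and return its label.

    `labels` is the sorted list of existing sibling labels and is modified in place.
    Existing labels are never changed. A ValueError is raised when the gap between
    the two neighbours is exhausted (a real scheme would need a fallback here).
    """
    if not 0 <= position <= len(labels):
        raise IndexError("position out of range")

    # Lower bound: the left neighbour's label, or 0 for a left-most insertion.
    lower = labels[position - 1] if position > 0 else 0

    if position == len(labels):
        # Right-most insertion (or first node): extend by one full gap.
        new = lower + gap
    else:
        # Left-most or in-between insertion: take the midpoint of the free range.
        upper = labels[position]
        new = (lower + upper) // 2
        if new == lower:
            raise ValueError("no free label left between neighbours")

    labels.insert(position, new)
    return new


if __name__ == "__main__":
    siblings = initial_labels(3)           # three existing siblings
    insert_label(siblings, 0)              # (i) left-most insertion
    insert_label(siblings, 2)              # (ii) in-between insertion
    insert_label(siblings, len(siblings))  # (iii) right-most insertion
    print(siblings)
```

In this sketch the three original labels keep their values across all three insertions, which is the property that avoids relabeling. The trade-off is that a gap can run out after repeated insertions at the same spot, so a practical scheme needs a strategy for that case.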
Item Type: | Thesis (Masters) |
---|---|
Additional Information: | Call No.: QA76.76.H94 T46 2020 |
Uncontrolled Keywords: | XML (Document markup language) |
Subjects: | Q Science > QA Mathematics > QA71-90 Instruments and machines > QA75-76.95 Calculating machines |
Divisions: | Faculty of Computing and Informatics (FCI) |
Depositing User: | Ms Nurul Iqtiani Ahmad |
Date Deposited: | 26 Sep 2024 04:03 |
Last Modified: | 26 Sep 2024 04:03 |
URI: | http://shdl.mmu.edu.my/id/eprint/12985 |