Data location aware scheduling for virtual Hadoop cluster deployment on private cloud computing environment


Thaha, Asmath Fahad and Muhamad Amin, Anang Hudaya and Kannan, Subarmaniam and Ahmad, Nazrul Muhaimin (2016) Data location aware scheduling for virtual Hadoop cluster deployment on private cloud computing environment. In: 2016 22nd Asia-Pacific Conference on Communications (APCC), 25-27 Aug. 2016, Yogyakarta, Indonesia.


Download (349kB) | Preview


With the advancements of Internet-of-Things (IoT) and Machine-to-Machine Communications (M2M), the ability to generate massive amount of streaming data from sensory devices in distributed environment is inevitable. A common practice nowadays is to process these data in a high-performance computing infrastructure, such as cloud. Cloud platform has the ability to deploy Hadoop ecosystem on virtual clusters. In cloud configuration with different geographical regions, virtual machines (VMs) that are part of virtual cluster are placed randomly. Prior to processing, data have to be transferred to the regional sites with VMs for data locality purposes. In this paper, a provisioning strategy with data-location aware deployment for virtual cluster will be proposed, as to localize and provision the cluster near to the storage. The proposed mechanism reduces the network distance between virtual cluster and storage, resulting in reduced job completion times.

Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: Cloud computing, Distributed databases, Computational modeling, Data models, Computer architecture, Data processing, Conferences
Subjects: Q Science > QA Mathematics > QA71-90 Instruments and machines > QA75.5-76.95 Electronic computers. Computer science > QA76.75-76.765 Computer software
Divisions: Faculty of Information Science and Technology (FIST)
Depositing User: Ms Suzilawati Abu Samah
Date Deposited: 06 Jul 2020 05:03
Last Modified: 06 Jul 2020 05:03


Downloads per month over past year

View ItemEdit (login required)