Vision-Based Hand Gesture Recognition Using Deep Learning Approach

Citation

Tan, Yong Soon (2019) Vision-Based Hand Gesture Recognition Using Deep Learning Approach. Masters thesis, Multimedia University.

Full text not available from this repository.

Abstract

Hand Gesture Recognition (HGR) serves as a fundamental way of communication and interaction for human being. HGR has the potential to transform Human Computer Interaction (HCI) and also entails many useful applications. For instance, HGR can be used to recognize sign language, which is a visual language and is used all over the world as a primary means of communication by the deaf and mute. Conventional HGR equipment like coloured glove to obtain accurate information about the gesture. It is intrusive, inconvenient and costly, which limits its applicability. Therefore, there is great need for vision-based HGR to simplify the communication and interaction process. However, vision-based HGR faces many challenges such as the variation in background, illumination, hand size, color of skin and also similarities among gestures. Vision-based HGR using traditional machine learning approaches typically involve multiple stages of complicated processing, such as hand-crafted feature extraction methods, which are usually designed to deal with certain challenges specifically. Hence, the effectiveness of the system and its ability to deal with different challenges across multiple datasets are heavily relied on the methods used. In contrast, deep learning approach such as CNN, adapts to different challenges via supervised learning. However, CNN network architecture is not fully explored and exploited for vision-based HGR. This study investigates the problem of vision-based HGR, with primary focus on deep learning approach. In this thesis, three deep neural network architectures are proposed for vision-based HGR, which are based on variants of Convolutional Neural Network, namely: (1) Wide ResNet, (2) Convolutional Neural Network with Spatial Pyramid Pooling (CNNSPP), and (3) MDenseNet. The proposed methods are evaluated on three public benchmark datasets.

Item Type: Thesis (Masters)
Additional Information: Call No.: QA76.87 .T36 2019
Uncontrolled Keywords: Neural networks (Computer science)
Subjects: Q Science > QA Mathematics > QA71-90 Instruments and machines > QA75.5-76.95 Electronic computers. Computer science
Divisions: Faculty of Information Science and Technology (FIST)
Depositing User: Ms Nurul Iqtiani Ahmad
Date Deposited: 21 Sep 2020 18:46
Last Modified: 21 Sep 2020 18:46
URII: http://shdl.mmu.edu.my/id/eprint/7745

Downloads

Downloads per month over past year

View ItemEdit (login required)