Wide or Narrow? A Visual Attention Inspired Model for View-Type Classification


Loh, Yuen Peng and Tong, Song and Xuefeng, Liang and Kumada, Takatsune (2019) Wide or Narrow? A Visual Attention Inspired Model for View-Type Classification. IEEE Access, 7. pp. 48725-48738. ISSN 2169-3536

[img] Text
142.pdf - Published Version
Restricted to Repository staff only

Download (14MB)


Emerging research revealed that the view-type of photos is not only related to the field of data sciences, such as the sentiment brought forth by sightseeing spots, but also in the field of social sciences about human emotions and behaviors. These potential usages of view-types trigger a challenging problem, that is to automatically distinguish them into wide or narrow. In this paper, we present a computational model to classify them inspired by the human visual system. We found two cues that can represent the visual attention, i.e., focus cue and scale cue. The focus cue is modeled in the frequency domain using the non-sampled contourlet transform (NSCT) and speeded up robust features (SURF). The scale cue is modeled by defining the spatial size and conceptual sizes of an object in the image, whereby AdobeBING and convolutional neural network are used for the respective measurements. By integrating these focus and scale models, a robust scheme is hence proposed for this non-trivial task. The experiments on a newly established dataset, which has 5050 natural images, show better performance by our proposal when compared to the state-of-the-arts.

Item Type: Article
Uncontrolled Keywords: View-type classification,visual attention
Subjects: Q Science > QA Mathematics > QA71-90 Instruments and machines > QA75.5-76.95 Electronic computers. Computer science
Divisions: Faculty of Computing and Informatics (FCI)
Depositing User: Ms Suzilawati Abu Samah
Date Deposited: 09 Mar 2022 01:26
Last Modified: 09 Mar 2022 01:26
URII: http://shdl.mmu.edu.my/id/eprint/9254


Downloads per month over past year

View ItemEdit (login required)