Utilization of Vision Transformer for Classification and Ranking of Video Distortions

Citation

AlDahoul, Nouar and Abdul Karim, Hezerul and Tan, Myles Joshua Toledo (2023) Utilization of Vision Transformer for Classification and Ranking of Video Distortions. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 13739. pp. 195-204. ISSN 0302-9743

Full text not available from this repository.

Abstract

The perceptual quality of video surveillance footage has impacts on several tasks involved in the surveillance process, such as the detection of anomalous objects. The videos captured by a camera are prone to various distortions such as noise, smoke, haze, low or uneven illumination, blur, rain, and compression, which affect visual quality. Automatic identification of distortions is important when enhancing video quality. Video quality assessment involves two stages: (1) classification of distortions affecting the video frames and (2) ranking of these distortions. A novel video dataset was utilized for training, validating, and testing. Working with this dataset was challenging because it included nine categories of distortions and four levels of severity. The greatest challenge was the availability of multiple types of distortions in the same video. The work presented in this paper addresses the problem of multi-label distortion classification and ranking. A vision transformer was used for feature learning. The experiment showed that the proposed solution performed well in terms of F1 score of single distortion (77.9%) and F1 score of single and multiple distortions (69.9%). Moreover, the average accuracy of level classification was 62% with an average F1 score of 61%.

Item Type: Article
Uncontrolled Keywords: Distortion classification and ranking, Multi-label classification, Video quality assessment, Vision transformer
Subjects: T Technology > TK Electrical engineering. Electronics Nuclear engineering > TK5101-6720 Telecommunication. Including telegraphy, telephone, radio, radar, television
Divisions: Faculty of Engineering (FOE)
Depositing User: Ms Nurul Iqtiani Ahmad
Date Deposited: 07 Feb 2023 04:24
Last Modified: 27 Feb 2023 05:34
URII: http://shdl.mmu.edu.my/id/eprint/11125

Downloads

Downloads per month over past year

View ItemEdit (login required)