Citation
Liu, Hou-I and Galindo, Marco and Xie, Hongxia and Wong, Lai Kuan and Shuai, Hong Han and Li, Yung Hui and Cheng, Wen Huang (2024) Lightweight Deep Learning for Resource-Constrained Environments: A Survey. ACM Computing Surveys, 56 (10). pp. 1-42. ISSN 0360-0300
Text
Lightweight Deep Learning for Resource-Constrained Environments_ A Survey.pdf - Published Version Restricted to Repository staff only Download (2MB) |
Abstract
Over the past decade, the dominance of deep learning has prevailed across various domains of artificial intelligence, including natural language processing, computer vision, and biomedical signal processing. While there have been remarkable improvements in model accuracy, deploying these models on lightweight devices, such as mobile phones and microcontrollers, is constrained by limited resources. In this survey, we provide comprehensive design guidance tailored for these devices, detailing the meticulous design of lightweight models, compression methods, and hardware acceleration strategies. The principal goal of this work is to explore methods and concepts for getting around hardware constraints without compromising the model’s accuracy. Additionally, we explore two notable paths for lightweight deep learning in the future: deployment techniques for TinyML and Large Language Models. Although these paths undoubtedly have potential, they also present significant challenges, encouraging research into unexplored areas.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Quantization, tinyML, large language models |
Subjects: | Q Science > QA Mathematics > QA71-90 Instruments and machines > QA75.5-76.95 Electronic computers. Computer science |
Divisions: | Faculty of Computing and Informatics (FCI) |
Depositing User: | Ms Nurul Iqtiani Ahmad |
Date Deposited: | 01 Aug 2024 06:23 |
Last Modified: | 01 Aug 2024 06:23 |
URII: | http://shdl.mmu.edu.my/id/eprint/12715 |
Downloads
Downloads per month over past year
Edit (login required) |