Abstract
The accuracy of convolutional neural networks (CNNs) has significantly improved over the years. Meanwhile, due to the high portability and usefulness of edge devices, the demand for artificial intelligence (AI) based applications on edge computing devices has been soaring recently. Accordingly, CNN inference has become one of the mainstream AI applications on edge devices. However, the continually increasing leakage power of edge devices drags down the wide deployment of CNN inference applications, as the technology node scales down.In this work, we focus on reducing the power consumption in main memory, which consumes considerable power in CNN inference. Particularly, we observed that the idle state of memory is dominant in computationally intensive CNN inference. To achieve low-power CNN inference on edge devices, we first utilize next-generation nonvolatile memory (NVM) as the main memory device rather than dynamic random-access memory (DRAM) only for CNN inference tasks. To mitigate the increased latency caused by NVM, we propose a novel prefetcher that smartly leverages existing resources in commercial NVM system models; it is designed to predictably manage the locality-specific demands of CNN models while smartly leveraging existing resources in a modern NVM system. Furthermore, utilizing a prefetcher-based approach, we optimize the write allocation to enhance the data reuse and energy efficiency in CNN workloads. Based on simulation, our design improves the energy efficiency by 50% with a negligible impact on the performance compared with conventional DRAM-based platforms.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 29th International Symposium on Low Power Electronics and Design, ISLPED 2024 |
| Publisher | Association for Computing Machinery, Inc |
| ISBN (Electronic) | 9798400706882 |
| DOIs | |
| State | Published - 5 Aug 2024 |
| Event | 29th ACM/IEEE International Symposium on Low Power Electronics and Design, ISLPED 2024 - Newport Beach, United States Duration: 5 Aug 2024 → 7 Aug 2024 |
Publication series
| Name | Proceedings of the 29th International Symposium on Low Power Electronics and Design, ISLPED 2024 |
|---|
Conference
| Conference | 29th ACM/IEEE International Symposium on Low Power Electronics and Design, ISLPED 2024 |
|---|---|
| Country/Territory | United States |
| City | Newport Beach |
| Period | 5/08/24 → 7/08/24 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 7 Affordable and Clean Energy
Keywords
- convolutional neural network (CNN)
- edge device
- low power
- non-volatile memory (NVM)
- prefetcher
Fingerprint
Dive into the research topics of 'EDeN: Enabling Low-Power CNN Inference on Edge Devices Using Prefetcher-assisted NVM Systems'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver