Talks and Poster Presentations (with Proceedings-Entry):
F. McNamee, S. Dustdar, P. Kilpatrick, W. Shi, I. Spence, B. Varghese:
"The Case for Adaptive Deep Neural Networks in Edge Computing";
Talk: IEEE 14th International Conference on Cloud Computing (CLOUD 2021) - Online Conference,
Chicago, IL, USA;
- 2021-09-11; in: "Proceedings of the IEEE 14th International Conference on Cloud Computing (CLOUD 2021)",
C. Ardagna, C. Chang, E. Damiani, R. Ranjan, Z. Wang, R. Ward, J. Zhang, W. Zhang (ed.);
Deep Neural Networks (DNNs) are an application class that benefit from being distributed across the edge and cloud. A DNN is partitioned such that specific layers of the DNN are deployed onto the edge and the cloud to meet performance and privacy objectives. However, there is limited understanding of: whether and how evolving operational conditions (increased CPU and memory utilization at the edge or reduced data transfer rates between the edge and cloud) affect the performance of already deployed DNNs, and whether a new partition configuration is required to maximize performance. A DNN that adapts to changing operational conditions is referred to as an `adaptive DNNī. This paper investigates whether there is a case for adaptive DNNs by considering four questions: (i) Are DNNs sensitive to operational conditions? (ii) How sensitive are DNNs to operational conditions? (iii) Do individual or a combination of operational conditions equally affect DNNs? (iv) Is DNN partitioning sensitive to hardware architectures? The exploration is carried out in the context of 8 pre-trained DNN models and the results presented are from analyzing nearly 8 million data points. The results highlight that network conditions affect DNN performance more than CPU or memory related operational conditions. Repartitioning is noted to provide a performance gain in a number of cases, but a specific trend is not noted in relation to the underlying hardware architecture. Nonetheless, the need for adaptive DNNs is confirmed.
"Official" electronic version of the publication (accessed through its Digital Object Identifier - DOI)
Created from the Publication Database of the Vienna University of Technology.