Smart City Gnosys

Smart city article details

Title Edgeld: Locally Distributed Deep Learning Inference On Edge Device Clusters
ID_Doc 21859
Authors Xue F.; Fang W.; Xu W.; Wang Q.; Ma X.; Ding Y.
Year 2020
Published Proceedings - 2020 IEEE 22nd International Conference on High Performance Computing and Communications, IEEE 18th International Conference on Smart City and IEEE 6th International Conference on Data Science and Systems, HPCC-SmartCity-DSS 2020
DOI http://dx.doi.org/10.1109/HPCC-SmartCity-DSS50907.2020.00078
Abstract Deep Neural Networks (DNN) have been widely used in a large number of application scenarios. However, DNN models are generally both computation-intensive and memory-intensive, thus difficult to be deployed on resource-constrained edge devices. Most previous studies focus on local model compression or remote cloud offloading, but overlook the potential benefits brought by distributed DNN execution on multiple edge devices. In this paper, we propose EdgeLD, a new framework for locally distributed execution of DNN-based inference tasks on a cluster of edge devices. In EdgeLD, DNN models' time cost will be firstly profiled in terms of computing capability and network bandwidth. Guided by profiling, an efficient model partition scheme is designed in EdgeLD to balance the assigned workload and the inference runtime among different edge devices. We also propose to employ layer fusion to reduce communication overheads on exchanging intermediate data among devices. Experiment results show that our proposed partition scheme saves up to 15.82% of inference time with regard to the conventional solution. Besides, applying layer fusion can speedup the DNN inference by 1.11-1.13X. When combined, EdgeLD can accelerate the original inference time by 1.77-3.57X on a cluster of 2-4 edge devices. © 2020 IEEE.
Author Keywords deep learning; deep neural network; distributed inference; edge computing


Similar Articles


Id Similarity Authors Title Published
20553 View0.918Li Q.; Huang L.; Tong Z.; Du T.-T.; Zhang J.; Wang S.-C.Dissec: A Distributed Deep Neural Network Inference Scheduling Strategy For Edge ClustersNeurocomputing, 500 (2022)
41334 View0.914Xiao D.; Wang X.; Yang Z.; Huang C.Partial Distributed Deep Learning Inference Model For Image Based Edge Device ClusterProceedings of 2024 8th International Conference on Electronic Information Technology and Computer Engineering, EITCE 2024 (2025)
13852 View0.881Prashanthi S.K.; Kesanapalli S.A.; Simmhan Y.Characterizing The Performance Of Accelerated Jetson Edge Devices For Training Deep Learning ModelsProceedings of the ACM on Measurement and Analysis of Computing Systems, 6, 3 (2022)
21857 View0.881Chen Y.; Luo T.; Fang W.; Xiong N.N.Edgeci: Distributed Workload Assignment And Model Partitioning For Cnn Inference On Edge ClustersACM Transactions on Internet Technology, 24, 2 (2024)
34376 View0.878Wang H.; Chen X.; Xu H.; Liu J.; Huang L.Joint Job Offloading And Resource Allocation For Distributed Deep Learning In Edge ComputingProceedings - 21st IEEE International Conference on High Performance Computing and Communications, 17th IEEE International Conference on Smart City and 5th IEEE International Conference on Data Science and Systems, HPCC/SmartCity/DSS 2019 (2019)
29005 View0.875Ilhan F.; Tekin S.F.; Hu S.; Huang T.; Chow K.-H.; Liu L.Hierarchical Deep Neural Network Inference For Device-Edge-Cloud SystemsACM Web Conference 2023 - Companion of the World Wide Web Conference, WWW 2023 (2023)
40058 View0.864Shahhosseini S.; Seo D.; Kanduri A.; Hu T.; Lim S.-S.; Donyanavard B.; Rahmani A.M.; Dutt N.Online Learning For Orchestration Of Inference In Multi-User End-Edge-Cloud NetworksACM Transactions on Embedded Computing Systems, 21, 6 (2022)
11308 View0.864Gutierrez-Torre A.; Bahadori K.; Baig S.-U.-R.; Iqbal W.; Vardanega T.; Berral J.L.; Carrera D.Automatic Distributed Deep Learning Using Resource-Constrained Edge DevicesIEEE Internet of Things Journal, 9, 16 (2022)
14625 View0.863Jiang Z.; Ling N.; Huang X.; Shi S.; Wu C.; Zhao X.; Yan Z.; Xing G.Coedge: A Cooperative Edge System For Distributed Real-Time Deep Learning TasksIPSN 2023 - Proceedings of the 2023 22nd International Conference on Information Processing in Sensor Networks (2023)
13513 View0.863Hu C.; Bai Y.; Wang R.; Liu C.; Wang X.Ccied: Cache-Aided Collaborative Intelligence Between Edge DevicesProceedings - 2020 IEEE 22nd International Conference on High Performance Computing and Communications, IEEE 18th International Conference on Smart City and IEEE 6th International Conference on Data Science and Systems, HPCC-SmartCity-DSS 2020 (2020)