Smart City Gnosys

Smart city article details

Title Dissec: A Distributed Deep Neural Network Inference Scheduling Strategy For Edge Clusters
ID_Doc 20553
Authors Li Q.; Huang L.; Tong Z.; Du T.-T.; Zhang J.; Wang S.-C.
Year 2022
Published Neurocomputing, 500
DOI http://dx.doi.org/10.1016/j.neucom.2022.05.084
Abstract New applications such as intelligent manufacturing, autonomous vehicles and smart cities drive large-scale deep learning models deployed in the Internet of Things (IoT) edge environments. However, deep learning models require substantial computations, storage and communication resources to run. It is generally difficult to deploy and execute a complete deep neural network (DNN) on a resource-constrained edge device. One possible solution is to slice the DNN into multiple tiles distributed to different edge devices, which can reduce the number of computations and quantity of data on each edge device. In this paper, we propose DISSEC, a distributed scheduling strategy for DNN inference on IoT edge clusters. DISSEC leverages spatial partitioning techniques through fusing the convolutional layers and dividing them into multiple partitions that can be executed independently, and proposes a method to express the dependencies between partitions. It further proposes a search algorithm based on heuristics to produce a distributed parallel strategy with the best overall inference execution latency. The evaluation shows that our strategy can fully utilize the edge device resources by cooperating with multiple edge devices to perform partitioning tasks in parallel. Furthermore, compared to the existing work scheduling strategy, our strategy reduces communication overhead by 20% and overall execution latency by 9% under different partitioning granularities and numbers of edge devices. © 2022 Elsevier B.V.
Author Keywords Deep neural network; Distributed inference; Edge computing; Internet of Things


Similar Articles


Id Similarity Authors Title Published
21859 View0.918Xue F.; Fang W.; Xu W.; Wang Q.; Ma X.; Ding Y.Edgeld: Locally Distributed Deep Learning Inference On Edge Device ClustersProceedings - 2020 IEEE 22nd International Conference on High Performance Computing and Communications, IEEE 18th International Conference on Smart City and IEEE 6th International Conference on Data Science and Systems, HPCC-SmartCity-DSS 2020 (2020)
41334 View0.912Xiao D.; Wang X.; Yang Z.; Huang C.Partial Distributed Deep Learning Inference Model For Image Based Edge Device ClusterProceedings of 2024 8th International Conference on Electronic Information Technology and Computer Engineering, EITCE 2024 (2025)
21857 View0.904Chen Y.; Luo T.; Fang W.; Xiong N.N.Edgeci: Distributed Workload Assignment And Model Partitioning For Cnn Inference On Edge ClustersACM Transactions on Internet Technology, 24, 2 (2024)
6307 View0.895Zhou L.; Samavatian M.H.; Bacha A.; Majumdar S.; Teodorescu R.Adaptive Parallel Execution Of Deep Neural Networks On Heterogeneous Edge DevicesProceedings of the 4th ACM/IEEE Symposium on Edge Computing, SEC 2019 (2019)
32524 View0.876Vigenesh M.; Katyal A.; Hemalatha S.; Ahluwalia G.; Kukreja M.; Mathurkar P.Intelligent Resource Scheduling For Edge-Integrated Iot Using Deep Learning2024 IEEE 4th International Conference on ICT in Business Industry and Government, ICTBIG 2024 (2024)
34376 View0.875Wang H.; Chen X.; Xu H.; Liu J.; Huang L.Joint Job Offloading And Resource Allocation For Distributed Deep Learning In Edge ComputingProceedings - 21st IEEE International Conference on High Performance Computing and Communications, 17th IEEE International Conference on Smart City and 5th IEEE International Conference on Data Science and Systems, HPCC/SmartCity/DSS 2019 (2019)
1511 View0.864Gali M.; Mahamkali A.A Distributed Deep Meta Learning Based Task Offloading Framework For Smart City Internet Of Things With Edge-Cloud ComputingJournal of Internet Services and Information Security, 12, 4 (2022)
14625 View0.863Jiang Z.; Ling N.; Huang X.; Shi S.; Wu C.; Zhao X.; Yan Z.; Xing G.Coedge: A Cooperative Edge System For Distributed Real-Time Deep Learning TasksIPSN 2023 - Proceedings of the 2023 22nd International Conference on Information Processing in Sensor Networks (2023)
9491 View0.863Ashouri M.; Lorig F.; Davidsson P.; Spalazzese R.; Svorobej S.Analyzing Distributed Deep Neural Network Deployment On Edge And Cloud Nodes In Iot SystemsProceedings - 2020 IEEE 13th International Conference on Edge Computing, EDGE 2020 (2020)
29005 View0.859Ilhan F.; Tekin S.F.; Hu S.; Huang T.; Chow K.-H.; Liu L.Hierarchical Deep Neural Network Inference For Device-Edge-Cloud SystemsACM Web Conference 2023 - Companion of the World Wide Web Conference, WWW 2023 (2023)