Smart City Gnosys

Smart city article details

Title Adaptive Parallel Execution Of Deep Neural Networks On Heterogeneous Edge Devices
ID_Doc 6307
Authors Zhou L.; Samavatian M.H.; Bacha A.; Majumdar S.; Teodorescu R.
Year 2019
Published Proceedings of the 4th ACM/IEEE Symposium on Edge Computing, SEC 2019
DOI http://dx.doi.org/10.1145/3318216.3363312
Abstract New applications such as smart homes, smart cities, and autonomous vehicles are driving an increased interest in deploying machine learning on edge devices. Unfortunately, deploying deep neural networks (DNNs) on resource-constrained devices presents significant challenges. These workloads are computationally intensive and often require cloud-like resources. Prior solutions attempted to address these challenges by either introducing more design efforts or by relying on cloud resources for assistance. In this paper, we propose a runtime adaptive convolutional neural network (CNN) acceleration framework that is optimized for heterogeneous Internet of Things (IoT) environments. The framework leverages spatial partitioning techniques through fusion of the convolution layers and dynamically selects the optimal degree of parallelism according to the availability of computational resources, as well as network conditions. Our evaluation shows that our framework outperforms state-of-art approaches by improving the inference speed and reducing communication costs while running on wirelessly-connected Raspberry-Pi3 devices. Experimental evaluation shows up to 1.9× ∼ 3.7× speedup using 8 devices for three popular CNN models. © 2019 Association for Computing Machinery.
Author Keywords Deep learning; Edge devices; Inference; Parallel execution


Similar Articles


Id Similarity Authors Title Published
20553 View0.895Li Q.; Huang L.; Tong Z.; Du T.-T.; Zhang J.; Wang S.-C.Dissec: A Distributed Deep Neural Network Inference Scheduling Strategy For Edge ClustersNeurocomputing, 500 (2022)
14476 View0.887Lilhore U.K.; Simaiya S.; Sharma Y.K.; Rai A.K.; Padmaja S.M.; Nabilal K.V.; Kumar V.; Alroobaea R.; Alsufyani H.Cloud-Edge Hybrid Deep Learning Framework For Scalable Iot Resource OptimizationJournal of Cloud Computing, 14, 1 (2025)
32524 View0.885Vigenesh M.; Katyal A.; Hemalatha S.; Ahluwalia G.; Kukreja M.; Mathurkar P.Intelligent Resource Scheduling For Edge-Integrated Iot Using Deep Learning2024 IEEE 4th International Conference on ICT in Business Industry and Government, ICTBIG 2024 (2024)
21857 View0.885Chen Y.; Luo T.; Fang W.; Xiong N.N.Edgeci: Distributed Workload Assignment And Model Partitioning For Cnn Inference On Edge ClustersACM Transactions on Internet Technology, 24, 2 (2024)
38056 View0.884Tang Y.; Jones A.K.; Xiong J.; Zhou P.; Hu J.Mtrain: Enable Efficient Cnn Training On Heterogeneous Fpga-Based Edge ServersIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (2025)
32139 View0.88Elhanashi A.; Dini P.; Saponara S.; Zheng Q.Integration Of Deep Learning Into The Iot: A Survey Of Techniques And Challenges For Real-World ApplicationsElectronics (Switzerland), 12, 24 (2023)
11308 View0.877Gutierrez-Torre A.; Bahadori K.; Baig S.-U.-R.; Iqbal W.; Vardanega T.; Berral J.L.; Carrera D.Automatic Distributed Deep Learning Using Resource-Constrained Edge DevicesIEEE Internet of Things Journal, 9, 16 (2022)
47323 View0.873Balaji C.G.; Damle M.; Chirputkar A.Scalable Distributed Computing And Intelligent Signal Processing For Massive Iot Data StreamsInternational Journal of Engineering Trends and Technology, 72, 11 (2024)
29022 View0.872Feng J.; Zhu K.; Zhang T.Hierarchical Scheduling Of Hybrid Dnn Tasks In Embedded Real-Time SystemsProceedings of the International Conference on Parallel and Distributed Systems - ICPADS (2023)
7415 View0.87Moghaddasi K.; Rajabi S.; Gharehchopogh F.S.; Ghaffari A.An Advanced Deep Reinforcement Learning Algorithm For Three-Layer D2D-Edge-Cloud Computing Architecture For Efficient Task Offloading In The Internet Of ThingsSustainable Computing: Informatics and Systems, 43 (2024)