Smart City Gnosys

Smart city article details

Title Online Learning For Orchestration Of Inference In Multi-User End-Edge-Cloud Networks
ID_Doc 40058
Authors Shahhosseini S.; Seo D.; Kanduri A.; Hu T.; Lim S.-S.; Donyanavard B.; Rahmani A.M.; Dutt N.
Year 2022
Published ACM Transactions on Embedded Computing Systems, 21, 6
DOI http://dx.doi.org/10.1145/3520129
Abstract Deep-learning-based intelligent services have become prevalent in cyber-physical applications, including smart cities and health-care. Deploying deep-learning-based intelligence near the end-user enhances privacy protection, responsiveness, and reliability. Resource-constrained end-devices must be carefully managed to meet the latency and energy requirements of computationally intensive deep learning services. Collaborative end-edge-cloud computing for deep learning provides a range of performance and efficiency that can address application requirements through computation offloading. The decision to offload computation is a communication-computation co-optimization problem that varies with both system parameters (e.g., network condition) and workload characteristics (e.g., inputs). However, deep learning model optimization provides another source of tradeoff between latency and model accuracy. An end-to-end decision-making solution that considers such computation-communication problem is required to synergistically find the optimal offloading policy and model for deep learning services. To this end, we propose a reinforcement-learning-based computation offloading solution that learns optimal offloading policy considering deep learning model selection techniques to minimize response time while providing sufficient accuracy. We demonstrate the effectiveness of our solution for edge devices in an end-edge-cloud system and evaluate with a real-setup implementation using multiple AWS and ARM core configurations. Our solution provides 35% speedup in the average response time compared to the state-of-the-art with less than 0.9% accuracy reduction, demonstrating the promise of our online learning framework for orchestrating DL inference in end-edge-cloud systems. © 2022 Association for Computing Machinery.
Author Keywords computation offloading; Edge computing; neural network; online learning


Similar Articles


Id Similarity Authors Title Published
29768 View0.924Shahhosseini S.; Hu T.; Seo D.; Kanduri A.; Donyanavard B.; Rahmani A.M.; Dutt N.Hybrid Learning For Orchestrating Deep Learning Inference In Multi-User Edge-Cloud NetworksProceedings - International Symposium on Quality Electronic Design, ISQED, 2022-April (2022)
7415 View0.886Moghaddasi K.; Rajabi S.; Gharehchopogh F.S.; Ghaffari A.An Advanced Deep Reinforcement Learning Algorithm For Three-Layer D2D-Edge-Cloud Computing Architecture For Efficient Task Offloading In The Internet Of ThingsSustainable Computing: Informatics and Systems, 43 (2024)
34376 View0.879Wang H.; Chen X.; Xu H.; Liu J.; Huang L.Joint Job Offloading And Resource Allocation For Distributed Deep Learning In Edge ComputingProceedings - 21st IEEE International Conference on High Performance Computing and Communications, 17th IEEE International Conference on Smart City and 5th IEEE International Conference on Data Science and Systems, HPCC/SmartCity/DSS 2019 (2019)
43827 View0.877Hosseinzadeh M.; Wachal A.; Khamfroush H.; Lucani D.E.Qos-Aware Priority-Based Task Offloading For Deep Learning Services At The EdgeProceedings - IEEE Consumer Communications and Networking Conference, CCNC (2022)
21852 View0.868Zhang L.; Wu J.; Mumtaz S.; Li J.; Gacanin H.; Rodrigues J.J.P.C.Edge-To-Edge Cooperative Artificial Intelligence In Smart Cities With On-Demand Learning OffloadingProceedings - IEEE Global Communications Conference, GLOBECOM (2019)
21859 View0.864Xue F.; Fang W.; Xu W.; Wang Q.; Ma X.; Ding Y.Edgeld: Locally Distributed Deep Learning Inference On Edge Device ClustersProceedings - 2020 IEEE 22nd International Conference on High Performance Computing and Communications, IEEE 18th International Conference on Smart City and IEEE 6th International Conference on Data Science and Systems, HPCC-SmartCity-DSS 2020 (2020)
21063 View0.861He B.; Li H.; Chen T.Drl-Based Computing Offloading Approach For Large-Scale Heterogeneous Tasks In Mobile Edge ComputingConcurrency and Computation: Practice and Experience, 36, 19 (2024)
21825 View0.859Peng K.; Zhang H.; Zhao B.; Liu P.Edge-Cloud Collaborative Computation Offloading For Federated Learning In Smart CityProceedings of the 2022 IEEE International Conference on Dependable, Autonomic and Secure Computing, International Conference on Pervasive Intelligence and Computing, International Conference on Cloud and Big Data Computing, International Conference on Cyber Science and Technology Congress, DASC/PiCom/CBDCom/CyberSciTech 2022 (2022)
40621 View0.859Hassan M.T.; Hosain M.K.Optimization Of Computation Offloading In Mobile-Edge Computing Networks With Deep Reinforcement Approach2024 IEEE International Conference on Communication, Computing and Signal Processing, IICCCS 2024 (2024)
41334 View0.858Xiao D.; Wang X.; Yang Z.; Huang C.Partial Distributed Deep Learning Inference Model For Image Based Edge Device ClusterProceedings of 2024 8th International Conference on Electronic Information Technology and Computer Engineering, EITCE 2024 (2025)