Smart City Gnosys

Smart city article details

Title Multiple CNN-Based Tasks Scheduling Across Shared GPU Platform In Research And Development Scenarios
ID_Doc 38608
Authors Chen Z.; Luo L.; Quan W.; Shi Y.; Yu J.; Wen M.; Zhang C.
Year 2019
Published Proceedings - 20th International Conference on High Performance Computing and Communications, 16th International Conference on Smart City and 4th International Conference on Data Science and Systems, HPCC/SmartCity/DSS 2018
DOI http://dx.doi.org/10.1109/HPCC/SmartCity/DSS.2018.00107
Abstract In numerous AI enterprises and research institutes, a shared server or cluster based on commodity GPU hardware needs to simultaneously process multiple diverse CNN-based tasks submitted by different developers and researchers. Scheduling and processing multiple CNN-based tasks, including training and batch inference, is a significant challenge in these practical scenarios. Previous studies, which focus on either the latency of a single training task or the throughput of multiple inference tasks, cannot effectively exploit the limited system resources available for diverse CNN-based tasks. This paper, for the first time, focuses on this specific AI Research and Development scenario and conducts a series of explorations on characterization and scheduling for CNN-based tasks. To evaluate the quality of processing and scheduling, we propose a series of comprehensive metrics, including user satisfaction and system efficiency. With these metrics, we characterize the behaviors of a few typical CNN models under different configurable application and system factors. Then, a heuristic scheduling algorithm informed by our characterization is explored to better allocate computing resources for upcoming tasks and to schedule them dynamically on the cluster or server. Evaluated on multi-GPU platforms against two baseline strategies, our proposed algorithm improves system efficiency by up to 40% and decreases average response latency by around 38% for multiple CNN-based tasks. © 2018 IEEE.
Author Keywords AI Research and Development Scenario; Characterizing; CNN; GPU platform; Scheduling Exploration


Similar Articles


Id 34322
Similarity 0.863
Authors Li H.; Sun T.; Li X.; Xu H.
Title Job Placement Strategy With Opportunistic Resource Sharing For Distributed Deep Learning Clusters
Published Proceedings - 2020 IEEE 22nd International Conference on High Performance Computing and Communications, IEEE 18th International Conference on Smart City and IEEE 6th International Conference on Data Science and Systems, HPCC-SmartCity-DSS 2020 (2020)

Id 38056
Similarity 0.853
Authors Tang Y.; Jones A.K.; Xiong J.; Zhou P.; Hu J.
Title Mtrain: Enable Efficient CNN Training On Heterogeneous FPGA-Based Edge Servers
Published IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (2025)