Smart City Gnosys

Smart city article details

Title A Machine Learning Based Decision Support Framework For Big Data Pipeline Modeling And Design
ID_Doc 2463
Authors Dhaouadi A.; Bousselmi K.; Monnet S.; Gammoudi M.M.; Hammoudi S.
Year 2024
Published Jordanian Journal of Computers and Information Technology, 10, 3
DOI http://dx.doi.org/10.5455/jjcit.71-1711356163
Abstract The data warehousing process requires an architectural revolution to settle big-data challenges and address new data sources, such as social networks, recommendation systems, smart cities and the web to extract value from shared data. In this respect, the pipeline-modeling community for the acquisition, storage and processing of data for analysis purposes is enacting a wide range of technological solutions that present significant challenges and difficulties. More specifically, the choice of the most appropriate tool for the user’s specific business needs and the interoperability between the different tools have become primary challenges. From this perspective, we propose in this paper a new interactive framework based on machine learning (ML) techniques to assist experts in the process of modeling a customized pipeline for data warehousing. More precisely, we elaborate first (i) an analysis of the experts’ requirements and the characteristics of the data to be processed, then (ii) we propose the most appropriate architecture to their requirements from a multitude of specific architectures instantiated from a generic one, by using (iii) several ML methods to predict the most suitable tool for each phase and task within the architecture. Additionally, our framework is validated through two real-world use cases and user feedback. © 2024, Scientific Research Support Fund of Jordan. All rights reserved.
Author Keywords Big data; Data-warehousing modeling; ML methods; Modeling assistance; Tools and technologies


Similar Articles


Id Similarity Authors Title Published
2630 View0.857Ribeiro J.L.; Figueredo M.; Araujo A., Jr.; Cacho N.; Lopes F.A Microservice Based Architecture Topology For Machine Learning Deployment5th IEEE International Smart Cities Conference, ISC2 2019 (2019)
6523 View0.856Cuzzocrea A.Advanced Machine Learning Structures Over Big Data Repositories: Definitions, Models, Properties, AlgorithmsFrontiers in Artificial Intelligence and Applications, 378 (2023)