Smart City Gnosys

Smart city article details

Title Pre-Processing And Data Validation In Iot Data Streams
ID_Doc 42642
Authors Baban P.
Year 2020
Published DEBS 2020 - Proceedings of the 14th ACM International Conference on Distributed and Event-Based Systems
DOI http://dx.doi.org/10.1145/3401025.3406443
Abstract In the last few years, distributed stream processing engines have been on the rise due to their crucial impacts on real-time data processing with guaranteed low latency in several application domains such as financial markets, surveillance systems, manufacturing, smart cities, etc. Stream processing engines are run-time libraries to process data streams without knowing the lower level streaming mechanics. Apache Storm, Apache Flink, Apache Spark, Kafka Streams and Hazelcast Jet are some of the popular stream processing engines. Nowadays, critical systems like energy systems, are interconnected and automated. As a result, these systems are vulnerable to cyber-attacks. In real-world applications, the sensing values come from sensor devices contains missing values, redundant data, data outliers, manipulated data, data failures, etc. Therefore, our system must be resilient to these conditions. In this paper, we present an approach to check if there is any above mentioned conditions by pre-processing data streams using a stream processing engine like Apache Flink which will be updated as a library in future. Then, the pre-processed streams are forwarded to other stream processing engines like Apache Kafka for real stream processing. As a result, data validation, data consistency and integrity for a resilient system can be accomplished before initiating the actual stream processing. © 2020 ACM.
Author Keywords data pre-processing; data validation; resiliency; stream processing


Similar Articles


Id Similarity Authors Title Published
12160 View0.9Shahverdi E.; Awad A.; Sakr S.Big Stream Processing Systems: An Experimental EvaluationProceedings - 2019 IEEE 35th International Conference on Data Engineering Workshops, ICDEW 2019 (2019)
24778 View0.862Nasiri, H; Nasehi, S; Goudarzi, MEvaluation Of Distributed Stream Processing Frameworks For Iot Applications In Smart CitiesJOURNAL OF BIG DATA, 6, 1 (2019)