Smart City Gnosys

Smart city article details

Title Aggregated Spatio-Temporal Mlp-Mixer For Violence Recognition In Video Clips
ID_Doc 6855
Authors Shen Y.-S.; Chen J.
Year 2023
Published Proceedings - 2023 6th International Symposium on Computer, Consumer and Control, IS3C 2023
DOI http://dx.doi.org/10.1109/IS3C57901.2023.00020
Abstract Existing violent behavior datasets are not perfect in quantity and quality due to the difficulty of collecting. Although the state-of-the-art Transformer models had shown their capability in behavior recognition, it is unsuitable for the task of short-term behavior understanding (e.g., violent behavior recognition) due to the need for a large amount of data to achieve their best performance. Recently, a simple deep learning architecture, an all multilayer perceptron (MLP) architecture called MLP-Mixer, was proposed against Transformer in the task of a few-sample dataset to obtain competitive results. Motivated by spatio-temporal features on neurons, we invent a dual-form dataset for MLP-Mixer-based model training called aggregated spatio-temporal MLP-Mixer (ASM) to handle video understanding tasks. We show that ASM outperforms the state-of-the-art Transformer models as well as some of the best-performed convolutional neural network (CNN) approaches on three public datasets, smart-city CCTV violence detection dataset (SCVD), real-life violence situations (RLVS) dataset, and Hockey fight. Experimental results further validate our idea on short-term behavior scene understanding improvement. © 2023 IEEE.
Author Keywords


Similar Articles


Id Similarity Authors Title Published
61137 View0.919Tu Y.-S.; Shen Y.-S.; Chan Y.Y.; Wang L.; Chen J.Violent Video Recognition By Using Sequential Image CollageSensors, 24, 6 (2024)
22444 View0.871Ren X.; Fan W.; Wang Y.Efficiently Adapting Large Pre-Trained Models For Real-Time Violence Recognition In Smart City SurveillanceJournal of Real-Time Image Processing, 21, 4 (2024)
3466 View0.871Elzein A.; Basaran E.; Yang Y.D.; Qaraqe M.A Novel Multi-Scale Violence And Public Gathering Dataset For Crowd Behavior ClassificationFrontiers in Computer Science, 6 (2024)
61134 View0.858Khan H.; Yuan X.; Qingge L.; Roy K.Violence Detection From Industrial Surveillance Videos Using Deep LearningIEEE Access, 13 (2025)
57685 View0.852Huszar V.D.; Adhikarla V.K.; Negyesi I.; Krasznay C.Toward Fast And Accurate Violence Detection For Automated Video Surveillance ApplicationsIEEE Access, 11 (2023)
60873 View0.852Khan M.; Saddik A.E.; Gueaieb W.; De Masi G.; Karray F.Vd-Net: An Edge Vision-Based Surveillance System For Violence DetectionIEEE Access, 12 (2024)
17204 View0.851Abdali A.R.Data Efficient Video Transformer For Violence Detection10th IEEE International Conference on Communication, Networks and Satellite, Comnetsat 2021 - Proceedings (2021)