38752  | 0.914 | Alzubi T.M.; Mukhtar U.R. | MVR: Synergizing Large And Vision Transformer For Multimodal Natural Language-Driven Vehicle Retrieval | IEEE Access, 13 (2025) |
39699  | 0.885 | Du Y.; Zhang B.; Ruan X.; Su F.; Zhao Z.; Chen H. | OMG: Observe Multiple Granularities For Natural Language-Based Vehicle Retrieval | IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2022-June (2022) |
7243  | 0.878 | Scribano C.; Sapienza D.; Franchini G.; Verucchi M.; Bertogna M. | All You Can Embed: Natural Language Based Vehicle Retrieval With Spatio-Temporal Transformers | IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (2021) |
47389  | 0.877 | Sadiq T.; Omlin C.W. | Scene Retrieval In Traffic Videos With Contrastive Multimodal Learning | Proceedings - International Conference on Tools with Artificial Intelligence, ICTAI (2023) |
12878  | 0.875 | Bo X.; Liu J.; Yang D.; Ma W. | Bridging The Gap: Multi-Granularity Representation Learning For Text-Based Vehicle Retrieval | Complex and Intelligent Systems, 11, 1 (2025) |