The future of intelligent transport systems

L3S Best Publication of the Quarter (Q3/2024)     
Category: Information Retrieval and Language Model 

LaMMOn: language model combined graph neural network for multi-target multi-camera tracking in online scenarios 

Authors: Tuan T Nguyen, Hoang H Nguyen, Mina Sartipi, Marco Fisichella 

Published in the Machine Learning Journal

The paper in a nutshell:

 We developed LaMMOn, an innovative AI system for tracking vehicles across multiple city cameras in real time. Unlike previous systems, LaMMOn uses advanced language and graph-based AI techniques to automatically adapt to different scenarios without manual setup. It outperforms existing methods on several public datasets and can easily scale to large urban areas. This technology promises to revolutionize traffic management and urban planning by providing more accurate and comprehensive vehicle tracking data. 

Which problem do you solve with your research?  

Our research addresses the challenge of efficiently tracking and managing urban traffic across multiple cameras. LaMMOn eliminates the need for manual setup for each new camera, making it far more practical for large-scale deployment. This technology enables cities to better manage traffic flow, improve emergency response times, and make data-driven decisions for urban planning and environmental initiatives. 

What is the potential impact of your findings?  

 LaMMOn could significantly enhance urban living by enabling smarter, more efficient cities. It provides real-time, comprehensive data on citywide traffic patterns, facilitating more effective traffic management, urban planning, and emergency response. This could result in reduced congestion, optimized public transportation, and lower vehicle emissions. The system’s adaptability also allows for cost-effective implementation of advanced traffic monitoring at scale, ultimately leading to more livable urban environments. 

What is new about your research?  

Our research introduces a novel end-to-end approach for multi-target multi-camera tracking. LaMMOn uniquely combines language models, graph neural networks, and federated learning techniques. It adapts to new camera setups without manual rule-setting, leverages synthetic data to overcome dataset limitations, and preserves privacy through federated learning. This innovative approach marks a significant advancement in smart city technology, offering a more scalable, efficient, and privacy-preserving solution for urban vehicle tracking. 

Paper link: link.springer.com/article/10.1007/s10994-024-06592-1