LLM-Driven Adaptive Autonomous Robot Navigation via Multimodal Fusion for Dynamic Environments

マルチモーダル融合とLLM駆動による動的環境適応型自律ロボットナビゲーション

LLM-Driven Adaptive Autonomous Robot Navigation via Multimodal Fusion for Dynamic Environments

Autonomous NavigationMultimodal FusionFPGA AccelerationLLM (Large Language Model)Socially Compliant Path PlanningDynamic Environment Adaptation

This research addresses the challenges of autonomous robot navigation in dynamic, high-density environments (e.g., train stations and shopping malls) by proposing a novel framework that integrates multimodal sensor fusion (LiDAR and vision) with a Large Language Model (LLM). To overcome the limitations of rule-based methods in handling unpredictable human behavior and dynamic obstacles, our system combines FPGA-accelerated real-time data processing and LLM-driven socially compliant path planning. Specifically, LiDAR point clouds and Triple-RGB camera data are fused on an FPGA using the Hungarian algorithm, while the LLM analyzes pedestrian attributes (age, wheelchair usage) to dynamically adjust navigation priorities. Experimental results demonstrate a 40% reduction in pedestrian prediction error compared to baseline models, with FPGA processing achieving sub-10ms latency. Future work includes enhancing inference accuracy via Q-LoRA and independent FPGA module verification.


A Digital Twin Approach for Crowd Flow Modeling on Railway Station Platforms

Effective crowd tracking at railway station platforms is essential for ensuring passenger safety and optimizing pedestri...

Digital Twin Crowd Simulation Railway Station

Efficient Machine Unlearning for Mobility Logs with Spatio-Temporal and Natural-Language Data

Modern mobility platforms store vast streams of GPS trajectories, temporal metadata, free-form textual notes, and other ...

Machine Unlearning Privacy Mobility
Adaptive Message Scheduling Assisted by Natural Language Processing Models

Adaptive Message Scheduling Assisted by Natural Language Processing Models

This study proposes an adaptive message delivery mechanism for a broker in the WMO Information System 2.0 (WIS2.0), a ne...

WIS2.0 Pub/Sub Reinforcement Learning
Indoor Object Recognition with WiFi RSSI-Integrated Visual-Language Models

Indoor Object Recognition with WiFi RSSI-Integrated Visual-Language Models

This study proposes a novel method for automatic object recognition and classification in indoor environments using visu...

Point Cloud Visual Language Model AIoT +3
Victim Detection Based on Shape Features from 3D Point Clouds

Victim Detection Based on Shape Features from 3D Point Clouds

Japan experiences frequent natural disasters, including large-scale earthquakes. As seen in the case of the Great Hanshi...

Drone LiDAR Victim Detection
Map-Aware Super-Resolution GPS Trajectory Reconstruction via Machine Learning

Map-Aware Super-Resolution GPS Trajectory Reconstruction via Machine Learning

This research aims to realize a novel system for reconstructing high-resolution GPS trajectory data from truncated or lo...

Mobility Data Reconstruction Spatial-Temporal Data Processing Transformer Models +2