Video-based Human Tracking Robust to Dynamic Camera Position and Orientation Changes

Person Tracking in Discontinuous Video Using Visible-Region Information from Smart Glasses

Keywords

AR, Person Re-ID, Deep Metric Learning

In recent years, RGB cameras and depth cameras integrated into Augmented Reality (AR) glasses and smart glasses have been utilized for recognizing people and objects in real-world environments. Various applications have been proposed to overlay cyber information onto real-world visuals or directly onto smart glasses. For instance, in the field of sports, systems have been developed to detect and identify players during games or training sessions and provide relevant player information to spectators and coaches wearing smart glasses. To realize such smart glasses applications, tracking people in captured video footage is essential.
Numerous object tracking methods have been proposed. For example, DeepSORT takes detections from an object detector such as YOLO and associates them across consecutive frames of a 2D image sequence using a Kalman filter, which assumes smooth, predictable motion. However, most existing person tracking methods, including DeepSORT, are designed for fixed-camera video footage. When the camera position and orientation change, occlusions lasting several frames, abrupt apparent movement within the image, and people leaving and re-entering the frame occur frequently. Because these events violate the Kalman filter's smooth-motion assumption, tracking accuracy deteriorates significantly.
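To illustrate why a smooth-motion assumption breaks down under camera movement, the following is a minimal sketch of a 1D constant-velocity Kalman filter. It is a toy model, not DeepSORT's actual filter: the function name `kalman_innovations`, the noise values `q` and `r`, and the 150-pixel camera pan are all assumptions chosen for illustration. The sketch tracks the horizontal image position of one person and reports the innovation (measurement minus prediction) for each frame.

```python
def kalman_innovations(measurements, dt=1.0, q=0.01, r=1.0):
    """Run a 1D constant-velocity Kalman filter; return the innovation per frame."""
    p, v = 0.0, 0.0                    # state: position (px), velocity (px/frame)
    P = [[100.0, 0.0], [0.0, 100.0]]   # large initial uncertainty (assumed)
    innovations = []
    for z in measurements:
        # Predict: position advances by v*dt, velocity is assumed unchanged.
        p = p + v * dt
        a, b, c, d = P[0][0], P[0][1], P[1][0], P[1][1]
        P = [[a + dt * (b + c) + dt * dt * d + q, b + dt * d],
             [c + dt * d, d + q]]
        # Update with the measured position (observation matrix H = [1, 0]).
        y = z - p                      # innovation: measurement minus prediction
        s = P[0][0] + r
        k0, k1 = P[0][0] / s, P[1][0] / s
        p, v = p + k0 * y, v + k1 * y
        P = [[(1 - k0) * P[0][0], (1 - k0) * P[0][1]],
             [P[1][0] - k1 * P[0][0], P[1][1] - k1 * P[0][1]]]
        innovations.append(y)
    return innovations


# A person drifting 2 px/frame for 20 frames, then a hypothetical camera pan
# that shifts the person's image position by an extra 150 px in one frame.
track = [2.0 * k for k in range(1, 21)]
track.append(2.0 * 21 + 150.0)
res = kalman_innovations(track)
print("innovation before pan:", res[19])
print("innovation at pan:", res[20])
```

While the motion matches the model, the innovation stays close to zero. At the pan it jumps to roughly the size of the shift, so a tracker that gates associations on this residual would likely drop the track or start a new identity.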
This study proposes a method to determine whether a newly detected person in a video corresponds to a previously tracked individual, thereby extending conventional multiple object tracking (MOT) methods designed for fixed-camera footage. By appropriately applying person re-identification (Re-ID) when a previously detected individual reappears, we enable continuous tracking of people in variable viewpoint videos captured by smart glasses. Furthermore, our approach estimates the last known location and disappearance time of a detected person to predict their likely position over time. By utilizing the gaze direction and position of the smart glasses, we estimate whether a person is within the current field of view, thereby suppressing unnecessary re-identification of individuals who are unlikely to be present. This enhances the accuracy of person re-identification.
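The gating idea described above can be sketched as follows. This is a simplified illustration under stated assumptions, not the authors' implementation: it uses 2D top-down geometry, a hypothetical maximum walking speed of 1.5 m/s, a hypothetical cosine-similarity threshold of 0.7, and hypothetical track fields (`id`, `last_pos`, `lost_at`, `embedding`). The embeddings would come from a deep metric learning model, which is not shown here.

```python
import math


def angle_offset_deg(cam_pos, gaze_deg, point):
    """Absolute angle (0..180 degrees) between the gaze direction and the ray to point."""
    bearing = math.degrees(math.atan2(point[1] - cam_pos[1], point[0] - cam_pos[0]))
    diff = (bearing - gaze_deg + 180.0) % 360.0 - 180.0
    return abs(diff)


def may_be_visible(cam_pos, gaze_deg, half_fov_deg, last_pos, elapsed_s, max_speed=1.5):
    """True if a person last seen at last_pos could now be inside the view cone."""
    reach = max_speed * elapsed_s          # radius the person could have covered
    dist = math.dist(cam_pos, last_pos)
    if dist <= reach:
        return True                        # reachable disk contains the camera itself
    spread = math.degrees(math.asin(reach / dist))  # angular half-width of the disk
    return angle_offset_deg(cam_pos, gaze_deg, last_pos) - spread <= half_fov_deg


def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def reidentify(query_emb, lost_tracks, cam_pos, gaze_deg, half_fov_deg, now_s, threshold=0.7):
    """Return the id of the best-matching lost track that may be in view, else None."""
    best_id, best_sim = None, threshold
    for t in lost_tracks:
        elapsed = now_s - t["lost_at"]
        if not may_be_visible(cam_pos, gaze_deg, half_fov_deg, t["last_pos"], elapsed):
            continue                       # skip people unlikely to be in view
        sim = cosine_similarity(query_emb, t["embedding"])
        if sim >= best_sim:
            best_id, best_sim = t["id"], sim
    return best_id


lost = [
    {"id": "A", "last_pos": (5.0, 0.0), "lost_at": 0.0, "embedding": [1.0, 0.0]},
    {"id": "B", "last_pos": (-5.0, 0.0), "lost_at": 9.0, "embedding": [1.0, 0.05]},
]
# Camera at the origin looking along +x with a 60-degree field of view, at t = 10 s.
match = reidentify([1.0, 0.05], lost, (0.0, 0.0), 0.0, 30.0, 10.0)
print(match)
```

In this example track B has the more similar embedding, but it was last seen behind the wearer one second earlier and cannot have reached the view cone, so it is excluded before matching. Filtering candidates this way is one plausible route by which visibility information could reduce false re-identifications.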



Publications

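  • Takahashi, N., Amano, T., Yamaguchi, H., & Higashino, T. (2021). A person identification method for AR devices using deep metric learning (in Japanese). Proceedings of the Multimedia, Distributed, Cooperative, and Mobile Symposium 2021, 2021(1), 388-394.
  • Takahashi, N., Amano, T., & Yamaguchi, H. (2021). Person tracking in discontinuous video using visible-region information from smart glasses (in Japanese). Research Report on Mobile Computing and New Social Systems (MBL), 2021(29), 1-8.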
  • Takahashi, N., Amano, T., & Yamaguchi, H. (2023, June). Multi-Person Tracking Method Robust to Dynamic Viewport Changes for AR apps. In 2023 19th International Conference on Intelligent Environments (IE) (pp. 1-4). IEEE.
