TY - JOUR
T1 - SPU-BERT
T2 - Faster human multi-trajectory prediction from socio-physical understanding of BERT
AU - Na, Ki In
AU - Kim, Ue Hwan
AU - Kim, Jong Hwan
N1 - Publisher Copyright:
© 2023
PY - 2023/8/15
Y1 - 2023/8/15
N2 - Accurately predicting pedestrian trajectories requires a human-like socio-physical understanding of movement, nearby pedestrians, and obstacles. However, traditional methods struggle to generate multiple trajectories in the same situation based on socio-physical understanding and are computationally intensive, making real-time application difficult. To overcome these limitations, we propose SPU-BERT, a fast multi-trajectory prediction model that incorporates two non-recursive BERTs for multi-goal prediction (MGP) and trajectory-to-goal prediction (TGP). First, MGP predicts multiple goals through generative models, followed by TGP generating trajectories that approach the predicted goals. SPU-BERT can simultaneously understand movement, social interaction, and scene context from trajectories and semantic maps using a single Transformer encoder, providing explainable results as evidence of socio-physical understanding. In experiments, SPU-BERT accurately predicted future trajectories (with 0.19 m and 7.54 pixels of ADE20 for the ETH/UCY datasets and SDD) with over 100 times faster computation (0.132 s) than the state-of-the-art method. The code is available at https://github.com/kina4147/SPUBERT.
AB - Accurately predicting pedestrian trajectories requires a human-like socio-physical understanding of movement, nearby pedestrians, and obstacles. However, traditional methods struggle to generate multiple trajectories in the same situation based on socio-physical understanding and are computationally intensive, making real-time application difficult. To overcome these limitations, we propose SPU-BERT, a fast multi-trajectory prediction model that incorporates two non-recursive BERTs for multi-goal prediction (MGP) and trajectory-to-goal prediction (TGP). First, MGP predicts multiple goals through generative models, followed by TGP generating trajectories that approach the predicted goals. SPU-BERT can simultaneously understand movement, social interaction, and scene context from trajectories and semantic maps using a single Transformer encoder, providing explainable results as evidence of socio-physical understanding. In experiments, SPU-BERT accurately predicted future trajectories (with 0.19 m and 7.54 pixels of ADE20 for the ETH/UCY datasets and SDD) with over 100 times faster computation (0.132 s) than the state-of-the-art method. The code is available at https://github.com/kina4147/SPUBERT.
KW - BERT
KW - Multi-trajectory prediction
KW - Pedestrian trajectory prediction
KW - Socio-physical understanding
KW - Transformer
UR - http://www.scopus.com/inward/record.url?scp=85162746024&partnerID=8YFLogxK
U2 - 10.1016/j.knosys.2023.110637
DO - 10.1016/j.knosys.2023.110637
M3 - Article
AN - SCOPUS:85162746024
SN - 0950-7051
VL - 274
JO - Knowledge-Based Systems
JF - Knowledge-Based Systems
M1 - 110637
ER -