Multi-modal 3D object understanding has gained significant attention, yet current approaches often rely on rigid object-level modality alignment or assume complete data availability across all ...
Abstract: In this paper, we propose a novel task, 3D Human Motion Moment Retrieval (HMMR), which aims to find the motion segments in a large motion corpus that semantically correspond to a given ...