Publications by Tags

a) video and multi-view understanding

Prompt-augmented Boundary Attentive Learning for Weakly Supervised Temporal Sentence Grounding

Published in IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2025

Masked Video and Body-worn IMU Autoencoder for Egocentric Action Recognition

Published in European Conference on Computer Vision (ECCV), 2024

EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World

Published in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024

Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction

Published in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023

Optical Flow in the Dark

Published in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021

Optical Flow in the Dark

Published in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020

b) vision-language multimodal models

Prompt-augmented Boundary Attentive Learning for Weakly Supervised Temporal Sentence Grounding

Published in IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2025

c) human body perception

SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-training

Published in International Conference on Learning Representations (ICLR), 2025

Single-to-Dual-View Adaptation for Egocentric 3D Hand Pose Estimation

Published in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024

GazeOnce: Real-Time Multi-Person Gaze Estimation

Published in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022