
Mingfang Zhang
PhD student at UTokyo
- Tokyo, Japan
- The University of Tokyo
- Google Scholar
- Github
- Unsplash
Publications by Tags
Tags
a) video and multi-view understanding
Prompt-augmented Boundary Attentive Learning for Weakly Supervised Temporal Sentence Grounding
Published in IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2025
An Egocentric Vision-Language Model based Portable Real-time Smart Assistant
Published in Arxiv preprint, 2025
Masked Video and Body-worn IMU Autoencoder for Egocentric Action Recognition
Published in European Conference on Computer Vision (ECCV), 2024
EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World
Published in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction
Published in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023
Optical Flow in the Dark
Published in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Optical Flow in the Dark
Published in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020
b) vision-language multimodal models
Prompt-augmented Boundary Attentive Learning for Weakly Supervised Temporal Sentence Grounding
Published in IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2025
Egocentric Inertial Localization with Vision-Language Informed Action Cues
Published in Arxiv preprint, 2025
An Egocentric Vision-Language Model based Portable Real-time Smart Assistant
Published in Arxiv preprint, 2025
c) human body perception
SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-training
Published in International Conference on Learning Representations (ICLR), 2025
Single-to-Dual-View Adaptation for Egocentric 3D Hand Pose Estimation
Published in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
GazeOnce: Real-Time Multi-Person Gaze Estimation
Published in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022