I am a PhD student at Nanyang Technological University (NTU), Singapore, supervised by Prof. Ziwei Liu and Dr. Hongyuan Zhu.
Previously, I obtained my Bachelor's Degree from NTU, and I spent a wonderful time working with Prof. Ranjay Krishna at University of Washington on vision-language model reasoning, and Prof. Bihan Wen at NTU on low-light image enhancement.
News
[05/2026] I am recognized as an outstanding reviewer for CVPR 2026. News link: here.
[05/2026] Ego-R1 is accepted to TPAMI. Congrats to all coauthors!
[04/2026] We release SimpleStream, a simple baseline for streaming video understanding.
[04/2026] We release HippoCamp and FileGram, two papers related to the file-system agentic memory.
[03/2026] We release Insight-V++, towards advanced long-chain visual reasoning with multimodal large language models.
[02/2026] We release Demo-ICL, in-context learning for procedural video knowledge acquisition.
[06/2025] Evaluation Agent was selected for an oral presentation and SAC Highlight Award (43/8350) at ACL 2025. Congrats to all coauthors!
[06/2025] We release the Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning. Code and data can be found here.