I am currently a first-year Ph.D. student at The Hong Kong University of Science and Technology (Guangzhou), where I am fortunate to be advised by Prof. Chengwei Qin. Before joining HKUST(GZ), I obtained my B.S. degree in Automation from Xiamen University. I also work closely with Dr. Lidong Bing and Dr. Xingxuan Li at MiroMind.ai. My research interests lie at the intersection of computer vision and natural language processing. Specifically, I focus on Multimodal Large Language Models (MLLMs), with a particular emphasis on interpretability, complex reasoning, and agentic tool use. My goal is to build more transparent and capable AI systems that can effectively reason through real-world tasks.
News
- 2025.06 - One paper was accepted by ICCV 2025.
- 2025.03 - One paper was accepted by CVPR 2025.
Publications
LongVT: Incentivizing “Thinking with Long Videos” via Native Tool Calling [Paper] [Code]
Zuhao Yang, Sudong Wang, Kaichen Zhang, Keming Wu, Sicong Leng, Yifan Zhang, Chengwei Qin, Shijian Lu, Xingxuan Li, Lidong BingSHIFT: Smoothing Hallucinations by Information Flow Tuning for Multimodal Large Language Models [Paper]
Sudong Wang, Yunjian Zhang, Yao Zhu, Enci Liu, Jianing Li, Yanwei Liu, Xiangyang Ji
ICCV 2025Towards Understanding How Knowledge Evolves in Large Vision-Language Models [Paper][Code]
Sudong Wang, Yunjian Zhang, Yao Zhu, Jianing Li, Zizhe Wang, Yanwei Liu, Xiangyang Ji
CVPR 2025
Educational Background
- 2025.08 - Present: Doctor of Philosophy, Thrust of Artificial Intelligence, Hong Kong University of Science and Technology (Guangzhou)
- 2024.08 - 2025.06: Master in Computer Control and Automation, School of Electrical and Electronic Engineering, Nanyang Technological University
- 2020.09 - 2024.06: Bachelor in Automation, College of Aeronautics and Astronautics, Xiamen University
🧑⚖️ Working Experiences
- 2025.06 - Present: AI Scientist Intern, Shanda AI Research Institute & MiroMind.ai, Singapore
- 2024.03 - 2024.11: AI Research Intern, Qiyuan National Lab
