Sun Yat-sen University, Guangzhou, China
I am a first-year M.S. student at Sun Yat-sen University (SYSU), admitted via the direct-recommendation (推免) pathway. I received my B.S. in Software Engineering from South China University of Technology (SCUT) in 2024. Currently, I am a research intern at Tencent RoboticX, focusing on VLA-based mobile manipulation.
My research interests lie in Embodied AI, Vision-Language-Action (VLA) models, Multimodal Large Language Models, and Controllable Video Generation. I aim to build robust, generalizable embodied agents that can seamlessly operate in diverse real-world environments.
News
- 2025.05 🤖 Joined Tencent RoboticX as a research intern, working on VLA-based mobile manipulation.
- 2025.12 🎉 One paper accepted at CVPR 2026 (CCF-A): VLA Models Are More Generalizable Than You Think.
- 2025.12 📄 New preprint: ACD submitted to IJCV. [arXiv:2512.21268]
- 2025.09 📄 HumanGenesis submitted to NeurIPS 2026. [arXiv:2508.09858]
- 2024.09 🎓 Started M.S. at Sun Yat-sen University (direct recommendation / 推免).
Publications
* denotes equal contribution | underline denotes corresponding author
Research Experience
Education
Technical Skills
Last updated: May 2026