Hi! I am a fourth-year CS PhD student at Tsinghua University, advised by Prof. Jianyu Chen (Founder of RobotEra).
Previously, I received my bachelor's degree from the Dept. of EE, Tsinghua University, in 2022. I have also interned at Tencent, SenseTime, RobotEra, and Shanghai Qi Zhi Institute.
My research focuses on Embodied AI and Generative Models, with a particular emphasis on training robot foundation models capable of performing a wide range of tasks in the physical world. I prefer simple and scalable methods :)
Honors and Awards
[2024.07] Best Paper Award Finalist at RSS 2024.
[2022.06] Outstanding Graduates Award (top 10% of Tsinghua undergraduate students).
[2017.11] Silver Medal in the 34th National Physics Olympiad (CPhO).
Selected Research (* indicates equal contribution)
We incorporate both multi-modal understanding (MMU) and future prediction into a VLA model, enhancing both high-level semantic knowledge and low-level visual dynamics.
We take an initial step toward leveraging online RL to improve VLA models! We observe that online RL for VLA can be extremely unstable, so we adopt an iterative approach.