📢 Hiring highly self-motavated full-time / interns interested in VLA, World Models, and RL. (all-year-round)
Benjin ZHU is a research scientist at Li Auto. He got his Ph.D from the Department of Electronic Engineering, The Chinese University of Hong Kong in 2025. He was affiliated to the MultiMedia Lab, and supervised by Prof. Hongsheng LI and Prof. Xiaogang WANG. He earned his Bachelor’s in Software Engineering from South China University of Technology in 2018.
Benjin’s current research interests include corss-embodiment VLA, and World Models with RL. He won mutliple championships of TOP international competitions like the first nuScenes 3D Object Detection Challenge at WAD, CVPR 2019. Benjin has also made significant contributions to open-source computer vision frameworks, including Det3D, CVPods, and EFG that garner substantial popularity. Prior to his doctoral studies, Benjin worked at world-leading AI companies like MEGVII Research, where he was fortunate to collaborate with Dr. Gang Yu, Dr. Xiangyu Zhang and Dr. Jian Sun on topics like object detection and representation learning.
Ph.D in Electronic Engineering, 2021 ~ 2025
The Chinese University of Hong Kong (CUHK)
B.Eng in Software Engineering, 2014 ~ 2018
South China University of Technology (SCUT)
ICCV 2025, Scene Generation