📢 Open to Full-Time positions related to VLMs, VLA, or Content Generation (2025)
Benjin ZHU is a final-year Ph.D. candidate in the Department of Electronic Engineering at The Chinese University of Hong Kong, where he has studied since 2021. He is affiliated with the MultiMedia Lab and supervised by Prof. Hongsheng LI and Prof. Xiaogang WANG. He earned his Bachelor’s degree in Software Engineering from South China University of Technology in 2018.
Benjin’s current research interests include VLA models, image/video generation, and driving world simulators. His recent work covers 3D driving scene understanding, reconstruction and synthesis, and HD mapping, and has been recognized at top conferences such as CVPR, ICCV, and ECCV. He has also published influential work on object detection and self-supervised pretraining. He has won multiple top international competitions, including the first nuScenes 3D Object Detection Challenge at WAD, CVPR 2019, where he proposed CBGS, a method since widely adopted in both academia and industry. Benjin has also made significant contributions to open-source computer vision frameworks, including Det3D, CVPods, and EFG, all of which have gained substantial popularity.
Ph.D. in Electronic Engineering, 2021 ~ Present
The Chinese University of Hong Kong (CUHK)
B.Eng. in Software Engineering, 2014 ~ 2018
South China University of Technology (SCUT)