I am currently a Ph.D. candidate at HMI Lab, NERCV²T, School of Computer Science, Peking University, supervised by Prof. Shanghang Zhang. Before that, I received my Bachelor's degree in Artificial Intelligence (Turing Honor Degree) from PKU, where I also obtained a Bachelor's degree in Economics.
My research interests lie in computer vision and multimodal learning, including visual foundation models, vision language models, visual complex reasoning, visual token compression, visual continual learning, and embodied artificial intelligence. The overall goal of my research is to develop a large-scale efficient visual perception system with human-like expression, adaptation, and generalization, equipped with powerful abilities including fundamental perception, cognitive reasoning, and autonomous creativity.
More specifically, my research interests include:
Ph.D. Candidate in Visual Information Processing and Brain-inspired Intelligence
Sep. 2023 -- Jun. 2028 (ETA)
Peking University, Beijing, China
Bachelor of Intelligence Science and Technology & Economics (Dual Degree)
Sep. 2019 -- Jun. 2023
Peking University, Beijing, China
Intern at AI Lab (Model Efficiency for MLLM)
Mar. 2024 -- Now
ByteDance, Beijing, China
Intern in AGI (Memory Mechanism for MLLM)
Jul. 2023 -- Sep. 2023
BAAI, Beijing, China
Intern in Computer Vision (Autonomous Driving)
Sep. 2022 -- Feb. 2023
OPPO, Beijing, China
Intern at GCV Lab (Multi-Modal Learning)
Oct. 2021 -- Feb. 2022
BIGAI, Beijing, China