About

I am a second-year CS Ph.D. student at the University of Washington, advised by Prof. Ranjay Krishna. I am also a student researcher at the Allen Institute for AI. Before UW, I was an undergraduate at the University of Michigan and Shanghai Jiao Tong University, where I was fortunate to also work with Prof. Andrew Owens and Dr. Adam Harley. My primary research interest lies in computer vision and multimodal learning.

Email  ·  Google Scholar  ·  Twitter


News


Research

I am generally interested in various problems in world modeling and vision-language models. My research so far has focused on two directions.

1) Modeling object dynamics from in-the-wild videos. I believe learning object dynamics from large-scale internet videos can enable tons of applications in generative synthesis, robotics, and behavior prediction. My research proposes a new tokenization scheme and a motion representation to model and predict such object dynamics.

2) Multimodal representation learning. I proposed a novel low-level representation and a training algorithm for vision-language models.


Work Experience

Meta AI Research (FAIR)Incoming,   Starting from Sept. 2026
Visiting Researcher


Salesforce AI ResearchIncoming,   June 2026 ~ Sept. 2026
Research Scientist Intern
Host: Juan Carlos Niebles and Silvio Savarese


Allen Institute for AI,   Jan. 2025 ~ June 2026
Student Researcher (also Research Intern, Summer 2025)




Selected Publications

MolmoMotion: Forecasting Point Trajectories in 3D with Language Instruction
Jianing Zhang*, Chenhao Zheng*, Yajun Yang, Rustin Soraki, Winson Han, Chun-Liang Li, Jason Ren, Max Argus, Jieyu Zhang, Ranjay Krishna
In submission

TrajTok: Learning Trajectory Tokens Enhances Video Understanding
Chenhao Zheng, Jieyu Zhang, Jianing Zhang, Weikai Huang, Ashutosh Kumar, Quan Kong, Oncel Tuzel, Chun-Liang Li, Ranjay Krishna
CVPR 2026
paper

One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory
Chenhao Zheng, Jieyu Zhang, Reza Salehi, Ziqi Gao, Vishnu Iyengar, Norimasa Kobori, Quan Kong, Ranjay Krishna
ICCV 2025 (Highlight)
project page  ·  paper

Synthetic Visual Genome
Jae Sung Park, Zixian Ma, Linjie Li, Chenhao Zheng, Cheng-Yu Hsieh, Ximing Lu, Khyathi Chandu, Quan Kong, Norimasa Kobori, Ali Farhadi, Yejin Choi, Ranjay Krishna
CVPR 2025
project page  ·  paper

Acoustic Volume Rendering for Neural Impulse Response Field
Zitong Lan, Chenhao Zheng, Zhiwei Zheng, Mingmin Zhao
NeurIPS 2024 (Spotlight)
project page  ·  paper

Iterated Learning Improves Compositionality in Large Vision-Language Models
Chenhao Zheng, Jieyu Zhang, Aniruddha Kembhavi, Ranjay Krishna
CVPR 2024
project page  ·  paper  ·  video

EXIF as Language: Learning Cross-Modal Associations Between Images and Camera Metadata
Chenhao Zheng, Ayush Shrivastav, Andrew Owens
CVPR 2023 (Highlight)
project page  ·  paper  ·  video


Undergrad Capstone

A VR capstone project from my undergrad at Michigan that I really love.

Urban Rush: Making Fitness Fun with VR
with Rahmy Salmon, Kalpit Haresh Sutariya, Akash Nallani, and Ashish Patel.
An immersive VR experience encouraging exercise via a narrative-based exploration in a procedurally generated city.
project page