Chenhao Zheng

About

I am a second-year CS Ph.D. student at the University of Washington, advised by Prof. Ranjay Krishna. I am also a student researcher at the Allen Institute for AI. Before UW, I was an undergraduate at the University of Michigan and Shanghai Jiao Tong University, where I was fortunate to also work with Prof. Andrew Owens and Dr. Adam Harley. My primary research interest lies in computer vision and multimodal learning.

Email · Google Scholar · Twitter

News

[2026/06] We release the MolmoMotion model at AI2! It's a state-of-the-art goal-conditioned motion forecasting model. Read the blog here.
[2026/04] I will join FAIR as a visiting researcher this fall. Also honored to receive Meta PhD fellowship from the AI Mentorship Program.
[2026/02] TrajTok and Synthetic Object Compositions were accepted to CVPR 2026.
[2026/01] I will join Salesforce AI Research as a research scientist intern this summer, working with Juan Carlos Niebles and Silvio Savarese.
[2025/06] TrajViT was selected as a Highlight paper at ICCV 2025.

Research

I am generally interested in various problems in world modeling and vision-language models. My research so far has focused on two directions.

1) Modeling object dynamics from in-the-wild videos. I believe learning object dynamics from large-scale internet videos can enable tons of applications in generative synthesis, robotics, and behavior prediction. My research proposes a new tokenization scheme and a motion representation to model and predict such object dynamics.

2) Multimodal representation learning. I proposed a novel low-level representation and a training algorithm for vision-language models.

Work Experience

Meta AI Research (FAIR)Incoming, Starting from Sept. 2026
Visiting Researcher

Salesforce AI Research, June 2026 ~ Sept. 2026
Research Scientist Intern
Host: Juan Carlos Niebles and Silvio Savarese

Allen Institute for AI, Jan. 2025 ~ June 2026
Student Researcher (also Research Intern, Summer 2025)

Selected Publications

MolmoMotion: Forecasting Point Trajectories in 3D with Language Instruction
Jianing Zhang*, Chenhao Zheng*, Yajun Yang, Rustin Soraki, Winson Han, Chun-Liang Li, Jason Ren, Max Argus, Jieyu Zhang, Ranjay Krishna
In submission
paper · Blog at AI2 · project page

TrajTok: Learning Trajectory Tokens Enhances Video Understanding
Chenhao Zheng, Jieyu Zhang, Jianing Zhang, Weikai Huang, Ashutosh Kumar, Quan Kong, Oncel Tuzel, Chun-Liang Li, Ranjay Krishna
CVPR 2026
paper

One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory
Chenhao Zheng, Jieyu Zhang, Reza Salehi, Ziqi Gao, Vishnu Iyengar, Norimasa Kobori, Quan Kong, Ranjay Krishna
ICCV 2025 (Highlight)
project page · paper

Synthetic Visual Genome
Jae Sung Park, Zixian Ma, Linjie Li, Chenhao Zheng, Cheng-Yu Hsieh, Ximing Lu, Khyathi Chandu, Quan Kong, Norimasa Kobori, Ali Farhadi, Yejin Choi, Ranjay Krishna
CVPR 2025
project page · paper

Acoustic Volume Rendering for Neural Impulse Response Field
Zitong Lan, Chenhao Zheng, Zhiwei Zheng, Mingmin Zhao
NeurIPS 2024 (Spotlight)
project page · paper

Iterated Learning Improves Compositionality in Large Vision-Language Models
Chenhao Zheng, Jieyu Zhang, Aniruddha Kembhavi, Ranjay Krishna
CVPR 2024
project page · paper · video

EXIF as Language: Learning Cross-Modal Associations Between Images and Camera Metadata
Chenhao Zheng, Ayush Shrivastav, Andrew Owens
CVPR 2023 (Highlight)
project page · paper · video

Undergrad Capstone

A VR capstone project from my undergrad at Michigan that I really love.

Urban Rush: Making Fitness Fun with VR
with Rahmy Salmon, Kalpit Haresh Sutariya, Akash Nallani, and Ashish Patel.
An immersive VR experience encouraging exercise via a narrative-based exploration in a procedurally generated city.
project page