Photo

Yunze Man 「满运泽」

AI Researcher. PhD in UIUC. Solid experience in AI agents, embodied AI, VLM post-training. Research supported by NVIDIA Fellowship.

My research interests lie at the intersection of vision, machine learning, and robotics. I develop vision-centric large multimodal models, and embodied and agentic AI agents. I am interested in AI agents that interact with the digital world and the physical world.

Email
[Google Scholar] [Github] [X]

News

  • [12/2024]    Received the NVIDIA Graduate Fellowship 2025.
  • [11/2024]    Selected as one of the Top Reviewers in NeurIPS 2024.
  • [09/2024]    Lexicon3D accepted to NeurIPS 2024!
  • [09/2024]    SceneCraft accepted to NeurIPS 2024!
  • [05/2024]    Selected as one of the Outstanding Reviewers in CVPR 2024.
  • [05/2024]    Started my internship at NVIDIA Research. Look forward to seeing you in Bay Area!
  • [01/2024]    LLM4Vision accepted to ICLR 2024 (Spotlight)!
  • [02/2023]    I passed the qualifying exam and officially became a Ph.D. candidate!

Selected Publications

Please refer to my Google Scholar profile for the full list of publications.

(* indicates equal contribution)
locateanything3d
LocateAnything3D: Vision-Language 3D Detection with Chain-of-Sight
Tech Report / Paper / Project
gr00t
GR00T N1.5: An Improved Open Foundation Model for Generalist Humanoid Robots
GR00T Team
argus
Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought
RandAR
RandAR: Decoder-only Autoregressive Visual Generation in Random Orders
Oral presentation
org
Floating No More: Object-Ground Reconstruction from a Single Image
Lexicon3D
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Reasoning
Lexicon3D
SceneCraft: Layout-Guided 3D Scene Generation
LM4Vision
LLM4Vision: Frozen Transformers from Language Models are Effective Visual Encoder Layers
Spotlight presentation
situation3d
SituationVLM: Situational Awareness Matters in 3D Vision Language Reasoning
bevguide
BEV-Guided Multi-Modality Fusion for Driving Perception

Internship Experience

  • [05/2024 ~ 12/2024], NVIDIA Research, Research Scientist Intern
  • [05/2022 ~ 01/2023], Adobe Research, Research Scientist Intern

Professional Service

  • Reviewer for CVPR, ECCV, ICCV, ICLR, NeurIPS, ICML, AAAI, IROS, ICRA, TMLR
                2021 - 2025
  • Teaching Assisant
    • Learining to Learn (CS598), UIUC
      Fall 2022

    • Efficient & Predictive Vision (CS598), UIUC
      Spring 2022

    • Machine Learning (CS446), UIUC
      Fall 2021

    • Computer Vision Capstone (16-621), CMU
      Spring 2020, 2021


© University of Illinois Urbana-Champaign. Last Updated: 2025