About me
I am currently a tenure-track Assistant Professor in the Information Systems Technology and Design (ISTD) Pillar at Singapore University of Technology and Design (SUTD). Prior to joining SUTD, I was a Research Fellow in Computer Vision and Robot Perception Lab, Department of Computer Science, National University of Singapore (NUS). I recieved my Ph.D. in Computer Science from NUS in March 2021, supervised by Professor Tat-Seng Chua.
I am heading Intelligent Machine Perception Lab (IMPL) at SUTD, with a primary focus on, but not limited to: (1) 3D Computer Vision: 3D scene understanding, 3D reconstruction, 3D generation and editing. (2) Machine Learning: data-efficient learning, multi-modal learning, continual learning, out-of-distribution learning, robust learning. (3) Embodied AI: multi-modal perception, spatial intelligence, embodied navigation, embodied manipulation.
Open Positions
- I am looking for PhD applicants with strong background in computer science, fully-supported by SUTD/AISG/industry scholarship.
- I am recruiting research fellow (holds a doctoral degree) with relevant research experience on computer vision.
- I am welcoming self-fund or CSC-fund visiting PhD/Master students and local MComp/BComp students with interests in the area of (3D) computer vision and machine learning.
News
- [Jan 2026] I will serve as the General Chair for the 33rd International Conference on Multimedia Modeling (MMM 2027)!
- [Jan 2026] One paper about tuning-free long video generation is accepted by ToMM 2026!
- [Jan 2026] I am invited to serve as senior Area Chair at IEEE ICME 2026!
- [Jan 2026] One paper about incremental few-shot semantic segmentation is accepted by TIP 2026!
- [Dec 2025] I am invited to serve as Publicity Chair at ACM ICMR 2026!
- [Nov 2025] Two papers about point cloud representation learning and radar-LiDAR scene flow estimation are accepted by AAAI 2026, for oral and poster presentations, respectively!
- [Oct 2025] I am invited to give a keynote talk at the workshop on Multimodal Foundation Models for Spatial Intelligence at ACM Multimedia 2025!
- [Oct 2025] I am invited serve as an Associate Editor for IEEE Transactions on Circuits and Systems for Video Technology (Impact Factor: 11.1)!
- [Sep 2025] One paper about 3D fine-grained embodied reasoning is accepted by NeurIPS 2025!
- [Sep 2025] I am invited to give a talk at NEXUS Japan–Singapore Joint Workshop 2025!
- [Aug 2025] I will serve as an Area Chair for ICLR 2026!
- [Jul 2025] One paper about assumptive reasoning in MLLMs is accepted by MM 2025!
- [Jun 2025] Four papers are accepted by ICCV 2025!
- [May 2025] One paper about multi-modal 3D panoptic segmentation is accepted by ICML 2025!
- [Apr 2025] One paper about multi-view clutering is accepted by IJCAI 2025!
- [Apr 2025] I will serve as an Area Chair for MM 2025!
- [Mar 2025] One paper about occluded human reconstruction is accepted by ICME 2025!
- [Feb 2025] Two papers about active 3D object detection and embodied multi-agent collaboration are accepted by CVPR 2025!
- [Feb 2025] One paper about 3D object detection for autonomous driving is accepted by IJCV 2025!
- [Feb 2025] I am invitated to serve as an Associate Editor for Knowledge-Based Systems (Impact Factor: 7.6)!
- [Feb 2025] One paper about semi-supervised medical domain generalization is accepted by TMM 2025!
- [Jan 2025] One paper about 3D reconstruction and editing is accepted by ICLR 2025!
- [Dec 2024] I will serve as an Area Chair for NLPCC 2025!
- [Dec 2024] One paper about 3D visual grounding is accepted by AAAI 2025!
- [Dec 2024] One paper about class-incremental 3D object detection is accepted by TIP 2024!
- [Nov 2024] I will serve as a senior PC for IJCAI 2025!
- [Nov 2024] I am awarded a grant titled "Bridging Language and Physical Real-world for 3D Reasoning and Object Manipulation" from TL@SUTD as the sole Principal Investigator!
- [Oct 2024] I am invited to serve as Demo Chair at ACM Multimedia 2025!
- [Oct 2024] One paper about open-set single-source domain generalization is accepted by TMM 2024!
- [Sep 2024] I am awarded a joint SMU-SUTD grant titled "Synthesis and Resilience: Generative Models for Generalizable 3D World Understanding" as the co-Principal Investigator!
- [Sep 2024] I will serve as an Area Chair for ICLR 2025!
- [Aug 2024] I am awarded a MoE Tier 2 grant titled "Empowering Real-World 3D Scene Understanding: Navigating Noise, Distribution Shifts, and Incremental Learning" as the sole Principal Investigator!
- [Aug 2024] I am appointed as a Technical Committee member for IEEE-CAS Multimedia Systems and Applications!
- [Jul 2024] Two papers about domain generalized 3D semantic segmentation and UDA for 3D object detection are accepted by BMVC 2024!
- [Jul 2024] Two papers about generalizable neural semantic fields and point cloud representation learning are accepted by MM 2024!
- [Jul 2024] Two papers about open-vocabulary 3D object detection and 3D Gaussain splatting editing are accepted by ECCV 2024!
- [Jan 2024] One paper about language-guided 3D affordance segmentation is accepted by CVPR 2024!
- [Jan 2024] One paper about semi-supervised 3D instance segmentation is accepted by ICRA 2024!
- [Dec 2023] I am awarded a grant titled "MANTIS - Cross-modality Resiliency against Real-world Attacks" from DSO as the sole Principal Investigator!
- [Dec 2023] Two papers about semi-supervised 3D object detection and robust visual recognition are accepted by AAAI 2024!
- [Oct 2023] One paper about self-supervised point cloud representation learning is accepted by 3DV 2024 as an oral paper!
- [Sep 2023] One paper about visual domain generalization is accepted by IJCV 2023!
- [Aug 2023] One paper about robust few-shot point cloud segmentation is accepted by BMVC 2023!
- [Aug 2023] I am awarded a grant titled "Towards Realistic Deep Learning for 3D Vision" from A*STAR as the co-Investigator!
- [Jul 2023] One paper about generalized few-shot point cloud segmentation is accepted by ICCV 2023!
- [Jun 2023] One paper about 6-DoF grasps synthesis is accepted by IROS 2023!
- [May 2023] One paper about monocular 3D object detection is accepted by TCSVT 2023!
- [Mar 2023] I am invited to serve as Demo Chair at Sixth IEEE International Conference on Multimedia Information Processing and Retrieval (MIPR 2023)!
- [Feb 2023] I am invited to join the Organising Committee of IEEE ICME 2023 Workshop on 3D Multimedia Analytics, Search and Generation!
- [Oct 2022] I am awarded a grant titled "Multi-modal Joint Learning for Scene Understanding" from SUTD-ZJU IDEA as the sole Principal Investigator!
- [Sep 2022] I am awarded a grant titled "Data-efficient 3D Object Detection for Robot Perception" from TL@SUTD as the sole Principal Investigator!
- [Aug 2022] I join the Singapore University of Technology and Design as an Assistant Professor!
- [Jul 2022] Three papers are accepted by ECCV 2022!
- [Dec 2021] One paper about class-incremental 3D object detection is accepted by AAAI 2022 as an oral paper!
- [Jun 2021] I am selected for the CVPR 2021 Doctoral Consortium. My mentor is Prof. Serge Belongie!
- [May 2021] I win the IMDA Excellent Prize (best thesis) for my PhD thesis!
- [Mar 2021] I successfully defended my PhD thesis "Towards Learning Scene Semantics on 3D Point Clouds"!
- [Mar 2021] One paper about few-shot 3D semantic segmentation is accepted by CVPR 2021!
- [Aug 2020] I recieve the Research Achievement Award from SoC!
- [Feb 2020] One paper about semi-supervised 3D object detection is accepted by CVPR 2020 as an oral paper!
Selected Publications
Please visit my google scholar profile for the full publication list.
* indicates corresponding author, and # indicates co-corresponding author
![]() | Towards Generative Understanding: Incremental Few-shot Semantic Segmentation with Diffusion Models Qun Li, Lu Huang, Fu Xiao, Na Zhao, Bir Bhanu IEEE Transactions on Image Processing (TIP), 2026 [Project] [Paper] [Code] |
![]() | RaLiFlow: Scene Flow Estimation with 4D Radar and LiDAR Point Clouds Jingyun Fu, Zhiyu Xiang#, Na Zhao# 40th AAAI Conference on Artificial Intelligence, 2026 [Preprint] [Code] |
![]() | Graph Smoothing for Enhanced Local Geometry Learning in Point Cloud Analysis Shangbo Yuan, Jie Xu, Ping Hu, Xiaofeng Zhu, Na Zhao 40th AAAI Conference on Artificial Intelligence, 2026 Oral Presentation [Preprint] [Code] |
![]() | AffordBot: 3D Fine-grained Embodied Reasoning via Multimodal Large Language Models Xinyi Wang, Xun Yang#, Yanlong Xu, Yuchen Wu, Zhen Li, Na Zhao# 39th Annual Conference on Neural Information Processing Systems (NeurIPS), 2025 [Preprint] [Code] |
![]() | MotionLab: Unified Human Motion Generation and Editing via the Motion-Condition-Motion Paradigm Ziyan Guo, Zeyu HU, De Wen Soh, Na Zhao* International Conference on Computer Vision (ICCV), 2025 [Project][Preprint] [Paper] [Code] |
![]() | H3R: Hybrid Multi-view Correspondence for Generalizable 3D Reconstruction Heng Jia, Linchao Zhu, Na Zhao International Conference on Computer Vision (ICCV), 2025 [Preprint] [Paper] [Code] |
![]() | Robust Multi-View Learning via Representation Fusion of Sample-Level Attention and Alignment of Simulated Perturbation Jie Xu, Na Zhao#, Gang Niu, Masashi Sugiyama, Xiaofeng Zhu# International Conference on Computer Vision (ICCV), 2025 [Preprint] [Paper] [Code] |
![]() | Geometric Alignment and Prior Modulation for View-Guided Point Cloud Completion on Unseen Categories Jingqiao Xiu, Yicong Li, Na Zhao, Han Fang, Xiang Wang, Angela Yao International Conference on Computer Vision (ICCV), 2025 [Paper] |
![]() | Look Before You Decide: Prompting Active Deduction of MLLMs for Assumptive Reasoning Yian Li, Wentao Tian, Yang Jiao, Tianwen Qian, Na Zhao, Bin Zhu, Jingjing Chen, Yu-Gang Jiang ACM Multimedia (MM), 2025 [Paper] |
![]() | How Do Images Align and Complement LiDAR? Towards a Harmonized Multi-modal 3D Panoptic Segmentation Yining Pan, Qiongjie Cui, Xulei Yang, Na Zhao* International Conference on Machine Learning (ICML), 2025 [Preprint] [Paper] [Code] |
![]() | OcSplats: Rendering Occluded Humans with Prior Knowledge Jie Zhang, Qiongjie Cui, Xulei Yang, Na Zhao* IEEE International Conference on Multimedia & Expo (ICME), 2025 [Paper] |
![]() | Uncertainty Meets Diversity: A Comprehensive Active Learning Framework for Indoor 3D Object Detection Jiangyi Wang, Na Zhao* IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025 [Preprint] [Paper] [Code] |
![]() | Collaborative Tree Search for Enhancing Embodied Multi-Agent Collaboration Lizheng Zu, Lin Lin, Song Fu, Na Zhao, Pan Zhou IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025 [Paper] |
![]() | CT3D++: Improving 3D Object Detection with Keypoint-induced Channel-wise Transformer Hualian Sheng, Sijia Cai, Na Zhao, Bing Deng, Qiao Liang, Min-Jian Zhao, Jieping Ye International Journal on Computer Vision (IJCV), 2025 [Preprint] [Paper] [Code] |
![]() | Dual-supervised Asymmetric Co-training for Semi-supervised Medical Domain Generalization Jincai Song, Haipeng Chen, Jun Qin#, Na Zhao# IEEE Transactions on Multimedia (TMM), 2025 [Preprint] [Paper] |
![]() | GaussianBlock: Building Part-Aware Compositional and Editable 3D Scene by Primitives and Gaussians Shuyi Jiang, Qihao Zhao, Hossein Rahmani, De Wen Soh, Jun Liu, Na Zhao* International Conference on Learning Representations (ICLR), 2025 [Preprint] [Code] |
![]() | AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring Xinyi Wang, Na Zhao*, Zhiyuan Han, Dan Guo, Xun Yang Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025 [Preprint] [Paper] [Code (Coming soon) ] |
![]() | Domain Expansion and Boundary Growth for Open-Set Single-Source Domain Generalization Pengkun Jiao, Na Zhao#, Jingjing Chen#, Yu-Gang Jiang IEEE Transactions on Multimedia (TMM), 2025 [Preprint] [Paper] |
![]() | GS^2-GNeSF: Geometry-Semantics Synergy for Generalizable Neural Semantic Fields Chengshun Wang, Na Zhao* ACM Multimedia (MM), 2024 [Paper] |
![]() | On-the-fly Point Feature Representation for Point Clouds Analysis Jiangyi Wang, Zhongyao Cheng, Na Zhao#, Jun Cheng, Xulei Yang# ACM Multimedia (MM), 2024 [Preprint] [Paper] |
![]() | Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image Pengkun Jiao, Na Zhao*, Jingjing Chen, Yu-Gang Jiang European Conference on Computer Vision (ECCV), 2024 [Preprint] [Paper] |
![]() | View-Consistent 3D Editing with Gaussian Splatting Yuxuan Wang, Xuanyu Yi, Zike Wu, Na Zhao, Long Chen, Hanwang Zhang European Conference on Computer Vision (ECCV), 2024 [Preprint] [Paper] [Code] |
![]() | LASO: Language-guided Affordance Segmentation on 3D Object Yicong Li, Na Zhao#, Junbin Xiao, Chun Feng, Xiang Wang#, Tat-Seng Chua IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024 [Paper] [Code] |
![]() | End-to-End Semi-Supervised 3D Instance Segmentation with PCTeacher Linfeng Li, Na Zhao* IEEE International Conference on Robotics and Automation (ICRA), 2024 [Paper] |
![]() | Dual-Perspective Knowledge Enrichment for Semi-Supervised 3D Object Detection Yucheng Han, Na Zhao*, Weiling Chen, Keng-Teck Ma, Hanwang Zhang Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024 [Preprint] [Paper] [Code] |
![]() | Robust Visual Recognition with Class-Imbalanced Open-World Noisy Data Na Zhao*, Gim Hee Lee Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024 [Paper] [Code] |
![]() | SDCoT++: Improved Static-Dynamic Co-Teaching for Class-Incremental 3D Object Detection Na Zhao, Peisheng Qian, Fang Wu, Xun Xu, Xulei Yang, Gim Hee Lee IEEE Transactions on Image Processing (TIP), 2024 [Paper] [Code (Coming soon) ] |
![]() | Syn-to-Real Unsupervised Domain Adaptation for Indoor 3D Object Detection Yunsong Wang, Na Zhao, Gim Hee Lee The British Machine Vision Conference (BMVC), 2024 [Preprint] [Code] |
![]() | Synthetic-to-Real Domain Generalized Semantic Segmentation for 3D Indoor Point Clouds Yuyang Zhao, Na Zhao, Gim Hee Lee The British Machine Vision Conference (BMVC), 2024 [Preprint] |
![]() | Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding Yunsong Wang, Na Zhao, Gim Hee Lee International Conference on 3D Vision (3DV), 2024 Oral Presentation [Preprint] [Paper] |
![]() | Style-Hallucinated Dual Consistency Learning: A Unified Framework for Visual Domain Generalization Yuyang Zhao, Zhun Zhong, Na Zhao, Nicu Sebe, Gim Hee Lee International Journal on Computer Vision (IJCV), 2023 [Preprint] [Paper] [Code] |
![]() | Towards Robust Few-shot Point Cloud Semantic Segmentation Yating Xu, Na Zhao, Gim Hee Lee The British Machine Vision Conference (BMVC), 2023 [Preprint] [Paper] [Code] |
![]() | Generalized Few-Shot Point Cloud Segmentation Via Geometric Words Yating Xu, Conghui Hu, Na Zhao, Gim Hee Lee International Conference on Computer Vision (ICCV), 2023 [Preprint] [Paper] [Code] |
![]() | Refining 6-DoF Grasps with Context-Specific Classifiers Tasbolat Taunyazov, Heng Zhang, John Patrick Eala, Na Zhao, Harold Soh International Conference on Intelligent Robots and Systems (IROS), 2023 [Preprint] [Code] |
![]() | PDR: Progressive Depth Regularization for Monocular 3D Object Detection Hualian Sheng, Sijia Cai, Na Zhao#, Bing Deng, Min-Jian Zhao#, Gim Hee Lee IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2023 [Paper] |
![]() | Teaching with Soft Label Smoothing for Mitigating Noisy Labels in Facial Expressions Tohar Lukov, Na Zhao, Gim Hee Lee, Ser-Nam Lim European Conference on Computer Vision (ECCV), 2022 [Paper] [Code] |
![]() | Rethinking IoU-based Optimization for Single-stage 3D Object Detection Hualian Sheng, Sijia Cai, Na Zhao*, Bing Deng, Jianqiang Huang, Xian-Sheng Hua, Min-Jian Zhao, Gim Hee Lee European Conference on Computer Vision (ECCV), 2022 [Preprint] [Paper] [Code] |
![]() | Style-Hallucinated Dual Consistency Learning for Domain Generalized Semantic Segmentation Yuyang Zhao, Zhun Zhong, Na Zhao, Nicu Sebe, Gim Hee Lee European Conference on Computer Vision (ECCV), 2022 [Preprint] [Paper] [Code] |
![]() | Static-Dynamic Co-Teaching for Class-Incremental 3D Object Detection Na Zhao, Gim Hee Lee Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022 Oral Presentation [Preprint] [Paper] [Code] |
![]() | Few-shot 3D Point Cloud Semantic Segmentation Na Zhao, Tat-Seng Chua, Gim Hee Lee IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021 [Preprint] [Paper] [Code] |
![]() | SESS: Self-Ensembling Semi-Supervised 3D Object Detection Na Zhao, Tat-Seng Chua, Gim Hee Lee IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020 Oral Presentation [Preprint] [Paper] [Code] |
![]() | PS^2-Net: A Locally and Globally Aware Network for Point-Based Semantic Segmentation Na Zhao, Tat-Seng Chua, Gim Hee Lee 25th International Conference on Pattern Recognition (ICPR), 2020 [Preprint] [Paper] [Code] |
Research Grants
- Principal Investigator. TL@SUTD Seed Grant. S$200,000. Mar 2025 - Mar 2027.
Topic: Bridging Language and Physical Real-world for 3D Reasoning and Object Manipulation - Principal Investigator. MoE Tier 2 Research Grant. S$994,411. Feb 2025 - Feb 2028.
Topic: Empowering Real-World 3D Scene Understanding: Navigating Noise, Distribution Shifts, and Incremental Learning - co-Principal Investigator. AISG Research Grant. S$999,999. Jan 2025 - Jan 2028.
Topic: Sequential Deepfake Model Attribution - SUTD Principal Investigator. SMU-SUTD Joint Research Grant. S$275,000. Nov 2024 - Oct 2026.
Topic: Synthesis and Resilience: Generative Models for Generalizable 3D World Understanding - Principal Investigator. DSO Research Grant. S$998,000. Dec 2023 - Dec 2026.
Topic: Cross-modality Resiliency against Real-world Attacks - Co-Investigator. A*STAR MTC Programmatic Grant. S$9,773,400. Aug 2023 - Jul 2026.
Topic: Towards Realistic Deep Learning for 3D Vision - Principal Investigator. SUTD-ZJU Thematic Research Grant. S$148,187. Dec 2022 - Nov 2024.
Topic: Multi-modal Joint Learning for Scene Understanding - Principal Investigator. TL@SUTD Seed Grant. S$85,000. Oct 2022 - Apr 2024.
Topic: Data-efficient 3D Object Detection for Robot Perception
Academic Experience
- Research Fellow. Computer Vision and Robotic Perception Laboratory, NUS. Apr 2021 - Jul 2022.
- Research Associate. Computer Vision and Robotic Perception Laboratory, NUS. Jan 2021 - Mar 2021.
- Research Assistant. NExT++ Rearch Center, NUS. Aug 2015 - Dec 2016.
Academic Services
- Conference Reviewer: CVPR, ICCV, ECCV, NeurIPS, ICML, ICLR, SIGGRAPH, AAAI, IJCAI, MM, etc
- Journal Reviewer: TPAMI, TIP, TKDE, RAL, TCSVT, TOMM, TMM, RAL, Multimedia Systems, etc
- Organizing Committee: General Chair (MMM 2027), Publicity Chair (ICMR 2026), Demo Chair (MM 2025), Demo Chair (MIPR 2023), Publication Chair (MMM 2016)
- Technical Committee Member: IEEE-CAS Multimedia Systems & Applications (2024-2028)
- Journal Associate Editor (AE): Knowledge-Based Systems (Feb 2025-), IEEE TCSVT (Nov 2025-)
- Conference Senior Area Chair (SAC): ICME 2026
- Conference Area Chair (AC): ICLR 2026, MM 2025, ICLR 2025, NLPCC 2025
- Conference Senior Program Committee (PC) Member: IJCAI 2025
Teaching Experience
- Mentor, 01.400 Capstone (3), Fall 2025 & Spring 2026
- Mentor, 01.400 Capstone (2), Fall 2024 & Spring 2025
- Instructor, 50.007 Machine Learning, Spring 2023/2024/2025/2026.
- Mentor, 01.400 Capstone (11), Fall 2023 & Spring 2024.
- Instructor, 10.020 Data Driven World, Fall 2023.
- Teaching Assistant, CS4242 Social Media Computing, Spring 2018 & Spring 2019.
- Teaching Assistant, CS5340 Uncertainty Modeling in AI, Fall 2018.
- Teaching Assistant, CG3002 Embedded Systems Design Project, Fall 2017.










































