About me

I am currently a tenure-track Assistant Professor in the Information Systems Technology and Design (ISTD) Pillar at the Singapore University of Technology and Design (SUTD). Prior to joining SUTD, I was a Research Fellow in the Computer Vision and Robotic Perception Lab, Department of Computer Science, National University of Singapore (NUS). I received my Ph.D. in Computer Science from NUS in March 2021, supervised by Professor Tat-Seng Chua.

I head the Intelligent Machine Perception Lab (IMPL) at SUTD. Our primary focus includes, but is not limited to: (1) Computer Vision: 3D computer vision, (3D) scene understanding, 3D reconstruction. (2) Machine Learning: data-efficient learning, multi-modal learning, continual learning, out-of-distribution learning, robust learning.

Open Positions

I am looking for PhD applicants with a strong background in computer science. Positions are fully supported by SUTD/AISG/SINGA/industry scholarships.
I am recruiting research fellows (holding a doctoral degree) with relevant research experience in computer vision.
I welcome self-funded or CSC-funded visiting PhD/Master's students and local MComp/BComp students interested in (3D) computer vision and machine learning.

Please check here for more information, and feel free to reach out via email if you are interested in working with me.

News

[May 2025] One paper about consistent video customization is accepted by EGSR 2025!
[May 2025] One paper about multi-modal 3D panoptic segmentation is accepted by ICML 2025!
[Apr 2025] One paper about multi-view clustering is accepted by IJCAI 2025!
[Apr 2025] I will serve as an Area Chair for MM 2025!
[Mar 2025] One paper about occluded human reconstruction is accepted by ICME 2025!
[Feb 2025] Two papers about active 3D object detection and embodied multi-agent collaboration are accepted by CVPR 2025!
[Feb 2025] One paper about 3D object detection for autonomous driving is accepted by IJCV 2025!
[Feb 2025] I am honored to serve as an Associate Editor for Knowledge-Based Systems (Impact Factor: 7.2)!
[Feb 2025] One paper about semi-supervised medical domain generalization is accepted by TMM 2025!
[Jan 2025] One paper about 3D reconstruction and editing is accepted by ICLR 2025!
[Dec 2024] I will serve as an Area Chair for NLPCC 2025!
[Dec 2024] One paper about 3D visual grounding is accepted by AAAI 2025!
[Dec 2024] One paper about class-incremental 3D object detection is accepted by TIP 2024!
[Nov 2024] I will serve as a Senior PC member for IJCAI 2025!
[Nov 2024] I am awarded a grant titled "Bridging Language and Physical Real-world for 3D Reasoning and Object Manipulation" from TL@SUTD as the sole Principal Investigator!
[Oct 2024] I am invited to serve as Demo Chair at ACM Multimedia 2025!
[Oct 2024] One paper about open-set single-source domain generalization is accepted by TMM 2024!
[Sep 2024] I am awarded a joint SMU-SUTD grant titled "Synthesis and Resilience: Generative Models for Generalizable 3D World Understanding" as the co-Principal Investigator!
[Sep 2024] I will serve as an Area Chair for ICLR 2025!
[Aug 2024] I am awarded a MoE Tier 2 grant titled "Empowering Real-World 3D Scene Understanding: Navigating Noise, Distribution Shifts, and Incremental Learning" as the sole Principal Investigator!
[Aug 2024] I am appointed as a Technical Committee member for IEEE-CAS Multimedia Systems and Applications!
[Jul 2024] Two papers about domain generalized 3D semantic segmentation and UDA for 3D object detection are accepted by BMVC 2024!
[Jul 2024] Two papers about generalizable neural semantic fields and point cloud representation learning are accepted by MM 2024!
[Jul 2024] Two papers about open-vocabulary 3D object detection and 3D Gaussian splatting editing are accepted by ECCV 2024!
[Jan 2024] One paper about language-guided 3D affordance segmentation is accepted by CVPR 2024!
[Jan 2024] One paper about semi-supervised 3D instance segmentation is accepted by ICRA 2024!
[Dec 2023] I am awarded a grant titled "MANTIS - Cross-modality Resiliency against Real-world Attacks" from DSO as the sole Principal Investigator!
[Dec 2023] Two papers about semi-supervised 3D object detection and robust visual recognition are accepted by AAAI 2024!
[Oct 2023] One paper about self-supervised point cloud representation learning is accepted by 3DV 2024 as an oral paper!
[Sep 2023] One paper about visual domain generalization is accepted by IJCV 2023!
[Aug 2023] One paper about robust few-shot point cloud segmentation is accepted by BMVC 2023!
[Aug 2023] I am awarded a grant titled "Towards Realistic Deep Learning for 3D Vision" from A*STAR as the co-Investigator!
[Jul 2023] One paper about generalized few-shot point cloud segmentation is accepted by ICCV 2023!
[Jun 2023] One paper about 6-DoF grasps synthesis is accepted by IROS 2023!
[May 2023] One paper about monocular 3D object detection is accepted by TCSVT 2023!
[Mar 2023] I am invited to serve as Demo Chair at the Sixth IEEE International Conference on Multimedia Information Processing and Retrieval (MIPR 2023)!
[Feb 2023] I am invited to join the Organising Committee of IEEE ICME 2023 Workshop on 3D Multimedia Analytics, Search and Generation!
[Oct 2022] I am awarded a grant titled "Multi-modal Joint Learning for Scene Understanding" from SUTD-ZJU IDEA as the sole Principal Investigator!
[Sep 2022] I am awarded a grant titled "Data-efficient 3D Object Detection for Robot Perception" from TL@SUTD as the sole Principal Investigator!
[Aug 2022] I join the Singapore University of Technology and Design as an Assistant Professor!
[Jul 2022] Three papers are accepted by ECCV 2022!
[Dec 2021] One paper about class-incremental 3D object detection is accepted by AAAI 2022 as an oral paper!
[Jun 2021] I am selected for the CVPR 2021 Doctoral Consortium. My mentor is Prof. Serge Belongie!
[May 2021] I win the IMDA Excellent Prize (best thesis) for my PhD thesis!
[Mar 2021] I successfully defended my PhD thesis "Towards Learning Scene Semantics on 3D Point Clouds"!
[Mar 2021] One paper about few-shot 3D semantic segmentation is accepted by CVPR 2021!
[Aug 2020] I receive the Research Achievement Award from SoC!
[Feb 2020] One paper about semi-supervised 3D object detection is accepted by CVPR 2020 as an oral paper!

Selected Publications

Please visit my Google Scholar profile for the full publication list.
* indicates corresponding author, and # indicates co-corresponding author

	How Do Images Align and Complement LiDAR? Towards a Harmonized Multi-modal 3D Panoptic Segmentation Yining Pan, Qiongjie Cui, Xulei Yang, Na Zhao* International Conference on Machine Learning (ICML), 2025 [Preprint] [Code]
	OcSplats: Rendering Occluded Humans with Prior Knowledge Jie Zhang, Qiongjie Cui, Xulei Yang, Na Zhao* IEEE International Conference on Multimedia & Expo (ICME), 2025 [Paper (coming soon)]
	Uncertainty Meets Diversity: A Comprehensive Active Learning Framework for Indoor 3D Object Detection Jiangyi Wang, Na Zhao* IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025 [Preprint] [Paper]
	Collaborative Tree Search for Enhancing Embodied Multi-Agent Collaboration Lizheng Zu, Lin Lin, Song Fu, Na Zhao, Pan Zhou IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025 [Paper]
	CT3D++: Improving 3D Object Detection with Keypoint-induced Channel-wise Transformer Hualian Sheng, Sijia Cai, Na Zhao, Bing Deng, Qiao Liang, Min-Jian Zhao, Jieping Ye International Journal of Computer Vision (IJCV), 2025 [Preprint] [Paper] [Code]
	Dual-supervised Asymmetric Co-training for Semi-supervised Medical Domain Generalization Jincai Song, Haipeng Chen, Jun Qin#, Na Zhao# IEEE Transactions on Multimedia (TMM), 2025 [Paper (coming soon)]
	GaussianBlock: Building Part-Aware Compositional and Editable 3D Scene by Primitives and Gaussians Shuyi Jiang, Qihao Zhao, Hossein Rahmani, De Wen Soh, Jun Liu, Na Zhao* International Conference on Learning Representations (ICLR), 2025 [Preprint] [Code (coming soon)]
	AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring Xinyi Wang, Na Zhao, Zhiyuan Han, Dan Guo, Xun Yang Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025 [Preprint] [Code (coming soon)]
	Domain Expansion and Boundary Growth for Open-Set Single-Source Domain Generalization Pengkun Jiao, Na Zhao#, Jingjing Chen#, Yu-Gang Jiang IEEE Transactions on Multimedia (TMM), 2024 [Preprint] [Paper]
	GS^2-GNeSF: Geometry-Semantics Synergy for Generalizable Neural Semantic Fields Chengshun Wang, Na Zhao* ACM Multimedia (MM), 2024 [Paper]
	On-the-fly Point Feature Representation for Point Clouds Analysis Jiangyi Wang, Zhongyao Cheng, Na Zhao#, Jun Cheng, Xulei Yang# ACM Multimedia (MM), 2024 [Preprint] [Paper]
	Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image Pengkun Jiao, Na Zhao, Jingjing Chen, Yu-Gang Jiang European Conference on Computer Vision (ECCV), 2024 [Preprint] [Paper]
	View-Consistent 3D Editing with Gaussian Splatting Yuxuan Wang, Xuanyu Yi, Zike Wu, Na Zhao, Long Chen, Hanwang Zhang European Conference on Computer Vision (ECCV), 2024 [Preprint] [Paper] [Code]
	LASO: Language-guided Affordance Segmentation on 3D Object Yicong Li, Na Zhao#, Junbin Xiao, Chun Feng, Xiang Wang#, Tat-Seng Chua IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024 [Paper] [Code]
	End-to-End Semi-Supervised 3D Instance Segmentation with PCTeacher Linfeng Li, Na Zhao* IEEE International Conference on Robotics and Automation (ICRA), 2024 [Paper]
	Dual-Perspective Knowledge Enrichment for Semi-Supervised 3D Object Detection Yucheng Han, Na Zhao, Weiling Chen, Keng-Teck Ma, Hanwang Zhang Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024 [Preprint] [Paper] [Code]
	Robust Visual Recognition with Class-Imbalanced Open-World Noisy Data Na Zhao, Gim Hee Lee Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024 [Paper] [Code]
	SDCoT++: Improved Static-Dynamic Co-Teaching for Class-Incremental 3D Object Detection Na Zhao, Peisheng Qian, Fang Wu, Xun Xu, Xulei Yang, Gim Hee Lee IEEE Transactions on Image Processing (TIP), 2024 [Paper] [Code (coming soon)]
	Syn-to-Real Unsupervised Domain Adaptation for Indoor 3D Object Detection Yunsong Wang, Na Zhao, Gim Hee Lee The British Machine Vision Conference (BMVC), 2024 [Preprint] [Code]
	Synthetic-to-Real Domain Generalized Semantic Segmentation for 3D Indoor Point Clouds Yuyang Zhao, Na Zhao, Gim Hee Lee The British Machine Vision Conference (BMVC), 2024 [Preprint]
	Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding Yunsong Wang, Na Zhao, Gim Hee Lee International Conference on 3D Vision (3DV), 2024 Oral Presentation [Preprint] [Paper]
	Style-Hallucinated Dual Consistency Learning: A Unified Framework for Visual Domain Generalization Yuyang Zhao, Zhun Zhong, Na Zhao, Nicu Sebe, Gim Hee Lee International Journal of Computer Vision (IJCV), 2023 [Preprint] [Paper] [Code]
	Towards Robust Few-shot Point Cloud Semantic Segmentation Yating Xu, Na Zhao, Gim Hee Lee The British Machine Vision Conference (BMVC), 2023 [Preprint] [Paper] [Code]
	Generalized Few-Shot Point Cloud Segmentation Via Geometric Words Yating Xu, Conghui Hu, Na Zhao, Gim Hee Lee International Conference on Computer Vision (ICCV), 2023 [Preprint] [Paper] [Code]
	Refining 6-DoF Grasps with Context-Specific Classifiers Tasbolat Taunyazov, Heng Zhang, John Patrick Eala, Na Zhao, Harold Soh International Conference on Intelligent Robots and Systems (IROS), 2023 [Preprint] [Code]
	PDR: Progressive Depth Regularization for Monocular 3D Object Detection Hualian Sheng, Sijia Cai, Na Zhao#, Bing Deng, Min-Jian Zhao#, Gim Hee Lee IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2023 [Paper]
	Teaching with Soft Label Smoothing for Mitigating Noisy Labels in Facial Expressions Tohar Lukov, Na Zhao, Gim Hee Lee, Ser-Nam Lim European Conference on Computer Vision (ECCV), 2022 [Paper] [Code]
	Rethinking IoU-based Optimization for Single-stage 3D Object Detection Hualian Sheng, Sijia Cai, Na Zhao, Bing Deng, Jianqiang Huang, Xian-Sheng Hua, Min-Jian Zhao, Gim Hee Lee European Conference on Computer Vision (ECCV), 2022 [Preprint] [Paper] [Code]
	Style-Hallucinated Dual Consistency Learning for Domain Generalized Semantic Segmentation Yuyang Zhao, Zhun Zhong, Na Zhao, Nicu Sebe, Gim Hee Lee European Conference on Computer Vision (ECCV), 2022 [Preprint] [Paper] [Code]
	Static-Dynamic Co-Teaching for Class-Incremental 3D Object Detection Na Zhao, Gim Hee Lee Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022 Oral Presentation [Preprint] [Paper] [Code]
	Few-shot 3D Point Cloud Semantic Segmentation Na Zhao, Tat-Seng Chua, Gim Hee Lee IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021 [Preprint] [Paper] [Code]
	SESS: Self-Ensembling Semi-Supervised 3D Object Detection Na Zhao, Tat-Seng Chua, Gim Hee Lee IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020 Oral Presentation [Preprint] [Paper] [Code]
	PS^2-Net: A Locally and Globally Aware Network for Point-Based Semantic Segmentation Na Zhao, Tat-Seng Chua, Gim Hee Lee 25th International Conference on Pattern Recognition (ICPR), 2020 [Preprint] [Paper] [Code]

Research Grants

Principal Investigator. TL@SUTD Seed Grant. S$200,000. Mar 2025 - Mar 2027.
Topic: Bridging Language and Physical Real-world for 3D Reasoning and Object Manipulation
Principal Investigator. MoE Tier 2 Research Grant. S$994,411. Feb 2025 - Feb 2028.
Topic: Empowering Real-World 3D Scene Understanding: Navigating Noise, Distribution Shifts, and Incremental Learning
Co-Principal Investigator. AISG Research Grant. S$999,999. Jan 2025 - Jan 2028.
Topic: Sequential Deepfake Model Attribution
SUTD Principal Investigator. SMU-SUTD Joint Research Grant. S$275,000. Nov 2024 - Oct 2026.
Topic: Synthesis and Resilience: Generative Models for Generalizable 3D World Understanding
Principal Investigator. DSO Research Grant. S$998,000. Dec 2023 - Dec 2026.
Topic: Cross-modality Resiliency against Real-world Attacks
Co-Investigator. A*STAR MTC Programmatic Grant. S$9,773,400. Aug 2023 - Jul 2026.
Topic: Towards Realistic Deep Learning for 3D Vision
Principal Investigator. SUTD-ZJU Thematic Research Grant. S$148,187. Dec 2022 - Nov 2024.
Topic: Multi-modal Joint Learning for Scene Understanding
Principal Investigator. TL@SUTD Seed Grant. S$85,000. Oct 2022 - Apr 2024.
Topic: Data-efficient 3D Object Detection for Robot Perception

Academic Experience

Research Fellow. Computer Vision and Robotic Perception Laboratory, NUS. Apr 2021 - Jul 2022.
Research Associate. Computer Vision and Robotic Perception Laboratory, NUS. Jan 2021 - Mar 2021.
Research Assistant. NExT++ Research Center, NUS. Aug 2015 - Dec 2016.

Academic Services

Conference Reviewer: CVPR, ICCV, ECCV, NeurIPS, ICML, ICLR, SIGGRAPH, AAAI, IJCAI, MM, BMVC, etc.
Journal Reviewer: TPAMI, TIP, TKDE, TCSVT, TOMM, TMM, Multimedia Systems, Neurocomputing, etc.
Organizer: The 33rd ACM International Conference on Multimedia 2025 (Demo Chair), The 6th IEEE International Conference on Multimedia Information Processing and Retrieval 2023 (Demo Chair), The 2nd ICME Workshop on 3D Multimedia Analytics, Search and Generation 2023 (Chair), The 22nd International Conference on Multimedia Modeling 2016 (Publication Chair)
Technical Committee Member: IEEE-CAS Multimedia Systems & Applications (2024-2028)
Journal Associate Editor (AE): Knowledge-Based Systems (2025-)
Conference Area Chair (AC): ICLR 2025, MM 2025, NLPCC 2025
Conference Senior Program Committee (PC) Member: IJCAI 2025

Teaching Experience

Mentor, 01.400 Capstone (2), Fall 2024 & Spring 2025.
Instructor, 50.007 Machine Learning, Spring 2023 & Spring 2024 & Spring 2025.
Mentor, 01.400 Capstone (11), Fall 2023 & Spring 2024.
Instructor, 10.020 Data Driven World, Fall 2023.
Teaching Assistant, CS4242 Social Media Computing, Spring 2018 & Spring 2019.
Teaching Assistant, CS5340 Uncertainty Modeling in AI, Fall 2018.
Teaching Assistant, CG3002 Embedded Systems Design Project, Fall 2017.

Na ZHAO