Bin Zhao



📍 Xi'an, Shannxi, China
📧binzhao111@gmail.com
📧bin@nwpu.edu.cn
[Google Scholar]
[IPEC Group]


Github Homepage

Welcome to Bin Zhao (赵斌)'s Homepage


I am a Associate Professor at Northwestern Polytechnical University, China. My research focuses on the integration of artificial intelligence with hardware and software, as well as embodied intelligence. Our team is dedicated to leveraging physical priors, multi-sensor integration, and mobile platforms to enhance the environmental perception, semantic interaction, and autonomous decision-making capabilities of intelligent agents such as humanoid robots, drones, robotic arms, and quadruped robots. We warmly welcome undergraduate students, as well as master's and doctoral students interested in computer vision, embodied intelligence, and robotic hardware, to join us for internships and academic exchanges.


News:

  • [12/2024]Two papers are accepted by AAAI2024
  • [9/2024]One paper is accepted by CoRL2024
  • [7/2024]One paper is accepted by ECCV2024
  • [5/2024]One paper is accepted by RSS2024
  • [5/2024]Three papers are accepted by ICML2024
  • [4/2024]Two papers (EN-SLAM and GS-SLAM) are accepted as CVPR2024 Highlights
  • [2/2024]Three papers are accepted by CVPR2024
  • [1/2024]Two papers are accepted by ICRA2024

Group:


Zhaojian Li 李昭健
Ph.D. candidate

Pengfei Han 韩鹏飞
Ph.D. candidate

Guanzhou Lan 兰冠洲
Ph.D. candidate

BinHao Ren 任炳浩
Ph.D. candidate

Kehui Liu 刘坷卉
Ph.D. candidate

Alumni:


Selected Publications:

AlignBot: Aligning VLM-powered Customized Task Planning with User Reminders Through Fine-Tuning for Household Robots
Arxiv, 2024
Zhaxizhuoma, Pengan Chen, Ziniu Wu, Jiawei Sun, Dong Wang, Peng Zhou, Nieqing Cao, Yan Ding, Bin Zhao, Xuelong Li.
PDF| Project Page







COHERENT: Collaboration of Heterogeneous Multi-Robot System with Large Language Models
Arxiv, 2024
Kehui Liu, Zixin Tang2, Dong Wang, Zhigang Wang, Bin Zhao, Xuelong Li.
PDF| Project Page| Code







Fast-UMI: A Scalable and Hardware-Independent Universal Manipulation Interface
arxiv, 2024
Ziniu Wu, Tianyu Wang, Zhaxizhuoma, Chuyue Guan, Zhongjie Jia, Shuai Liang, Haoming Song, Delin Qu, Dong Wang, Zhigang Wang, Nieqing Cao, Yan Ding, Bin Zhao, Xuelong Li.
PDF| Project Page| Code







Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning
Arxiv, 2024
Yunpeng Gao, Zhigang Wang, Linglin Jing, Dong Wang, Xuelong Li, Bin Zhao.
PDF| Project Page| Code







HPL-ESS: Hybrid Pseudo-Labeling for Unsupervised Event-based Semantic Segmentation
CVPR, 2024
Linglin Jing, Yiming Ding, Yunpeng Gao, Zhigang Wang, Xu Yan, Dong Wang1, Gerald Schaefer,Hui Fang, Bin Zhao, Xuelong Li.
PDF| Project Page








KOI: Accelerating Online Imitation Learning via Hybrid Key-state Guidance
CoRL, 2024
Jingxian Lu, Wenke Xia, Dong Wang, Zhigang Wang, Bin Zhao, Di Hu, Xuelong Li.
PDF







SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation
ICML, 2024
Junjie Zhang, Chenjia Bai, Haoran He, Wenke Xia, Zhigang Wang, Bin Zhao, Xiu Li, Xuelong Li.
PDF| Project Page





Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning
ICML, 2024
Xiaoyu Wen, Chenjia Bai, Kang Xu, Xudong Yu, Yang Zhang, Xuelong Li, Zhen Wang.
PDF







Constrained Ensemble Exploration for Unsupervised Skill Discovery
ICML, 2024
Chenjia Bai, Rushuai Yang, Qiaosheng Zhang, Kang Xu, Yi Chen, Ting Xiao, Xuelong Li.
PDF







Implicit Event-RGBD Neural SLAM
CVPR, 2024 Highlight
Delin Qu, Chi Yan, Dong Wang, Jie Yin, Dan Xu, Bin Zhao, Xuelong Li.
PDF| Project Page| Code







GS-SLAM: Dense Visual SLAM with 3D Gaussian Splatting
CVPR, 2024
Chi Yan, Delin Qu, Dan Xu, Bin Zhao, Zhigang Wang, Dong Wang, Xuelong Li.
PDF| Project Page| Code







X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-modal Knowledge Transfer
AAAI, 2024
Linglin Jing, Ying Xue, Xu Yan, Chaoda Zheng, Dong Wang, Ruimao Zhang, Zhigang Wang, Hui Fang, Bin Zhao, Zhen Li.
PDF| Code





Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models
AAAI, 2024
Yiwen Tang, Ray Zhang, Zoey Guo, Dong Wang, Zhigang Wang, Bin Zhao, Xuelong Li.
PDF| Project Page| Code







Color Event Enhanced Single-Exposure HDR Imaging
AAAI, 2024
Mengyao Cui, Zhigang Wang, Dong Wang, Bin Zhao, Xuelong Li.
PDF





ASF-Transformer
Optics Express, 2024
Ziran Zhang, Bin Zhao, Yueting Chen, Zhigang Wang, Dong Wang, Jiawei Sun, Jie Zhang, Zhihai Xu, Xuelong Li.
PDF| Project Page| Code



AI-driven projection tomography with multicore fibre-optic cell rotation
Nature Communications, 2024
Jiawei Sun, Bin Yang, Nektarios Koukourakis, Jochen Guck, Juergen W. Czarske.
PDF







Calibration-free quantitative phase imaging in multi-core fiber endoscopes using end-to-end deep learning
Optics Letters, 2024
Jiawei Sun, Bin Zhao, Dong Wang, Zhigang Wang, Jie Zhang, Nektarios Koukourakis, Júergen W. Czarske, and Xuelong Li.
PDF





Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding
ECCV, 2024
Yiwen Tang, Ray Zhang, Jiaming Liu, Zoey Guo, Dong Wang, Zhigang Wang, Bin Zhao, Shanghang Zhang, Peng Gao, Hongsheng Li, Xuelong Li.
PDF| Project Page| Code





Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning
Arxiv, 2024
Haoran He, Chenjia Bai, Ling Pan, Weinan Zhang, Bin Zhao, Xuelong Li.
PDF| Project Page





Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMs
ICRA, 2024
Wenke Xia, Dong Wang, Xincheng Pang, Zhigang Wang, Bin Zhao, Di Hu, Xuelong Li.
PDF| Project Page| Code





Robust quadrupedal locomotion via risk-averse policy learning
ICRA, 2024
Jiyuan Shi, Chenjia Bai, Haoran He, Lei Han, Dong Wang2, Bin Zhao, Mingguo Zhao, Xiu Li, Xuelong Li.
PDF| Project Page| Code





Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning
NeurIPS , 2023
Haoran He, Chenjia Bai, Kang Xu, Zhuoran Yang, Weinan Zhang, Dong Wang, Bin Zhao, Xuelong Li.
PDF





Cross-Domain Policy Adaptation via Value-Guided Data Filtering
NIPS, 2023
Kang Xu1, Chenjia Bai, Xiaoteng Ma, Dong Wang, Bin Zhao, Zhen Wang, Xuelong Li, Wei Li.
PDF





Motion-Aware Video Frame Interpolation
arXiv , 2023
Pengfei Han, Fuhua Zhang, Bin Zhao, Xuelong Li.
PDF





Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning
Artificial Intelligenc, 2023
Chenjia Bai, Lingxiao Wang, Jianye Hao, Zhuoran Yang, Bin Zhaoa, Zhen Wange, Xuelong Li.
PDF| Project Page| Code





Vehicle Perception from Satellite
IEEE transactions on pattern analysis and machine intelligence, 2023
Bin Zhao, Pengfei Han, Xuelong Li.
PDF| Project Page| Code





ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance
ICCV, 2023
Zoey Guo, Yiwen Tang, Ray Zhang, Dong Wang, Zhigang Wang, Bin Zhao, Xuelong Li.
PDF| Project Page| Code





Towards Nonlinear-Motion-Aware and Occlusion-Robust Rolling Shutter Correction
ICCV, 2023
Delin Qu, Yizhen Lao, Zhigang Wang, Dong Wang, Bin Zhao, Xuelong Li.
PDF| Project Page| Code