Dan Qiao’s Homepage

要人心之自由，胸襟开放。要拿全世界人类曾经走过的路，都要算是我走过的路之一；要有一个远见，能超越你未见。要想办法设想，我没见到的地方，那个世界还有可能什么样。 —— 许倬云

Keep the heart free and the mind open. Regard every path that humanity has ever walked as one of the paths I have walked. Have a foresight that reaches beyond what you have not yet seen. Try to imagine what the world might be like in the places I have not seen. – Prof. Xu Zhuoyun

About me

I’m currently a third-year Ph.D. candidate advised by Assistant Prof. Baoxiang Wang and Prof. Hongyuan Zha in the School of Data Science at the Chinese University of Hong Kong, Shenzhen (CUHKSZ). My research focuses mainly on multi-agent reinforcement learning, LLM post-training, multi-agent systems, and social welfare. For more details, please refer to my Google Scholar profile.

Before that, I worked as a research assistant with Prof. Junge Zhang at the Institute of Automation, Chinese Academy of Sciences (CASIA). I received my Master’s and B.Eng. degrees in Automotive Engineering from the School of Transportation Science and Engineering at Beihang University in 2021 and 2018, respectively, advised by Associate Prof. Zhaoxia Peng. I was also fortunate to be advised by Assistant Prof. Wenhao Li at Tongji University.

Education

2022 - Present, Ph.D. Student - Computer Science, the Chinese University of Hong Kong, Shenzhen, China
2018 - 2021, M.Eng. - Automotive Engineering, Beihang University, China
2014 - 2018, B.Eng. - Automotive Engineering, Beihang University, China

News

[March 20, 2025] Our paper on Automatic Subgoal Generation (ASG) for MARL with LLMs was accepted to [AISTATS 2025]. [paper]

Research Interests

My research interests include:

Multi-agent Reinforcement Learning
Sequential Social Dilemmas
Diffusion Models
Large Language Models & Agents

Pre-prints

D. Qiao, J. Zhang*, Y. Zhang, S. Xiao, H. Chen, "Privacy-preserved Fully Decentralized Multi-agent Reinforcement Learning for Networked Social Systems". Chinese Patent, 2022. [pdf]
D. Qiao, Z. Peng*, G. Wen, T. Huang, "Novel Saturated Nussbaum-type Function based Adaptive Distributed Consensus Control of Multi-agent Systems with Unknown Arbitrary Control Directions". Preprint arXiv:2201.09453, 2022. [pdf]

Publications

W. Li, D. Qiao, B. Wang, X. Wang, B. Jin, H. Zha, "Multi-Agent Credit Assignment with Pretrained Language Models". AISTATS, 2025. [pdf]
S. Yang, Y. Hua, D. Qiao, Y. Lian, Y. Pan*, Y. He, "A coupled electrochemical-thermal-mechanical degradation modelling approach for lifetime assessment of lithium-ion batteries". Electrochimica Acta, Vol. 326, Dec. 2019, 134928. [pdf]

Misc

Feel free to follow me on Zhihu and BiliBili.

Contact

Office: Floor 4, Zhixin Building, CUHKSZ, Shenzhen, 518172