工作簡歷
2011.09 - 2015.06.18,南京航空航天大學(xué),本科
毛航宇研究方向?yàn)閺?qiáng)化學(xué)習(xí)、大模型智能體系統(tǒng)及芯片架構(gòu)。在人工智能頂會(huì)和頂刊上發(fā)表論文 50 余篇,申請(qǐng)專利 10 余項(xiàng),作為負(fù)責(zé)人和核心骨干承擔(dān) 10 余項(xiàng)國家自然科學(xué)基金項(xiàng)目、中國科學(xué)院引才項(xiàng)目、千萬級(jí)別企業(yè)內(nèi)部項(xiàng)目、校企合作項(xiàng)目等,獲多項(xiàng)省部級(jí)及以上獎(jiǎng)勵(lì)。長期擔(dān)任人工智能頂會(huì)的高級(jí)程序委員會(huì)委員、地方政府智庫專家;曾在多家高科技互聯(lián)網(wǎng)企業(yè)擔(dān)任研發(fā)團(tuán)隊(duì)負(fù)責(zé)人,具備豐富的產(chǎn)業(yè)落地經(jīng)驗(yàn),主導(dǎo)開發(fā)的大模型智能體系統(tǒng)實(shí)現(xiàn)超千萬的用戶規(guī)模及月活。
強(qiáng)化學(xué)習(xí)、智能體與多智能體系統(tǒng)、大模型、AI 芯片與系統(tǒng)
全部論文參考谷歌學(xué)術(shù):??
https://scholar.google.com/citations?user=EtVHsgcAAAAJ
本人指導(dǎo)的學(xué)生一作/本人二作或共同一作(5 篇代表作):??
1. Guanting Dong*, Hangyu Mao*, Kai Ma, Licheng Bao, Yifei Chen, Zhongyuan??
Wang, Zhongxia Chen, Jiazhen Du, Huiyang Wang, Fuzheng Zhang, Guorui Zhou,??
Yutao Zhu, Ji-Rong Wen, and Zhicheng Dou. Agentic Reinforced Policy??
Optimization. ICLR 2026.
2. Bin Zhang, Hangyu Mao, Lijuan Li, Zhiwei Xu, Dapeng Li, Rui Zhao, and Guoliang??
Fan. Sequential Asynchronous Action Coordination in Multi-Agent Systems: A??
Stackelberg Decision Transformer Approach. ICML 2024 (CCF-A).
3. Yiqun Chen, Hangyu Mao, Jiaxin Mao, Shiguang Wu, Tianle Zhang, Bin Zhang, Wei??
Yang, and Hongxing Chang. PTDE: Personalized Training with Distilled Execution??
for Multi-Agent Reinforcement Learning. IJCAI 2024 (CCF-A).
4. Mingzhe Xing, Hangyu Mao, Shenglin Yin, Lichen Pang, Zhengchao Zhang, Zhen??
Xiao, and Jieyi Long. A Dual-Agent Scheduler for Distributed Deep Learning Jobs??
on Public Cloud via Reinforcement Learning. KDD 2023 (CCF-A).
5. Mingzhe Xing, Hangyu Mao, and Zhen Xiao. Fast and Fine-grained Autoscaler for??
Streaming Jobs with Reinforcement Learning. IJCAI 2022 (CCF-A)
人才隊(duì)伍