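Research Background

Dynamic game decision-making under incomplete information is widely used in areas such as complex economic activities and human-machine confrontation, and has long been a central topic in game theory. In the era of artificial intelligence, the massive growth of data, the leap in machine intelligence, and the rapid development of applications such as human-machine confrontation have given rise to complex environments characterized by "big-data information", "highly intelligent algorithms", and "strongly dynamic confrontation", respectively. These in turn produce three forms of incomplete information: "unusable" information, where massive data cannot be exploited directly; "unknowable" information, where deep learning algorithms cannot be explained; and "untrustworthy" information, which arises under adversarial deception. The emergence of these new forms poses new challenges to the theory of dynamic game decision-making under incomplete information. Meeting these challenges both reshapes traditional game theory for the AI era and drives the development of new AI theory and technology, which is of far-reaching significance in the current window of opportunity.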

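Research Objectives

Identify the sources, types, and static characterization of the different forms of incomplete information relevant to game decision-making in complex environments; propose methods for describing dynamic game decision-making under "unusable" (big-data information), "unknowable" (highly intelligent algorithms), and "untrustworthy" (strongly dynamic confrontation) incomplete information; and ultimately establish an intelligent foundation model for game decision-making under incomplete information in complex environments.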

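Main Research Topics

  • Static characterization of the different forms of incomplete information in complex environments
  • Feature analysis of dynamic game decision-making under the different forms of incomplete information
  • Construction of an intelligent foundation model for game decision-making under incomplete information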

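Major Scientific Questions or Key Technical Problems to Be Addressed

  • Understanding the "uncertainty" of complex environments: how can incomplete information in complex environments be characterized, and how does it affect dynamic game models?
  • Building an intelligent foundation model under "uncertainty": how can the "uncertain" nature of complex environments be incorporated into a dynamic game framework to construct a new form of intelligent foundation model?
  • Usability of the intelligent foundation model: how can the model be guaranteed to meet the required real-time performance so that it is usable in practice?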

Basic Research Framework

Related Reading

Research Outputs

Journal Articles

  1. A Novel Prescribed-Time Control Approach of State-Constrained High-Order Nonlinear Systems. Yangang Yao, Yu Kang, Yunbo Zhao, Pengfei Li, and Jieqing Tan. IEEE Trans. Syst. Man Cybern. Syst., 2024.
  2. 基于动态信道切换的无线网络化控制系统的资源调度策略 [Resource scheduling strategy for wireless networked control systems based on dynamic channel switching]. 郝小梅 and 赵云波. 高技术通讯 (Chinese High Technology Letters), 2023.
  3. Compound Event-Triggered Distributed MPC for Coupled Nonlinear Systems. Yu Kang, Tao Wang, Pengfei Li, Zhenyi Xu, and Yun-Bo Zhao. IEEE Trans. Cybern., 2023.
  4. Robust Nonsingular Fixed Time Terminal Sliding Mode Control for Atmospheric Pollution Detection Lidar Scanning Mechanism. Yu Kang, Yuxiao Yang, Cai Chen, Wenjun Lü, and Yunbo Zhao. J Syst Sci Complex, 2023.
  5. Disturbance Prediction-Based Adaptive Event-Triggered Model Predictive Control for Perturbed Nonlinear Systems. Pengfei Li, Yu Kang, Tao Wang, and Yun-Bo Zhao. IEEE Trans. Automat. Contr., 2023.
  6. Leader-Following Cluster Consensus of Multiagent Systems With Measurement Noise and Weighted Cooperative–Competitive Networks. Cui-Qin Ma, Tian-Ya Liu, Yu Kang, and Yun-Bo Zhao. IEEE Trans. Syst. Man Cybern. Syst., 2023.
  7. 基于轨迹预测与改进人工势场法的机械臂动态避障规划方法 [Dynamic obstacle-avoidance planning method for robotic arms based on trajectory prediction and an improved artificial potential field method]. 吴芳 and 赵云波. 高技术通讯 (Chinese High Technology Letters), 2023.
  8. 基于丢包率估计的无线网络化控制系统的逼近控制策略 [Approximation control strategy for wireless networked control systems based on packet-loss-rate estimation]. 吴芳, 梁启鹏, 叶睿卿, and 赵云波. 高技术通讯 (Chinese High Technology Letters), 2023.
  9. 基于优先级预测器的无线网络化控制系统的动态传输策略 [Dynamic transmission strategy for wireless networked control systems based on a priority predictor]. 闫文晓 and 赵云波. 高技术通讯 (Chinese High Technology Letters), 2023.
  10. 面向人机序贯决策实现共享控制下的仲裁优化 [Arbitration optimization under shared control for human-machine sequential decision-making]. 张倩倩, 赵云波, 吕文君, and 陈谋. 中国科学:信息科学 (Scientia Sinica Informationis), 2023.
  11. 非完全信息下人机合作对抗博弈专题编者按 [Editorial note for the special topic on human-machine cooperative and adversarial games under incomplete information]. 康宇, 段海滨, and 赵云波. 中国科学:信息科学 (Scientia Sinica Informationis), 2022.
  12. Integrated Channel-Aware Scheduling and Packet-Based Predictive Control for Wireless Cloud Control Systems. Pengfei Li, Yun-Bo Zhao, and Yu Kang. IEEE Trans. Cybern., 2022.
  13. Event-Based Model Predictive Control for Nonlinear Systems with Dynamic Disturbance. Pengfei Li, Tao Wang, Yu Kang, Kun Li, and Yun-Bo Zhao. Automatica, 2022.
  14. Cluster Consensus for Coupled Harmonic Oscillators Under a Weighted Cooperative-Competitive Network. Cui-Qin Ma, Tian-Ya Liu, and Yun-Bo Zhao. International Journal of Control, 2022.
  15. 非全时有效人类决策下的人机共享自主方法 [A human-machine shared autonomy method under human decisions that are not effective at all times]. 游诗艺, 康宇, 赵云波, and 张倩倩. 中国科学:信息科学 (Scientia Sinica Informationis), 2022.
  16. Traded Control of Human–Machine Systems for Sequential Decision-Making Based on Reinforcement Learning. Qianqian Zhang, Yu Kang, Yun-Bo Zhao, Pengfei Li, and Shiyi You. IEEE Trans. Artif. Intell., 2022.
  17. 利用人的分歧介入增强珍珠自动分拣可靠性研究 [Research on enhancing the reliability of automatic pearl sorting through disagreement-triggered human intervention]. 花婷婷, 王岭人, and 赵云波. 计算机测量与控制 (Computer Measurement & Control), 2021.
  18. A Novel Self-Triggered MPC Scheme for Constrained Input-Affine Nonlinear Systems. Pengfei Li, Yu Kang, Yun-Bo Zhao, and Tao Wang. IEEE Trans. Circuits Syst. II, 2021.
  19. A Novel Inertial-Visual Heading Determination System for Wheeled Mobile Robots. Wenjun Lv, Yu Kang, Yun-Bo Zhao, Yuping Wu, and Wei Xing Zheng. IEEE Trans. Contr. Syst. Technol., 2021.
  20. Robust Approximation-Based Event-Triggered MPC for Constrained Sampled-Data Systems. Tao Wang, Yu Kang, Pengfei Li, Yun-Bo Zhao, and Peilong Yu. J Syst Sci Complex, 2021.
  21. Multi-Path Switching Protection for Networked Control Systems Under Unbounded DoS Attacks. Qiaohui Zhu, Qipeng Liang, Yu Kang, and Yun-Bo Zhao. J. Univ. Sci. Technol. China, 2021.
  22. Robust Model Predictive Control for Constrained Networked Nonlinear Systems: An Approximation-Based Approach. Tao Wang, Yu Kang, Pengfei Li, Yun-Bo Zhao, and Peilong Yu. Neurocomputing, 2020.

Conference Articles

  1. A Human-Machine Trust Model Integrating Machine Estimated Performance. Shaojun Chen, Yun-Bo Zhao, Yang Wang, and Junsen Lu. In 2023 6th Int. Symp. Auton. Syst. (ISAS), 2023.
  2. Spectrally Normalized Adaptive Neural Identifier for Dynamic Modeling and Trajectory Tracking Control of Unmanned Aerial Vehicle. Shaofeng Chen, Yu Kang, Yunbo Zhao, and Yang Cao. In Adv. Guid. Navig. Control, 2023.
  3. Swap Softmax Twin Delayed Deep Deterministic Policy Gradient. Chaohu Liu and Yunbo Zhao. In 2023 6th Int. Symp. Auton. Syst. (ISAS), 2023.
  4. Shared Autonomy Based on Human-in-the-loop Reinforcement Learning with Policy Constraints. Ming Li, Yu Kang, Yun-Bo Zhao, Jin Zhu, and Shiyi You. In 2022 41st Chin. Control Conf. (CCC), 2022.
  5. Strategy Generation Based on DDPG with Prioritized Experience Replay for UCAV. Junsen Lu, Yun-Bo Zhao, Yu Kang, Yuhui Wang, and Yimin Deng. In 2022 Int. Conf. Adv. Robot. Mechatron. (ICARM), 2022.
  6. Air Combat Maneuver Decision Based on Deep Reinforcement Learning and Game Theory. Shuhui Yin, Yu Kang, Yun-Bo Zhao, and Jian Xue. In 2022 41st Chin. Control Conf. (CCC), 2022.
  7. Self-Triggered Model Predictive Control for Perturbed Nonlinear Systems: An Iterative Implementation. Tao Wang, Pengfei Li, Yu Kang, and Yun-Bo Zhao. In 2021 60th IEEE Conf. Decis. Control (CDC), 2021.
  8. Approximation-Based Self-Triggered Model Predictive Control for Perturbed Nonlinear Systems. Chang Xu, Yu Kang, Yun-Bo Zhao, Pengfei Li, and Tao Wang. In 2021 China Autom. Congr. (CAC), 2021.
  9. Adaptive Arbitration for Minimal Intervention Shared Control via Deep Reinforcement Learning. Shiyi You, Yu Kang, Yun-Bo Zhao, and Qianqian Zhang. In 2021 China Autom. Congr. (CAC), 2021.
  10. Event-Triggered Adaptive Horizon Model Predictive Control for Perturbed Nonlinear Systems. Pengfei Li, Tao Wang, Yu Kang, and Yun-Bo Zhao. In 2020 59th IEEE Conf. Decis. Control (CDC), 2020.
  11. Synthesis of Wireless Networked Control System Based on Round-trip Delay Online Estimation. Liang Lu, Qipeng Liang, Qiaohui Zhu, and Yun-Bo Zhao. In 2020 Chin. Autom. Congr. (CAC), 2020.
  12. Detection of Distracted Driving Based on Multi-Granularity and Middle-Level Features. Min Tang, Fang Wu, Li-Li Zhao, Qi-Peng Liang, Jian-Wu Lin, and Yun-Bo Zhao. In 2020 Chin. Autom. Congr. (CAC), 2020.
  13. Autonomous Boundary of Human-Machine Collaboration System Based on Reinforcement Learning. Qianqian Zhang, Yun-Bo Zhao, and Yu Kang. In 2020 Aust. N. Z. Control Conf. (ANZCC), 2020.

Theses

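  1. 基于训练和执行双阶段联合设计的人机智能决策方法研究 [Research on human-machine intelligent decision-making methods based on joint design of the training and execution stages]. 李明. 中国科学技术大学, 合肥 (University of Science and Technology of China, Hefei), 2023.
  2. 基于博弈模型的无人机机动决策方法研究 [Research on UAV maneuver decision-making methods based on game models]. 殷书慧. 中国科学技术大学, 合肥 (University of Science and Technology of China, Hefei), 2023.
  3. 基于人类决策有效性的人机混合决策方法研究 [Research on human-machine hybrid decision-making methods based on the effectiveness of human decisions]. 游诗艺. 中国科学技术大学, 合肥 (University of Science and Technology of China, Hefei), 2022.
  4. 基于机器视觉的驾驶员注意力检测系统设计 [Design of a driver attention detection system based on machine vision]. 唐敏. 浙江工业大学, 杭州 (Zhejiang University of Technology, Hangzhou), 2021.
  5. 面向人机序贯决策的混合智能方法研究 [Research on hybrid intelligence methods for human-machine sequential decision-making]. 张倩倩. 中国科学技术大学, 合肥 (University of Science and Technology of China, Hefei), 2021.
  6. 无界 DoS 攻击下网络化控制系统的防护设计 [Protection design for networked control systems under unbounded DoS attacks]. 朱巧慧. 浙江工业大学, 杭州 (Zhejiang University of Technology, Hangzhou), 2021.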

Books

  1. 人机混合智能系统自主性理论和方法 [Theory and methods of autonomy in human-machine hybrid intelligent systems]. 赵云波, 康宇, and 朱进. 科学出版社 (Science Press), 2021.

Patents

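  1. 一种基于多算法集成的分歧介入珍珠分拣方法 [A disagreement-intervention pearl sorting method based on multi-algorithm ensemble]. 赵云波, 花婷婷, 赵丽丽, and 崔奇. 2024.
  2. 一种基于POMDP和面部行为分析的驾驶培训辅助方法 [A driving-training assistance method based on POMDP and facial behavior analysis]. 赵云波, 吴芳, 赵丽丽, and 崔奇. 2024.
  3. 一种基于动态信道选择的资源调度方法 [A resource scheduling method based on dynamic channel selection]. 赵云波 and 郝小梅. 2022.
  4. 一种基于多路径切换的无限制DoS攻击防护方法 [A protection method against unrestricted DoS attacks based on multi-path switching]. 赵云波 and 朱巧慧. 2022.
  5. 一种基于偏见神经元的数据去偏方法和装置 [A data debiasing method and apparatus based on biased neurons]. 陈晋音, 陈一鸣, 陈奕芃, 郑海斌, and 赵云波.
  6. 一种面向深度学习的数据去偏方法及装置 [A data debiasing method and apparatus for deep learning]. 陈晋音, 陈奕芃, 郑海斌, and 赵云波.
  7. 一种基于深度强化学习的共享自主方法 [A shared autonomy method based on deep reinforcement learning]. 康宇, 游诗艺, 赵云波, and 吕文君.
  8. 一种基于多粒度特征与中层特征的分心驾驶检测方法 [A distracted-driving detection method based on multi-granularity and middle-level features]. 赵云波, 唐敏, 花婷婷, and 赵丽丽.
  9. 一种基于图像的电脑操作员的疲劳检测方法 [An image-based fatigue detection method for computer operators]. 赵云波, 唐敏, 朱创, and 孙悦铖.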

Project Members

赵云波 刘畅 卢峻森 吴芳 周雅情 夏睿钰 张雯 李明 殷书慧 汪洋 游诗艺 王中月 田霞 花婷婷 蒋舒悦 谢祖浩 赵丽丽 陈少军 黄康杰 黄润山

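Project Collaborators

  • 康宇, Professor, Department of Automation, University of Science and Technology of China
  • 朱进, Associate Professor, Department of Automation, University of Science and Technology of China
  • 陈晋音, Professor, College of Information Engineering & Institute of Cyberspace Security, Zhejiang University of Technology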