共撰写/参与撰写专利 0 项，录用/发表论文 1 篇，投出待录用论文0篇。学术成果部分从赵云波教授个人维护的bib文件自动生成，只包含其共同署名的论文/专利（联合培养或代为指导学生可能有未署名论文/专利，不会在此展示），会因为更新不及时而缺失部分论文/专利，如有缺失请及时与老师联系添加更新。
Shared Autonomy Based on Human-in-the-loop Reinforcement Learning with Policy Constraints
Yun-Bo Zhao ,
and Shiyi You
In 2022 41st Chin. Control Conf. CCC
In shared autonomous systems, humans and agents cooperate to complete tasks. Since reinforcement learning enables agents to train good policies through trial and error without knowing the dynamic model of the environment, it has been well applied in shared autonomous systems. After inferring the target from human inputs, agents trained by RL can accurately act accordingly. However, existing methods of this kind have big problems: the training of reinforcement learning algorithms require lots of exploration, which is time-consuming, lack of security guarantee and likely to cause great losses in the training process. Moreover, most of shared control methods are human-oriented, and do not consider the situation that humans may make wrong actions. In view of the above problems, this paper proposes human-in-the-loop reinforcement learning with policy constraints. In the training process, human prior knowledge is used to constrain the exploration of agents to achieve fast and efficient learning. In the process of testing we incorporate policy constraints in the arbitration to avoid serious consequences caused by human mistakes.