Reinforcement Learning for Turn-Taking Management in Incremental Spoken Dialogue Systems
- Hatim Khouzaimi ,
- Romain Laroche ,
- Fabrice Lefevre
Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI) |
In this article, reinforcement learning is used to learn an optimal turn-taking strategy for vocal human-machine dialogue. The Orange Labs’ Ma-jordomo dialogue system, which allows the users to have conversations within a smart home, has been upgraded to an incremental version. First, a user simulator is built in order to generate a dialogue corpus which thereafter is used to optimise the turn-taking strategy from delayed rewards with the Fitted-Q reinforcement learning algorithm. Real users test and evaluate the new learnt strategy, versus a non-incremental and a handcrafted incremen-tal strategies. The data-driven strategy is shown to significantly improve the task completion ratio and to be preferred by the users according to subjective metrics.