Hybridizing Constraint Programming and Monte-Carlo Tree Search: Application to the Job Shop Problem
- Manuel Loth ,
- Michéle Sebag ,
- Youssef Hamadi ,
- Marc Schoenauer ,
- Christian Schulte
Learning and Intelligent Optimization |
Published by Springer, Berlin, Heidelberg
PDF | Publication | Publication | Publication | Publication
Constraint Programming CP solvers classically explore the solution space using tree search-based heuristics. Monte-Carlo Tree-Search MCTS, a tree-search based method aimed at sequential decision making under uncertainty, simultaneously estimates the reward associated to the sub-trees, and gradually biases the exploration toward the most promising regions. This paper examines the tight combination of MCTS and CP on the job shop problem JSP. The contribution is twofold. Firstly, a reward function compliant with the CP setting is proposed. Secondly, a biased MCTS node-selection rule based on this reward is proposed, that is suitable in a multiple-restarts context. Its integration within the Gecode constraint solver is shown to compete with JSP-specific CP approaches on difficult JSP instances.