PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in Control
- Ruijie Zheng ,
- Ching-An Cheng ,
- Hal Daumé III ,
- Furong Huang ,
- Andrey Kolobov
ICML 2024 |
ORAL
Télécharger BibTexTemporal action abstractions, along with belief state representations, are a powerful knowledge sharing mechanism for sequential decision making. In this work, we propose a novel view that treats inducing temporal action abstractions as a sequence compression problem. To do so, we bring a subtle but critical component of LLM training pipelines — input tokenization via byte pair encoding (BPE) — to the seemingly distant task of learning skills of variable time span in continuous control domains. We introduce an approach called Primitive Sequence Encoding (PRISE) that combines continuous action quantization with BPE to learn powerful action abstractions. We empirically show that high-level skills discovered by PRISE from a multitask set of robotic manipulation demonstrations significantly boost the learning performance of Behavior Cloning on downstream tasks.