Employing Web Search Query Click Logs For Multi-Domain Spoken Language Understanding
- Dilek Hakkani-Tur
- Gokhan Tur
- Larry Heck
- Ashley Fidler
- Rukmini Iyer
- Sarangarajan Parthasarathy
IEEE Workshop on Automatic Speech Recognition & Understanding
Published by IEEE | Organized by IEEE
Logs of user queries from a search engine (such as Bing or Google), together with the links clicked, provide valuable implicit feedback for improving statistical spoken language understanding (SLU) models. In this work, we propose enriching the existing classification feature set for domain detection with features computed from the click distribution over a set of clicked URLs in search query click logs (QCLs) of user utterances. Because natural language utterances differ stylistically from keyword search queries, we first filter out domain-independent salient phrases and then apply a syntax-based transformation to the original utterances so that they can be matched with related search queries. This approach significantly improves domain detection, especially for web-related user utterances.
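As a rough illustration of what a click-distribution feature could look like, the sketch below turns per-query click counts into a normalized distribution over a fixed set of hosts. It is not the authors' implementation: the toy log `TOY_QCL`, the host list `HOSTS`, the function name `click_distribution_features`, and the choice to aggregate clicks by host name are all assumptions made for this example.

```python
from collections import Counter
from urllib.parse import urlparse

# Hypothetical toy click log: query -> clicked URLs, one entry per click.
TOY_QCL = {
    "weather seattle": [
        "https://weather.example.com/seattle",
        "https://weather.example.com/seattle/hourly",
        "https://news.example.org/seattle-storm",
    ],
    "cheap flights boston": [
        "https://flights.example.net/boston",
    ],
}

# Hypothetical fixed set of hosts that defines the feature dimensions.
HOSTS = ["weather.example.com", "news.example.org", "flights.example.net"]


def click_distribution_features(query, qcl, hosts):
    """Estimate P(host | query) from click counts, one value per host.

    Returns an all-zero vector when the query has no recorded clicks.
    """
    # Aggregating by host is an assumption of this sketch.
    clicks = Counter(urlparse(url).netloc for url in qcl.get(query, []))
    total = sum(clicks.values())
    if total == 0:
        return [0.0] * len(hosts)
    return [clicks[host] / total for host in hosts]


features = click_distribution_features("weather seattle", TOY_QCL, HOSTS)
```

A vector like `features` could then be appended to the lexical features already used by a domain classifier. The sketch assumes the utterance has already been transformed into a query that appears in the log; that transformation step is not shown.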