How would you say it? Eliciting lexically diverse data for supervised semantic parsing

Abhilasha Ravichander, Thomas Manzini, Matthias Grabmair, Graham Neubig, Jonathan Francis, Eric Nyberg

Publication: Contribution to book/report/conference proceedings, conference contribution, peer-reviewed

12 citations (Scopus)

Abstract

Building dialogue interfaces for real-world scenarios often entails training semantic parsers starting from zero examples. How can we build datasets that better capture the variety of ways users might phrase their queries, and what queries are actually realistic? Wang et al. (2015) proposed a method to build semantic parsing datasets by generating canonical utterances using a grammar and having crowdworkers paraphrase them into natural wording. A limitation of this approach is that it biases crowdworkers towards language similar to that of the canonical utterances. In this work, we present a methodology that elicits meaningful and lexically diverse queries from users for semantic parsing tasks. Starting from a seed lexicon and a generative grammar, we pair logical forms with mixed text-image representations and ask crowdworkers to paraphrase and confirm the plausibility of the queries that they generated. We use this method to build a semantic parsing dataset from scratch for a dialogue agent in a smart-home simulation. We find evidence that this dataset, which we have named SMARTHOME, is more lexically diverse and more difficult to parse than existing domain-specific semantic parsing datasets.

Original language: English
Title of host publication: SIGDIAL 2017 - 18th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference
Publisher: Association for Computational Linguistics (ACL)
Pages: 374-383
Number of pages: 10
ISBN (electronic): 9781945626821
Publication status: Published - 2017
Externally published: Yes
Event: 18th Annual Meeting of the Special Interest Group on Discourse and Dialogue, SIGDIAL 2017 - Saarbrücken, Germany
Duration: 15 Aug. 2017 to 17 Aug. 2017

Publication series

Name: SIGDIAL 2017 - 18th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference

Conference

Conference: 18th Annual Meeting of the Special Interest Group on Discourse and Dialogue, SIGDIAL 2017
Country/Territory: Germany
City: Saarbrücken
Period: 15/08/17 to 17/08/17
