Table of Links
2 Survey with Industry Professionals
3 RQ1: Real-World use cases that necessitate output constraints
4.2 Integrating with Downstream Processes and Workflows
4.3 Satisfying UI and Product Requirements and 4.4 Improving User Experience, Trust, and Adoption
5.2 The Case for NL: More Intuitive and Expressive for Complex Constraints
6 The Constraint maker Tool and 6.1 Iterative Design and User Feedback
5 HOW TO ARTICULATE OUTPUT CONSTRAINTS TO LLMS
Fig. 1 shows distributions of respondentsâ preferences towards specifying output constraints either through GUI or natural language. An overarching observation is that respondents preferred using GUI to specify low-level constraints and natural language to express high-level constraints. We discuss their detailed rationale below:
5.1 The case for GUI: A Quick, Reliable, and Flexible Way of Prototyping Constraints
First and foremost, respondents considered GUIs particularly effective for defining âhard requirements,â providing more reliable results, and reducing ambiguity compared to natural language instructions. For example, one argued that choosing âbooleanâ as the output type via a GUI felt much more likely to be âhonouredâ compared to âtyp[ing] that I want a Yes / No response [...] in a prompt.â Another claimed that âflagging a âJSONâ buttonâ provides a much better user experience than âtyping âoutput as JSONâ across multiple prompts.â In addition, respondents preferred using GUI when the intended constraint is âobjectiveâ and âquantifiableâ, such as âuse only items x,y,z,â or âa JSON with certain fields specified.â Moreover, respondents found GUI to be more flexible for rapid prototyping and experimentation (e.g., âwhen I want to play around with different numbers, moving a slider around seems easier than typingâ). Finally, for novice LLM users, the range of choices afforded by a GUI constraint can help clarify the modelâs capabilities and limitations, âmaking the model seems less like a black box.â One respondent drew from their experience working with text-to-image models to underscore this point: âby seeing âIllustrationâ as a possible output style [among others like âPhoto realisticâ or âCartoonâ], I became aware of [the modelâs] capabilities.â
This paper is available on arxiv under CC BY-NC-SA 4.0 DEED license.
Authors:
(1) Michael Xieyang Liu, Google Research, Pittsburgh, PA, USA ([email protected]);
(2) Frederick Liu, Google Research, Seattle, Washington, USA ([email protected]);
(3) Alexander J. Fiannaca, Google Research, Seattle, Washington, USA ([email protected]);
(4) Terry Koo, Google, Indiana, USA ([email protected]);
(5) Lucas Dixon, Google Research, Paris, France ([email protected]);
(6) Michael Terry, Google Research, Cambridge, Massachusetts, USA ([email protected]);
(7) Carrie J. Cai, Google Research, Mountain View, California, USA ([email protected]).