Text-to-structure interpretation of user requests in BIM interaction

Yinyi Wei, Xiao Li, Frank Petzold

Research output: Contribution to journalArticlepeer-review

Abstract

Numerous efforts have been devoted to utilizing a natural language-based interface for BIM interaction. These interfaces require extracting user's intent (i.e., the operation type) and slots (i.e., the targeted elements and properties). However, there is a lack of a fine-grained approach for extracting intent and slot information simultaneously. This paper introduces a text-to-structure approach based on language models to interpret user requests for BIM interaction (T2S4BIM). It proposed a synthetic data generation method and a curated dataset as data support. Employing Transformer-based models, T2S4BIM converts unstructured user requests into a structured format with intent and slot information. Experiments demonstrated that T2S4BIM outperformed existing approaches, with encoder-decoder models like T5 and FLAN-T5 achieving performance comparable to larger, decoder-only models such as Llama3.1-8B and Qwen2.5-7B, while improving efficiency. The practical applicability of T2S4BIM was illustrated through a Revit plug-in that interprets user requests and executes corresponding actions (e.g., manipulating object properties).

Original languageEnglish
Article number106119
JournalAutomation in Construction
Volume174
DOIs
StatePublished - Jun 2025

Keywords

  • BIM interaction
  • Building information modeling
  • Language models
  • Natural language processing
  • User request understanding

Fingerprint

Dive into the research topics of 'Text-to-structure interpretation of user requests in BIM interaction'. Together they form a unique fingerprint.

Cite this