Leveraging synthetic trace generation of modeling operations for intelligent modeling assistants using large language models

V. Muttillo, C. Di Sipio, R. Rubei, L. Berardinelli: Leveraging synthetic trace generation of modeling operations for intelligent modeling assistants using large language models, Journal Information and Software Technology, Volume 186, 2025, ISSN 0950-5849, Doi: 10.1016/j.infsof.2025.107806

Due to the proliferation of generative AI models in different software engineering tasks, the research community has started to exploit those models, spanning from requirement specification to code development. Model-Driven Engineering (MDE) is a paradigm that leverages software models as primary artifacts to automate tasks. In this respect, modelers have started to investigate the interplay between traditional MDE practices and Large Language Models (LLMs) to push automation. Although powerful, LLMs exhibit limitations that undermine the quality of generated modeling artifacts, e.g., hallucination or incorrect formatting. Recording modeling operations relies on human-based activities to train modeling assistants, helping modelers in their daily tasks. Nevertheless, those techniques require a huge amount of training data that cannot be available due to several factors, e.g., security or privacy issues.