Bridging LLM Planning Agents and Formal Methods: A Case Study in Plan Verification
Keshav Ramani (J.P. Morgan AI Research), Vali Tawosi (J.P. Morgan AI Research), Salwa Alamir (J.P. Morgan AI Research), Daniel Borrajo (J.P. Morgan AI Research)
PRIMA: Enabling User Agency and Control in Mobile GUI Agent Autonomy
Ching-Ting Lin (National Yang Ming Chiao Tung University), Zhi-Hong Ye (National Yang Ming Chiao Tung University), Yung-Ju Chang (National Yang Ming Chiao Tung University)
LLMs in Debate: Does Arguing Make Them Better at Detecting Metamorphic Relations?
Dibyendu Brinto Bose (Virginia Tech), Yoseph Berhanu Alebachew (Virginia Tech), Chris Brown (Virginia Tech)
A 3-Layer Agentic Model for Nonfunctional Requirements in Software Engineering
Ehsan Zabardast (Blekinge Institute of Technology), Tiago Vieira (Independent Researcher), Tony Gorschek (Blekinge Institute of Technology)
Leveraging Large Language Models for Cybersecurity Risk Assessment — A Case from Forestry Cyber-Physical Systems
Fikret Mert Gültekin (Chalmers University of Technology and University of Gothenburg), Oscar Lilja (Chalmers University of Technology and University of Gothenburg), Ranim Khojah (Chalmers University of Technology and University of Gothenburg), Rebekka Wohlrab (Chalmers University of Technology and University of Gothenburg | Carnegie Mellon University), Marvin Damschen (RISE Research Institutes of Sweden), Mazen Mohamad (RISE Research Institutes of Sweden | Chalmers University of Technology and University of Gothenburg)
Transforming Natural Language into Formal Specifications
Kuangxiangzi Liu (Volkswagen AG | Saarland University), Alexander Liggesmeyer (CISPA Helmholtz Center for Information Security), Dhiman Chakraborty (Volkswagen AG), Andreas Zeller (CISPA Helmholtz Center for Information Security)
The Last Dependency Crusade: Solving Python Dependency Conflicts with LLMs
Antony Bartlett (Delft University of Technology), Cynthia C. S. Liem (Delft University of Technology), Annibale Panichella (Delft University of Technology)
AgentGuard: Runtime Verification of AI Agents
Roham Koohestani (JetBrains Research)