Risk analysis techniques for governed LLM-based multi-agent systems

Date published:

29 July 2025

Date updated:

1 June 2026

Publisher

AI Safety Institute

Introduction

This report looks at the risks of multiple AI agents interacting with each other. Agents are AI systems that can perceive and take actions in their digital environment with some degree of autonomy.

We commissioned the Gradient Institute to do this study as part of our work to better understand emerging risks from AI for Australia.

The report found that deploying multiple agents within a single environment fundamentally changes the risk landscape. It identified 6 key failure modes that could arise from multi-agent interactions. These result from a lack of validity, that is, whether assessments measure what they claim to measure and produce reliable results that align with real-world outcomes.

The report equips organisations with foundational knowledge and tools for analysing the distinctive challenges emerging from multi-agent interactions in their governed environments.

This gives organisations practical guidance for managing implementation risks when deploying AI technologies.

Read the report

Risk analysis techniques for governed LLM-based multi-agent systems (gradientinstitute.org)

More information

Read about the Australian AI Safety Institute
