Second key update: technical safeguards and risk management

Date published:

25 November 2025

Date updated:

1 June 2026

Topics

Publisher

AI Safety Institute

Introduction

This is the second of 2 key updates since the publication of the International AI safety report in January 2025. It was written by independent experts and aims to provide scientific information to support informed policymaking.

Australia’s representative on the expert advisory panel was Dr Liming Zhu, Research Director, Data61, CSIRO. Australian civil society and industry contributors included:

the Gradient Institute
Old Ways New
Harmony Intelligence
Good Ancestors Policy.

The International AI safety report aims to synthesise scientific evidence to support informed policymaking. It does not make specific policy recommendations.

This update presents the following key findings:

12 companies have published their own frontier AI safety frameworks in 2025. However, implementation can be mixed, and these commitments are difficult to verify externally.
While safeguards preventing the misuse of AI models have improved, they remain fragile against sophisticated attacks and complex tasks.

These results reflect an increasing emphasis on managing the risks of AI technologies and highlight the need for further research and practical solutions in this area internationally.

Read the report

Second key update: technical safeguards and risk management (internaionalaisafetyreport.org)

More information

Find more information on the International AI Safety Report website

Read about the Australian AI Safety Institute

Second key update: technical safeguards and risk management

Topics

Publisher

Introduction

Read the report

More information

Contact us at the department

Connect with us at the department

Acknowledgement of Country

Second key update: technical safeguards and risk management

Share

Topics

Publisher

Introduction

Read the report

More information

Acknowledgement of Country