What is AI safety science?
AI technologies will be vital to Australia’s future economic growth, competitiveness and productivity. To capitalise on the opportunities of AI, we need to progress the science of AI safety.
AI safety science is a field of research, engineering and policy that measures and monitors emerging risks and capabilities of AI technologies. This includes frontier AI, defined as highly capable general-purpose AI models that can perform a wide variety of tasks and match or exceed the capabilities present in today’s most advanced models.
AI safety science aims to ensure that people develop and use AI systems in ways that are secure and aligned with human values.
We’re committed to strengthening Australia’s scientific understanding of the capabilities and risks associated with frontier AI systems to make them safer for everyone.
Participating in the International Network of AI Safety Institutes
Australia is a founding member of the International Network of AI Safety Institutes. We signed the Seoul Declaration in May 2024, confirming a commitment among countries to advance the science of AI safety, building on the Bletchley Declaration.
Current members of the network are:
- Australia
- Canada
- European Commission
- France
- Japan
- Kenya
- Republic of Korea
- Singapore
- United Kingdom
- United States.
The network has 3 workstreams:
- researching ways to manage risks from AI-generated content
- testing frontier AI systems
- conducting risk assessments of frontier AI systems.
Our role
Our department leads Australia’s participation in the network, in line with the Seoul Declaration.
We’re bringing together technical AI experts from across Australia and internationally. We meet regularly with other members of the network to progress activities under the 3 workstreams.
Research agenda
Australia and Canada are co-leading the research agenda on managing risks from AI-generated content. AI-generated text, audio, images and video are becoming more common and more widely available. When produced and shared at scale, AI-generated content poses significant risks, including:
- creating harmful content
- facilitating fraud and impersonation
- undermining trust.
The research will look at ways to reduce these risks by:
- building safeguards into AI models
- evaluating and advancing techniques that show when content is AI-generated
- understanding the impacts of AI-generated content spreading online.