UK evaluates frontier AI for operational cybersecurity applications

UK evaluates frontier AI for operational cybersecurity applications

UK evaluates frontier AI for operational cybersecurity applications

https://dig.watch/updates/uk-frontier-ai-operational-cybersecurity-apps

Publish Date: 2026-06-15 07:25:00

Source Domain: dig.watch

A new UK pilot demonstrated how AI can support cyber teams in finding critical weaknesses.

The UK Government Cyber Coordination Centre (GC3), in partnership with the National Cyber Security Centre (NCSC) and the AI Security Institute, has completed a pilot programme exploring how frontier AI models could strengthen cyber defence across government systems.

The initiative forms part of the UK’s Government Cyber Action Plan, which seeks to improve public-sector cyber resilience through the use of emerging technologies.

Teams participated in a series of hackathons that used advanced AI systems to analyse public government code repositories for potential security weaknesses.

Different approaches were tested, including multi-agent workflows, AI-assisted vulnerability investigation and specialised AI skills designed to automate parts of the security auditing process. Rather than relying on a single methodology, participants tested different architectures and workflows to determine which approaches produced the most effective results.

The exercise identified 407 security findings, including vulnerabilities that could have enabled authentication bypass, data exposure and remote code execution. AI models demonstrated an ability to identify relationships between technical weaknesses across multiple services and uncover attack paths that conventional scanners often struggle to detect.

Government departments validated the findings through existing security processes and remediated all critical vulnerabilities.

UK officials concluded that successful deployment depends less on the choice of AI model and more on how AI is integrated into structured security workflows. Human experts remained responsible for validating findings, prioritising risks and managing remediation efforts.

Following the results, GC3 plans to launch a second phase involving additional government departments, more AI systems and assessments of…

Source