Agentic AI Transformation

Executive-grade guidance for organisations that need to adopt agentic AI safely, calmly and at scale.

L1 detail

Governance

8 Level 2 areas40 Level 3 activities

Linked Level 2 areas

Linked Level 3 activities

Risk management

Agent risk assessment framework

M

Extend model risk assessment to include action risk, control dependency risk, systemic interaction risk (multi-agent/tool chains), and operational resilience risk

Open Level 3 detail

Policies

AI usage policies

XS

Extend acceptable use into explicit autonomy tiers, defined agent red lines, permitted action classes, tool boundaries, and approval thresholds (and link these to enforcement mechanisms in architecture/policy enforcement)

Open Level 3 detail

Policy enforcement

Automated policy checks

M

Enforce constraints at runtime on agent actions/tool calls (allowlists, thresholds, jurisdiction, data sensitivity), not post-hoc reviews; implement via a policy decision point intercepting tool invocation and configuration changes

Open Level 3 detail

Controls

Automated validation and testing

M

Move from prompt tests to agent evaluations including tool-use, multi-step plans, trajectory correctness, and failure-mode behaviour

Open Level 3 detail

Operating model and committees

Committee mapping (Risk, IT, Data, Responsible AI, etc)

XS

Clarify which committees approve what for autonomous actions and cross-domain tool access, beyond standard GenAI oversight

Open Level 3 detail

Accountability

Decision rights and approvals

XS

Define who approves autonomy levels, tool integrations, policy exceptions, and go-live, not just who owns the model

Open Level 3 detail

Portfolio prioritisation and funding

Portfolio triage and risk-value scoring

S

Expand portfolio triage to include agentic risk indicators (autonomy, tool access, action impact, resilience dependencies) and value indicators (throughput, cycle time, quality uplift), and use it to drive the required mitigations/controls

Open Level 3 detail

Compliance

Regulatory mapping

S

Map agent workflows/actions to applicable rules (recordkeeping, Consumer Duty, operational resilience, etc), not only model use

Open Level 3 detail

Accountability

Agent audit trails (human oversight mapping)

S

Establish traceability linking each agent bundle and action class to accountable humans and committees

Open Level 3 detail

Portfolio prioritisation and funding

Benefits realisation tracking

S

Track realised value vs expected and incremental value attributable to autonomy (cycle time, throughput, error reduction, avoided escalations), alongside safety KRIs (incidents, overrides, drift)

Open Level 3 detail

Operating model and committees

Expand AI committee remit to include oversight of agentic AI

XS

Expand the existing AI/Responsible AI committee remit to include agentic AI and review membership to reflect deeper IT and operational involvement

Open Level 3 detail

Compliance

Explainability standards

S

Move from “how was this answer generated” to “why did the agent act” with action-level rationales and evidence links (supported by technology instrumentation)

Open Level 3 detail

Risk management

Fairness audits

M

Shift from static output bias checks to monitoring fairness of autonomous decisions and impacts (allocations, prioritisation, service levels) over time

Open Level 3 detail

Controls

Human-in-the-loop control points

S

Define where humans must approve/override autonomous actions (by action class and impact), not merely review generated content

Open Level 3 detail

Policies

Policy suite uplift across risk taxonomy

M

Update the broader policy suite impacted by agents (eg responsible AI/data ethics, AI usage, privacy, cyber, resilience/operational risk, third-party risk, data quality, model risk/validation) and align definitions/requirements across them

Open Level 3 detail

Policy enforcement

Policy-as-code

L

Codify the most operational policies (autonomy tiers/red lines, privacy, security/tool access, resilience, validation/evals, recordkeeping) plus the SOPs that operationalise them, into versioned machine-readable rules

Open Level 3 detail

Controls

Audit logging and traceability

M

Expand logs from prompts/outputs to full action traces (inputs, tools, decisions, state, approvals) with consistent identifiers across systems

Open Level 3 detail

Compliance

Audit readiness and evidence packs

M

Maintain continuously updated evidence (policies, tests, logs, approvals, monitoring), enabled by automated evidence pipelines

Open Level 3 detail

Accountability

Escalation paths (update existing)

XS

Update existing escalation paths to include agent boundary violations, kill-switch invocation, and tool misuse scenarios

Open Level 3 detail

Risk management

Incident response and crisis playbooks

S

Upgrade playbooks to include agent disablement, tool credential rotation, rollback bundles, and customer remediation

Open Level 3 detail

Policy enforcement

Policy update workflows

S

Create rapid update mechanisms with testing and controlled rollout for policy rules and constraints

Open Level 3 detail

Operating model and committees

RACI and decision pathways for agent go-live

S

Define decision flows for go-live approvals, exception approvals, and incident decisions while preserving the ability to use ad hoc escalation when needed

Open Level 3 detail

Policies

Responsible agent rules

S

Move from static principles to scenario-based rules for autonomous choices and trade-offs (eg prioritisation, customer impact, escalation)

Open Level 3 detail

Controls

Agentic control library and automated control selection

M

Expand the enterprise control library to include agent-specific controls (ring-fencing, tool governance, eval thresholds, monitoring) and automate control selection/application by risk tier and agent classification

Open Level 3 detail

Operating model and committees

Business sponsorship and ownership

XS

Shift ownership from “IT and data science innovation” to accountable business owners for agent outcomes and customer impact (with IT/data as delivery partners)

Open Level 3 detail

Policies

Compliance framework definitions

S

Expand compliance definitions from “AI use case” to explicit objects - agents, agent bundles, foundation models, connectors, tools, and AI platforms - and define required records, controls, and responsibilities per object

Open Level 3 detail

Risk management

Continuous monitoring standards

S

Define minimum monitoring for agent fleets (KPIs, KRIs, alert thresholds, evidencing, sampling regimes, response SLAs) and how it is operated

Open Level 3 detail

Policy enforcement

Exception handling processes

S

Define how exceptions are requested, approved, time-bound, monitored, and automatically revoked when expired, supported by workflow tooling and policy engines

Open Level 3 detail

Accountability

Performance and incentive alignment

S

Tie objectives to safe autonomy outcomes (quality, controls adherence, incident rates), not just delivery

Open Level 3 detail

Controls

Enhanced agent observability requirements (incl drift triggers)

M

Extend observability to include autonomy patterns, tool selection, boundary hits, override rates, and drift triggers, not only model accuracy

Open Level 3 detail

Policy enforcement

Governance platform (system of record and workflow)

L

Stand up a governance platform to manage agent inventory/bundles, policies, approvals, monitoring views, and evidence automation (distinct from a “control hub” control library)

Open Level 3 detail

Operating model and committees

Programme governance cadence (forums, packs, KPIs)

S

Add operational rhythms for agent fleets (observability, KRIs, incidents, drift, overrides, value metrics), not occasional reporting

Open Level 3 detail

Risk management

Scenario stress-testing

M

Move from prompt edge cases to simulation of real workflows, adversarial tool inputs, cascading failures, and boundary violations (ideally executed in a controlled sandbox)

Open Level 3 detail

Policies

Third-party agreements and procurement clauses for agents

S

Update supplier due diligence and contract clauses for agent connectors/tools (data use, logging, breach handling, residency, sub-processors, change notification, audit rights)

Open Level 3 detail

Policies

AI risk appetite statements and autonomy bounds

S

Update risk appetite to include measurable autonomy limits (impact thresholds, decision classes, spend caps, customer harm tolerance, override requirements)

Open Level 3 detail

Controls

Fallback mechanisms

M

Govern and test degrade modes (stop, revert to human, limited autonomy), not just infrastructure failover

Open Level 3 detail

Policy enforcement

Policy audit logs (across risk taxonomy areas)

M

Log policy versions, decisions, enforcement outcomes, and overrides across multiple policy domains (privacy, cyber, resilience, responsible AI) with consistent identifiers

Open Level 3 detail

Risk management

Responsible AI ceremonies

S

Combine pre-mortems and unintended consequence scanning into structured ceremonies (ethical purpose, explainability, and control-breakdown workshops) focused on autonomous action pathways

Open Level 3 detail

Policies

Accountability policy and ownership model

S

Extend from “model owner” to defined owners for foundation models, agent service/bundle, solution design, infrastructure design, connector use, and tool ownership with clear obligations

Open Level 3 detail

Policies

Data retention policies (agent memory and traces)

S

Extend retention to agent memory, action traces, and tool outputs, aligned to privacy, evidencing, and dispute requirements

Open Level 3 detail