L1 detail

Process & tooling

6 Level 2 areas, 24 Level 3 activities

Linked Level 2 areas

Level 2

Inventory and discovery

4 Level 3 activities

Ensure visibility and control of what exists, where it runs, what it can do, and where risk sits

Level 2

Monitoring and observability

4 Level 3 activities

Detect issues early, manage cost and performance, and support governance decisions with evidence

Level 2

Lifecycle management

5 Level 3 activities

Manage change safely and sustain performance and safety in live operation

Level 2

SDLC and pipelines

5 Level 3 activities

Enable frequent iteration with consistent controls, evidence, and reduced release risk

Level 2

Continuous improvement

3 Level 3 activities

Sustain value and reduce operational risk as agents scale and evolve

Level 2

Process standardisation

3 Level 3 activities

Make automation repeatable and reduce variance that breaks safety and control assumptions

Linked Level 3 activities

SDLC and pipelines

DevOps / DevSecOps pipeline integration

Integrate evals, policy gates, and security checks into CI/CD with explicit alignment between technology teams and oversight/control teams

Continuous improvement

Feedback collection, triage, and change implementation

Capture feedback on outcomes/overrides/failures and implement a managed loop (triage, prioritise, release fixes, verify impact)

Inventory and discovery

Intake and registration workflow

Expand from logging GenAI use cases to registering agent bundles, autonomy levels, tools/connectors, permissions, and deployment endpoints as part of a governance platform

Monitoring and observability

Operational monitoring (uptime, latency, cost)

Monitor agent workload patterns and downstream tool/system health, not only model APIs and supporting services

Lifecycle management

Pre-deployment evaluation (capability and safety)

Treat evaluation as a product and tooling capability - datasets, harnesses, trajectory evals, red teaming, acceptance thresholds, and regression gates, not a light-touch checklist

Process standardisation

Process discovery, mapping, and documentation

Document and version critical processes (inputs, decisions, exceptions, controls, handoffs) that agents will execute or influence

Monitoring and observability

Behaviour monitoring (actions, tool use)

Track tool calls, retries, overrides, boundary hits, and failure modes, not just prompts/outputs
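
As a rough illustration of tracking behaviour rather than only prompts and outputs, the sketch below counts behavioural events from an agent trace. The event names and the dictionary shape are hypothetical assumptions, not part of the framework.

```python
from collections import Counter

# Hypothetical event types mirroring the signals named above.
TRACKED_EVENTS = {"tool_call", "retry", "override", "boundary_hit", "failure"}


def summarise_events(events: list[dict]) -> Counter:
    """Count tracked behavioural events; unrecognised types are ignored."""
    counts: Counter = Counter()
    for event in events:
        kind = event.get("type")
        if kind in TRACKED_EVENTS:
            counts[kind] += 1
    return counts


# Example trace (illustrative data only).
trace = [
    {"type": "tool_call", "tool": "crm.lookup"},
    {"type": "retry", "tool": "crm.lookup"},
    {"type": "tool_call", "tool": "crm.lookup"},
    {"type": "prompt"},  # not a behavioural signal, so not counted
    {"type": "boundary_hit", "rule": "max_refund"},
]
summary = summarise_events(trace)
```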

Lifecycle management

Change and release management

Manage releases as bundles (agents + model + tools/connectors + policies + prompts + memory/config), with controlled rollout and rollback
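
One way to picture a release bundle is as a single versioned record whose fingerprint changes whenever any component changes, paired with a history that supports rollback. This is a minimal sketch; `ReleaseBundle`, `ReleaseHistory`, and all field names are hypothetical.

```python
import hashlib
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class ReleaseBundle:
    """Everything that ships together (illustrative fields)."""
    agent_version: str
    model_id: str
    tool_versions: dict[str, str]
    policy_version: str
    prompt_version: str
    config_version: str

    def fingerprint(self) -> str:
        # Canonical JSON so the same bundle always hashes to the same value.
        canonical = json.dumps(asdict(self), sort_keys=True)
        return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


class ReleaseHistory:
    """Keeps deployed bundles in order so the previous one can be restored."""

    def __init__(self) -> None:
        self._deployed: list[ReleaseBundle] = []

    def deploy(self, bundle: ReleaseBundle) -> None:
        self._deployed.append(bundle)

    def current(self) -> ReleaseBundle:
        return self._deployed[-1]

    def rollback(self) -> ReleaseBundle:
        if len(self._deployed) < 2:
            raise RuntimeError("no earlier bundle to roll back to")
        self._deployed.pop()
        return self._deployed[-1]
```

Because the fingerprint covers every component, a prompt-only change still produces a new bundle identity, which is the point of releasing as a bundle rather than component by component.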

SDLC and pipelines

DataOps pipeline integration

Treat data changes as production changes impacting agent behaviour, not background hygiene

Inventory and discovery

Metadata schema and classification taxonomy

Classify agents by action types, autonomy tier, risk tier, data sensitivity, tool access, and operational criticality
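
The classification dimensions above could be captured in an inventory record along the following lines. This is only a sketch: the tier names, enum values, and example entry are assumptions, and a real taxonomy would be defined by the organisation.

```python
from dataclasses import dataclass
from enum import Enum


class AutonomyTier(Enum):
    # Hypothetical tiers, ordered from least to most autonomous.
    SUGGEST_ONLY = 1
    HUMAN_APPROVED = 2
    AUTONOMOUS = 3


class RiskTier(Enum):
    LOW = 1
    MEDIUM = 2
    HIGH = 3


@dataclass(frozen=True)
class AgentRecord:
    """One inventory entry covering the six classification dimensions."""
    agent_id: str
    action_types: tuple[str, ...]
    autonomy_tier: AutonomyTier
    risk_tier: RiskTier
    data_sensitivity: str
    tool_access: tuple[str, ...]
    operational_criticality: str


# Illustrative entry; every value here is made up.
record = AgentRecord(
    agent_id="invoice-triage",
    action_types=("read", "write"),
    autonomy_tier=AutonomyTier.HUMAN_APPROVED,
    risk_tier=RiskTier.MEDIUM,
    data_sensitivity="confidential",
    tool_access=("erp.read_invoice", "erp.flag_invoice"),
    operational_criticality="business-important",
)
```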

Continuous improvement

Process metrics and maturity tracking

Track maturity of controls, monitoring, adoption, and operational outcomes per agent/domain

Process standardisation

Standard operating procedures and playbooks

Create SOPs for operating with agents - human intervention points, exception handling, escalation, and day-to-day run patterns

SDLC and pipelines

Control gating and approvals in CI/CD

Implement approvals based on risk tier and evidence, while requiring human review for higher-risk classes regardless of test results
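
A gate of this kind might look like the sketch below, where passing checks is necessary for every tier but never sufficient for the higher-risk tier. The tier labels, the `ReleaseEvidence` fields, and the decision strings are hypothetical.

```python
from dataclasses import dataclass

# Assumption: only "high" counts as a higher-risk class in this sketch.
HIGHER_RISK_TIERS = {"high"}


@dataclass(frozen=True)
class ReleaseEvidence:
    risk_tier: str
    evals_passed: bool
    security_checks_passed: bool
    human_approved: bool = False


def gate_decision(evidence: ReleaseEvidence) -> str:
    """Return "approve", "needs_human_review", or "reject"."""
    # Failing evidence blocks the release at any tier.
    if not (evidence.evals_passed and evidence.security_checks_passed):
        return "reject"
    # Higher-risk classes need a human sign-off even when all checks pass.
    if evidence.risk_tier in HIGHER_RISK_TIERS and not evidence.human_approved:
        return "needs_human_review"
    return "approve"
```

For example, `gate_decision(ReleaseEvidence("high", True, True))` yields `"needs_human_review"`, while the same evidence at `"low"` yields `"approve"`.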

Inventory and discovery

Dependency mapping (apps, connectors, data stores, services)

Map end-to-end dependencies for autonomous chains - applications, external connectors, external databases, event streams, model gateways, policy engines

Continuous improvement

Live backlog prioritisation and iteration principles

Update post-live prioritisation principles to balance risk reduction, stability, and outcome improvements, not just feature delivery

Lifecycle management

Live tuning, configuration, and update management

Define how live agents are tuned and updated - configuration management, policy updates, safe rollout, and rollback - rather than informal prompt tweaking

Process standardisation

Process conformance and measurement

Use conformance checks (process mining/telemetry, exception rates, control-point adherence) to ensure reality matches documented process

Monitoring and observability

Risk signal monitoring (incidents, drift alerts)

Operate a KRI-driven alerting regime for agent fleets, beyond basic monitoring, with defined thresholds and response playbooks
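
To make "defined thresholds and response playbooks" concrete, the sketch below pairs each key risk indicator (KRI) with warn and breach levels and a playbook reference. The indicator name, threshold values, and playbook identifier are invented for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class KriThreshold:
    name: str
    warn_at: float    # observed value at or above this raises a warning
    breach_at: float  # observed value at or above this is a breach
    playbook: str     # hypothetical reference to the response playbook


def evaluate_kri(threshold: KriThreshold, observed: float) -> str:
    """Classify an observed KRI value as "ok", "warn", or "breach"."""
    if observed >= threshold.breach_at:
        return "breach"
    if observed >= threshold.warn_at:
        return "warn"
    return "ok"


# Illustrative KRI: share of agent actions overridden by a human.
override_rate = KriThreshold(
    name="human_override_rate",
    warn_at=0.05,
    breach_at=0.15,
    playbook="PB-override-spike",
)
status = evaluate_kri(override_rate, observed=0.08)
```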

SDLC and pipelines

Agent testing harnesses (evals, simulations)

Build repeatable harnesses to simulate workflows, tool failures, adversarial inputs, and boundary violations

Inventory and discovery

Go-live readiness checklist and gates

Add readiness gates for autonomy (fallbacks, escalation, evals, logging, access), beyond content checks

Monitoring and observability

Outcome monitoring and role-based reporting

Combine outcome measurement with role-based reporting - goal completion, quality, business KPIs, and committee-ready views, not just user satisfaction

Lifecycle management

Retirement and decommissioning

Retire agents safely by removing tool access, archiving evidence, and migrating workflows

SDLC and pipelines

Environment management (dev / test / prod)

Add safe sandboxes for tool actions and controlled test data where appropriate, not only staging for chat/UI

Lifecycle management