Appendix G: Executive AI Dashboard and Metrics

Executives do not need a technical telemetry screen. They do need a regular management view that shows where AI use is spreading, where exposure is rising, where controls are weak, and whether capability is becoming more durable.

The purpose of this appendix is simple: help leadership teams review AI as an operating portfolio rather than a collection of scattered pilots, vendor updates, and isolated incidents.

What An Executive Dashboard Should Answer

An executive AI dashboard should help leaders answer six practical questions:

What material AI use exists now, and what changed this period?
Where is value being created, and where is activity being mistaken for value?
Where is risk, incident pressure, or policy breach rising?
Which controls, approvals, or evidence requirements are weak or overdue?
Which vendors, platforms, or jurisdictions are becoming strategic dependencies?
Is the organisation becoming more governable as AI use expands, or less?

Dashboard Design Principles

A useful executive dashboard should be:

short enough to review in one meeting
comparative enough to show change over time
selective enough to focus on material systems, not every experiment
balanced enough to show value, exposure, and dependency together
actionable enough that each red signal has a named owner and next step

The common failure is to produce a dashboard that reports activity volume without helping leaders decide where to intervene.

Core Dashboard Sections

Section	What leaders should see	Why it matters
Inventory and visibility	Number of material AI systems, high-impact workflows, and major vendors in current use; additions, retirements, or scope changes since last review	Leaders cannot govern what they cannot see
Value and adoption	Which use cases are producing measurable workflow, service, or strategic value; where adoption is shallow, forced, or workaround-heavy	Usage growth alone is not evidence of value
Risk and incidents	Incidents, complaints, overrides, near misses, policy breaches, and unresolved remediation items	Incident visibility shows where AI use is outrunning control
Control and assurance	Systems lacking current approval, testing, monitoring, documentation, owner confirmation, or audit evidence	Weak control maturity is often invisible until something fails
Vendor and dependency	Critical suppliers, concentrated model or cloud dependence, important contract or service changes, and weak fallback positions	AI strength can hide strategic fragility
Workforce and trust	Reliance, resistance, override behavior, training uptake, contestation, and recurring concerns about fairness, workload, or surveillance	Legitimacy and adoption quality matter as much as technical availability

Suggested Monthly Or Quarterly Pack

Most executive teams should expect a short monthly or quarterly pack with five parts:

Portfolio summary Material systems in use, new additions, retired systems, and major scope changes.
Top green, yellow, and red signals A short list of the most important positive developments, emerging concerns, and urgent issues.
Material incidents and control gaps High-priority incidents, complaints, policy breaches, unresolved remediation items, and overdue approvals or reviews.
Strategic dependency view Vendor, platform, cloud, jurisdiction, or model changes that may require management action.
Decisions required Items needing executive approval, escalation, pause, additional funding, or board visibility.

The purpose is not to flood leaders with operational detail. It is to make AI exposure, capability, and dependence visible enough for real oversight.

Suggested Executive Metrics

Not every organisation needs the same scorecard, but most should track some version of the following.

1. Inventory And Materiality

count of material AI systems in live use
count of high-impact or regulated AI use cases
number of new systems introduced since last review
number of paused, retired, or materially changed systems
percentage of material systems with a named accountable owner

2. Value Realization

percentage of material use cases with a defined outcome metric
number of use cases delivering verified workflow or service improvement
number of use cases with persistent adoption but weak measurable value
time from pilot approval to evidence of sustainable operational benefit
top three uses where investment is increasing or being withdrawn

3. Risk, Incidents, And Breach Signals

number of incidents, near misses, or significant overrides this period
number of complaints, contestations, or policy breaches linked to AI use
number of open remediation items past due
number of material systems with repeat incidents or recurring weak behavior
average time to detect, escalate, and contain important AI-related failures

4. Control And Assurance

percentage of material systems with current approval status
percentage with up-to-date testing, monitoring, and documentation
percentage with a current model or vendor review after major change
number of systems lacking adequate audit trail, records logic, or evidence pack
number of high-impact systems operating with temporary exceptions

5. Dependency And Sovereignty Exposure

number of material systems dependent on a single model provider or cloud path
count of critical uses with no credible fallback or degraded-mode plan
number of important vendor, contractual, or policy changes since last review
number of material systems involving cross-border data or inference dependence
count of uses where lock-in, residency, or portability concerns remain unresolved

6. Workforce, Use Quality, And Trust

training completion for managers and staff in approved AI use
rate of override or non-use in systems expected to be relied on
recurring workforce concerns about fairness, surveillance, workload, or deskilling
pattern of challenge, complaint, or escalation from affected users
leadership view of where trust is strengthening and where it is being spent down

Red Flag Signals

Executive dashboards should make certain red flags hard to miss:

management cannot produce a stable inventory of material AI use
adoption is increasing faster than approval, documentation, or monitoring
one vendor or model family is becoming critical without fallback
incidents are being closed administratively but the same failure pattern recurs
value claims are based on usage or enthusiasm rather than verified outcomes
staff are using unapproved tools because approved routes are too slow or unclear
major systems remain live under temporary exceptions for too long

A Simple RAG Review Model

Use a simple status model for material systems:

Status	Meaning	Expected leadership response
Green	Value, control, and dependency are within expected limits	Continue, monitor, and review on normal cycle
Amber	One or more material concerns need management attention	Assign owner, set timeline, and watch closely
Red	Exposure, control weakness, or strategic dependence is outside tolerance	Escalate, pause, narrow, or intervene immediately

The important point is not the color itself. It is whether the status forces a real management response rather than becoming presentation language.

What Should Reach The Board

Not everything belongs at board level. Boards should usually see:

major new high-impact AI deployments
significant incidents, public failures, or regulator-sensitive events
concentrated vendor or jurisdictional dependence in critical functions
unresolved control gaps in high-impact systems
material shifts in exposure, assurance, or strategic reliance

Routine operational details should stay with management unless they indicate a pattern that changes risk tolerance, trust, or strategy.

What Good Looks Like

A credible executive AI dashboard should allow a leadership team to say:

we know where material AI use exists
we can see whether value is real or overstated
we can identify where control is weakening before failure forces attention
we understand where vendor and cross-border dependence are becoming strategic
we can tell which issues need management action and which need board visibility

That is the difference between reporting on AI activity and governing AI as an institutional capability.

← Appendix F

Next: Bibliography →