Subscribe to the Non-Human & AI Identity Journal

Baseline Measurement

The pre-deployment value set used to show whether a control changed outcomes after rollout. Good baselines are time-bounded, workflow-specific, and captured before implementation so that later savings or risk reductions can be tested rather than assumed.

Expanded Definition

Baseline measurement is the starting point that lets an NHI team prove whether a control actually changed outcomes after rollout. In practice, it captures the pre-implementation state for a specific workflow, identity class, or control domain, then holds that reference steady long enough to compare before-and-after results. In NHI governance, the baseline must be scoped carefully because service accounts, API keys, secrets, and agent permissions behave differently across environments and release cycles.

Definitions vary across vendors when the term is applied to observability, security testing, or operational reporting, but the core idea is consistent: if the starting condition is unclear, improvement claims are weak. For that reason, baseline work should be time-bounded, workflow-specific, and tied to a measurable control objective rather than a broad program goal. The NIST Cybersecurity Framework 2.0 is useful here because it reinforces the need for measurable outcomes, not just activity.

The most common misapplication is treating a post-rollout snapshot as a baseline, which occurs when teams record metrics only after the control has already changed the environment.

Examples and Use Cases

Implementing baseline measurement rigorously often introduces data-collection overhead, requiring organisations to weigh audit quality against the operational cost of instrumenting every workflow.

  • A team records the number of active service accounts, privilege scope, and secret age before introducing rotation controls, then compares those figures 30 days later.
  • An organisation measures API key exposure in code repositories before migrating to a secrets manager, using the original value set to validate whether leakage actually declined.
  • Security leaders benchmark the frequency of stale credentials in CI/CD pipelines before enforcing policy gates, then use the baseline to distinguish real progress from temporary cleanup.
  • Before deploying Zero Trust changes, practitioners capture access patterns for machine identities and compare them to the post-change state using guidance from the Ultimate Guide to NHIs.
  • For agentic AI, teams baseline tool access, token issuance, and escalation paths before hardening controls, then reassess after policy enforcement and workflow redesign.

These use cases become much stronger when paired with operational benchmarks from the Ultimate Guide to NHIs and measurement expectations from the NIST Cybersecurity Framework 2.0, especially when the organisation needs to show whether a change was effective or merely visible.

Why It Matters in NHI Security

Baseline measurement is essential because NHI risk often hides in volume, privilege creep, and long-lived credentials, making intuition a poor substitute for evidence. NHI Management Group research shows that 97% of NHIs carry excessive privileges and only 5.7% of organisations have full visibility into their service accounts, which means many teams cannot prove whether a remediation program reduced exposure at all. Without a reliable baseline, control owners may celebrate motion while the underlying attack surface stays flat or worsens. The Ultimate Guide to NHIs is a strong reference point for the lifecycle problems that make baselines necessary, while NIST Cybersecurity Framework 2.0 helps translate those measurements into governable outcomes.

Practitioners should treat baseline drift as a governance issue, not just a reporting defect, because changing workloads, automation, and agent permissions can invalidate comparisons faster than teams expect. Organisations typically encounter the value of baseline measurement only after an incident review, when they cannot prove what changed, what improved, or whether the control ever worked.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST CSF 2.0 and NIST Zero Trust (SP 800-207) set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
OWASP Non-Human Identity Top 10 NHI-01 Baseline data is needed to measure NHI inventory and exposure before controls change.
NIST CSF 2.0 GV.RM-03 Risk measurement needs baselines so governance can evaluate control effectiveness over time.
NIST Zero Trust (SP 800-207) Zero Trust relies on continuous assessment, which makes baseline comparison operationally necessary.

Set workflow-specific baselines for machine access and verify whether post-change access patterns actually tightened.