TL;DR: Visibility, not more manual review, becomes the limiting factor when data products span many systems, as Collibra says its updated data quality tab can roll up scores across derived relations, branches, and business assets so leaders see a unified health signal instead of fragmented column-level checks.
At a glance
What this is: Collibra's post describes data quality tab enhancements that aggregate quality scores from technical assets into business-facing data products and domains.
Why it matters: For IAM-adjacent governance teams, the pattern matters because visibility, lifecycle, and trust breakdowns at machine scale create the same control problems seen in NHI, autonomous, and human identity programmes.
By the numbers:
- 92% of data leaders admit that high-quality data products are their top priority.
👉 Read Collibra's analysis of data quality tab rollups and business asset health
Context
Data quality becomes a governance problem when technical metrics cannot be translated into business trust. The core issue in Collibra's post is visibility: organisations can measure quality at the column level, but they struggle to explain what those measurements mean for a data product, report, or downstream decision.
That gap is familiar to identity teams. Whether the object is a dataset, a service account, or an AI workflow, programmes fail when control signals stay trapped at the technical layer and never reach the business layer that decides whether to trust the asset.
The article's starting point is typical for large enterprises with fragmented data estates, where manual tracking cannot keep up with the number of assets, relations, and refresh cycles.
Key questions
Q: How should governance teams roll up technical data quality into business-facing trust signals?
A: Governance teams should define the aggregation path first, then map every technical source that contributes to a business asset or data product. The score is only useful if it reflects complete lineage, known refresh cadence, and the actual decision context. Without that, the number looks precise but does not support trustworthy governance.
Q: Why do data quality programmes fail when assets span multiple schemas and tables?
A: They fail because linear reporting cannot represent branching dependencies. If one business asset depends on several technical paths, a single unstructured score can hide weak inputs or overstate readiness. A graph-based approach is needed so the reported health matches the real dependency structure.
Q: How do teams know if a data quality score is actually trustworthy?
A: They should verify three things: the lineage behind the score is complete, the refresh cadence matches the asset's lifecycle, and the aggregation rules include all material branches. If any of those are missing, the score is a partial indicator rather than a reliable control signal.
Q: What should organisations do before certifying a data product based on quality scores?
A: They should review the upstream technical inputs, confirm the roll-up logic covers every contributing relation, and assess whether the business asset depends on any blind spots. Certification should reflect both score and scope, not just a single numeric threshold.
How it works in practice
Aggregation paths and derived relations in data quality scoring
The enhancement uses aggregation paths to follow connected relations from physical columns through tables, schemas, and business assets. Derived relations let the platform roll up scores even when the business object is not directly tied to a single technical source. In effect, the quality score becomes a graph traversal problem rather than a linear lookup. That matters because complex enterprise data products are rarely fed by one system, and a single failed column can change the apparent health of many downstream assets.
Practical implication: define which relations are authoritative for quality roll-up before using the score operationally.
Why linear reporting fails for multi-branch data products
Traditional reporting assumes a simple path from source to asset, but real data products often branch across multiple schemas, tables, and datasets. When that happens, a single score can understate or overstate actual quality unless every contributing branch is included. The metadata graph solves this by joining disparate paths into one evaluation context. This is a control problem as much as a data problem, because incomplete lineage creates false confidence and delays stewardship action.
Practical implication: validate branch coverage in lineage and score aggregation before exposing business-facing quality metrics.
Quality metrics as a live control plane
Once quality scores are stored as attributes on assets, they can drive filtering, search, and downstream visibility in the marketplace and control tower views. That turns quality from a static report into an operational signal that changes as the underlying data changes. The architecture is useful because it separates measurement from presentation while preserving traceability across layers. The key technical question is whether the score reflects the actual asset graph or only the parts the system can see cleanly.
Practical implication: treat the score as an operational control signal only if recalculation and lineage coverage are validated.
NHI Mgmt Group analysis
Data quality fails at enterprise scale when visibility stops at the technical layer. Collibra's post is really about the gap between measurement and trust: teams can record quality metrics, but they cannot reliably translate those metrics into a business-level view of asset health. That is the same pattern identity teams see when technical controls do not surface into governance decisions. The practitioner conclusion is that visibility architecture matters as much as the metric itself.
Business trust depends on lineage-aware aggregation, not isolated scores. A column-level quality value is useful, but it is not enough when a data product draws from multiple schemas and branches. This is where graph-based aggregation becomes a governance requirement rather than a convenience, because it determines whether downstream consumers see a coherent asset picture. The practitioner conclusion is to govern the path, not just the metric.
Quality anywhere is a data-governance version of identity blast radius. Once an upstream asset changes, every dependent report, product, or model inherits that change. That is analogous to how one compromised non-human identity can affect a wider access surface than the original account suggests. The practitioner conclusion is to manage dependency scope explicitly so that trust signals stay aligned with real downstream impact.
Granular control over refresh frequency is a governance control, not a dashboard feature. Monthly, weekly, and daily scores only matter if the cadence matches the lifecycle of the underlying asset and the business decision it supports. Otherwise the organisation gets stale assurance dressed up as automation. The practitioner conclusion is to align measurement cadence with asset volatility and decision criticality.
Data health gap: the real problem is not that organisations lack quality checks, but that they lack a repeatable way to roll those checks into a business-trust signal across the full asset graph. That breaks the assumption that a technical metric can safely stand in for governance judgement. The practitioner conclusion is to treat aggregation logic as part of the control design, not an afterthought.
From our research:
- The average estimated time to remediate a leaked secret is 27 days, despite 75% of organisations expressing strong confidence in their secrets management capabilities, according to The State of Secrets in AppSec.
- Organisations maintain an average of 6 distinct secrets manager instances, creating fragmentation that undermines centralised control, according to The State of Secrets in AppSec.
- For a broader governance lens, the NHI Lifecycle Management Guide shows how visibility, rotation, and offboarding need to stay aligned as assets move across environments.
What this signals
Data health gap: teams should expect governance pressure to shift from reporting volume to trust integrity. When scores can be rolled up across many branches and business assets, the control question becomes whether the aggregation logic is complete enough to support certification, not whether another dashboard exists.
The practical signal is that data governance and identity governance are converging around the same operating problem: how to turn technical state into a business decision without losing lineage. That makes score provenance, refresh cadence, and exception handling central programme controls rather than reporting details.
With 43% of security professionals already concerned about AI systems learning and reproducing sensitive information patterns from codebases, the boundary between data trust and identity trust is getting thinner, not wider. Teams should prepare for shared governance patterns across data products, machine identities, and AI-enabled workflows.
For practitioners
- Map aggregation paths before operational use Document which schemas, tables, branches, and derived relations contribute to each business-facing score, then test whether the roll-up matches the asset graph you actually govern.
- Set refresh cadence by asset volatility Use monthly, weekly, or daily recalculation only where the underlying data product changes at that speed, and avoid applying one cadence across every asset class.
- Validate branch coverage in lineage reviews Check that multi-source data products include every contributing branch before exposing scores to business stakeholders or certification workflows.
- Separate technical scoring from business trust decisions Require a governance step that reviews the aggregated score, the lineage behind it, and any unresolved blind spots before the asset is treated as certified.
Key takeaways
- Data quality becomes a governance issue when technical metrics cannot be translated into a trustworthy business signal.
- Graph-based roll-up matters because branching data products need lineage-aware aggregation, not isolated scores.
- Teams should treat aggregation logic, refresh cadence, and lineage coverage as control design choices, not reporting settings.
Standards & Framework Alignment
This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.
NIST CSF 2.0, NIST CSF 2.0 and NIST Zero Trust (SP 800-207) set the governance and control requirements practitioners need to meet.
| Framework | Control / Reference | Relevance |
|---|---|---|
| NIST CSF 2.0 | PR.DS-4 | Quality roll-up depends on preserving data integrity across asset paths. |
| NIST CSF 2.0 | ID.AM-1 | The post hinges on accurate asset visibility and dependency mapping. |
| NIST Zero Trust (SP 800-207) | PR.AC-1 | Access and trust decisions should follow verified asset context, not partial visibility. |
Apply zero-trust principles to governance by requiring validated asset context before certification.
Key terms
- Aggregation Path: An aggregation path is the defined chain of related technical objects used to roll up a score or status into a higher-level business asset. In governance terms, it determines which dependencies count, which are ignored, and whether the reported result is trustworthy across a complex data graph.
- Derived Relation: A derived relation is an inferred connection that allows a platform to connect technical data sources to business-facing assets even when the link is not directly stored. It is useful for roll-up logic, but it must be validated carefully because inferred links can hide gaps if the underlying graph is incomplete.
- Business-Facing Data Product: A business-facing data product is a curated data asset presented for decision-making, reporting, or downstream consumption rather than raw technical use. Its governance depends on whether the health signal reflects all contributing sources and not just the easiest ones to measure.
- Data Quality Roll-Up: Data quality roll-up is the process of combining lower-level quality signals into a higher-level view of asset health. It is valuable when many technical components contribute to one business outcome, but it only works if lineage, branch coverage, and refresh logic are aligned.
Deepen your knowledge
NHI governance, agentic AI identity, and machine identity lifecycle are core topics in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are responsible for identity security strategy or NHI governance in your organisation, it is worth exploring.
This post draws on content published by Collibra: Bridge the data health gap between technical and business assets. Read the original.
Published by the NHIMG editorial team on 2026-03-31.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org