TL;DR: Open Semantic Interchange and governed semantic layers aim to make data definitions, lineage, and access controls consistent across Snowflake and Collibra workflows, which matters as unstructured data and AI models depend on trusted context, according to Collibra. The deeper shift is that governance now has to travel with the data and the semantics, not sit beside them.
At a glance
What this is: This is Collibra's analysis of how Open Semantic Interchange can speed data delivery by pairing semantic consistency with active governance in Snowflake environments.
Why it matters: It matters because IAM, data governance, and AI teams increasingly have to control who can use what data, which definitions they can trust, and how that context remains consistent across platforms.
By the numbers:
- Only 5.7% of organisations have full visibility into their service accounts.
- 97% of NHIs carry excessive privileges, increasing unauthorised access and broadening the attack surface.
- 79% of organisations have experienced secrets leaks, with 77% of these incidents resulting in tangible damage.
👉 Read Collibra's analysis of open semantic interchange and data governance
Context
Open Semantic Interchange is a governance and interoperability problem, not just a data format issue. The article argues that business users, data engineers, and AI systems need a shared semantic layer so definitions, lineage, and policy enforcement stay consistent as data moves through Snowflake and Collibra workflows.
For IAM and governance teams, the relevant question is where trust lives when data is consumed by dashboards, BI users, and AI agents. If the meaning of a metric, the lineage of a dataset, or the masking policy applied to sensitive fields changes by tool, governance becomes inconsistent even when the underlying platform is technically available.
This matters for both data governance and identity governance because access decisions increasingly depend on metadata, classification, and policy translation. The article's underlying point is typical of modern enterprise data stacks: the risk is not only unauthorized access, but also unauthorized interpretation.
Key questions
Q: How should governance teams manage semantic consistency across data platforms and AI tools?
A: Governance teams should treat semantic consistency as a control objective, not a documentation exercise. That means assigning ownership for business definitions, enforcing lineage, and checking that access rules and masking policies survive each consumption path. If dashboards, warehouses, and AI systems interpret the same metric differently, the governance model has already drifted.
Q: Why do data governance and IAM teams need to work together on semantic layers?
A: Because access control and meaning control now overlap. IAM determines who can reach a dataset, but semantic governance determines what that dataset means, how it is classified, and what policy context applies. When those layers diverge, users can receive data they are entitled to access but not entitled to interpret in that form.
Q: How can organisations tell whether policy translation into warehouse controls is working?
A: Look for consistency between policy intent, platform enforcement, and actual user experience. If row-level access, masking, or classification behaves differently across interfaces, the translation is incomplete. The strongest signal is whether the same governance rule produces the same outcome in dashboards, BI tools, and AI-enabled workflows.
Q: What should security teams do when AI models depend on governed business definitions?
A: Security teams should require those definitions to be versioned, reviewed, and tied to lineage before they are used in production models. AI systems amplify semantic mistakes because they reuse context at scale. If the governing definition is wrong or unclear, the model can be accurate technically and still be wrong operationally.
Technical breakdown
Governed semantic layers in Snowflake environments
A governed semantic layer is the authoritative translation layer between raw data and business meaning. It defines metrics, dimensions, lineage, and policy context so different tools do not invent their own version of the truth. In this model, the semantic layer is not decorative metadata. It becomes part of the control plane because it determines what the user sees, what the AI model consumes, and which policies apply to the underlying data objects.
Practical implication: map critical metrics and sensitive datasets to a governed semantic layer before teams build more dashboards, models, or AI workflows on top of them.
Model Context Protocol and semantic interoperability
The article links MCP to semantic context delivery, which is a useful reminder that AI systems need structured, trustworthy inputs, not just tool access. When an AI agent or analytical workflow retrieves context from a governed layer, the control problem shifts from raw access to the quality and provenance of the context being supplied. Without that layer, the system may still be able to answer a question, but it can answer it with inconsistent business meaning or incomplete policy awareness.
Practical implication: treat semantic context as governed input to AI and analytics workflows, not as an optional enrichment layer.
Row access policies and dynamic masking as policy translation
Plain-language governance rules only matter if they can be translated into enforceable platform controls. The article points to Snowflake row access policies and dynamic data masking as the mechanism that turns policy into runtime behaviour. That translation matters because business users can receive data immediately while sensitive values remain protected according to role, purpose, or classification. The governance challenge is consistency, not just implementation.
Practical implication: verify that policy intent, classification, and platform enforcement stay aligned as data moves between governance tooling and the warehouse.
NHI Mgmt Group analysis
Open semantic interchange is becoming a governance control point, not a convenience layer. The article shows that enterprises are no longer just moving data between systems. They are moving meaning, policy, and trust across those systems. Once semantic context is relied on by dashboards and AI workflows, a broken definition becomes an access problem, a quality problem, and an accountability problem at the same time. Practitioners should treat semantic interoperability as part of governance architecture, not a downstream integration detail.
Metric trust is now an identity-adjacent control issue. When business users and AI agents consume the same numbers through different interfaces, the organisation needs a consistent answer to who may see what, and under which policy context. That puts metadata, classification, and access translation into the same operational conversation as IAM and data governance. The implication is that identity teams cannot separate entitlement control from semantic control anymore.
Semantic sprawl creates a new form of control drift. If every platform, dashboard, and AI workflow can reinterpret the same business term differently, then governance fragmentation happens even when access policies exist. This is a classic control-plane failure in modern data estates: the organisation believes it has one policy, but multiple tools apply multiple interpretations. Practitioners should expect audit and assurance work to move toward definition integrity as much as entitlement integrity.
Open standards matter because they reduce lock-in of meaning, not just data. Interoperability is often discussed as a technical portability issue, but the deeper benefit is governance portability. If semantics can travel with the data, organisations can preserve policy intent across changing warehouses, AI models, and analytical tools. That makes the governance model more durable as platforms evolve.
The named concept here is governed semantic trust. This is the condition in which data users, engineers, and AI systems all rely on the same authoritative meaning, lineage, and policy context. Without it, speed increases but assurance erodes. The practical conclusion is that governance teams need to measure whether semantic consistency is actually surviving across consumption paths, not just whether the underlying data platform is up to date.
From our research:
- Only 5.7% of organisations have full visibility into their service accounts, according to Ultimate Guide to NHIs.
- Another finding from the same research shows that 97% of NHIs carry excessive privileges, which broadens the attack surface even when governance exists on paper.
- For teams building semantic and access controls together, Ultimate Guide to NHIs is the best next resource for lifecycle, visibility, and Zero Trust context.
What this signals
Governed semantic trust: the next governance benchmark is whether meaning, lineage, and access policy remain aligned as data moves across BI, warehouse, and AI consumption paths. When those layers diverge, the control failure is no longer just a data quality issue, it becomes a trust issue for the whole programme.
Teams should expect more pressure to prove that policy translation is consistent rather than merely configured. If the same business term can resolve to different values or visibility rules depending on the tool, assurance evidence will need to include semantic consistency checks, not only access review artefacts.
The practical signal for identity and governance teams is that data context is becoming operational context. That means the organisation will need ownership for semantic definitions, masking rules, and lineage evidence that can stand up to audit and AI consumption alike.
For practitioners
- Inventory semantic dependencies across key data products Map the metrics, dimensions, and governed definitions that feed executive dashboards, AI models, and operational reports. Prioritise the definitions that drive regulated decisions or customer-facing workflows, because those are the ones most likely to create control failures when they diverge.
- Translate policy language into enforceable warehouse controls Validate that plain-language access rules are represented in row access policies, dynamic masking, and any downstream enforcement logic. Re-test the translation whenever dataset classifications, roles, or consumption paths change.
- Treat semantic lineage as part of assurance evidence Include metric provenance, source facts, and transformation logic in governance reviews so audit teams can verify that business meaning has not drifted across tools. This is especially important where AI outputs depend on curated semantic context.
- Define a control owner for semantic consistency Assign ownership for business definitions, data classification, and access translation so the semantic layer does not become an orphaned governance asset. The owner should be able to resolve disputes when tools present conflicting versions of the same metric.
Key takeaways
- Open semantic interchange shifts governance from static definitions to portable trust across data and AI systems.
- The biggest risk is not only who can access the data, but whether every tool interprets the same data the same way.
- Practitioners should govern semantic context, policy translation, and lineage together before AI and analytics workflows amplify inconsistencies.
Standards & Framework Alignment
This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.
NIST CSF 2.0, NIST Zero Trust (SP 800-207) and NIST AI RMF set the governance and control requirements practitioners need to meet.
| Framework | Control / Reference | Relevance |
|---|---|---|
| NIST CSF 2.0 | PR.AC-4 | Access and authorisation still need to align with governed semantic context. |
| NIST Zero Trust (SP 800-207) | AC-3 | Zero trust requires policy enforcement at each consumption path, not one trusted data perimeter. |
| NIST AI RMF | AI systems consuming governed context need lifecycle accountability and provenance controls. |
Apply continuous authorisation to data consumers and verify policy translation at every access point.
Key terms
- Governed semantic layer: A governed semantic layer is the authoritative translation layer between raw data and business meaning. It standardises metrics, dimensions, lineage, and policy context so different tools do not invent their own version of the truth. In practice, it becomes part of the control plane because it shapes what users and AI systems can safely consume.
- Semantic interoperability: Semantic interoperability is the ability for different systems to exchange data without losing the meaning attached to it. In identity and governance programmes, it means the same metric, classification, or policy context can be understood consistently across platforms, analytics tools, and AI workflows.
- Policy translation: Policy translation is the process of converting human-readable governance rules into enforceable technical controls. In data environments, that often means turning access, masking, and classification requirements into warehouse-native rules that behave consistently when data is queried, moved, or reused.
- Semantic drift: Semantic drift is the gradual loss of consistency in how business terms, metrics, or classifications are defined across systems. It can occur even when access controls are in place, which makes it especially dangerous for analytics and AI because users may trust outputs that no longer mean the same thing everywhere.
Deepen your knowledge
NHI governance, agentic AI identity, and machine identity security are core topics in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are responsible for identity security strategy or governance in your organisation, it is worth exploring.
This post draws on content published by Collibra: Accelerating data delivery using OSI with Snowflake and Collibra. Read the original.
Published by the NHIMG editorial team on 2026-03-13.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org