TL;DR: Fragmented governance and data silos slow decisions, erode trust, and block legitimate access, while unified governance and data sharing improve agility, insight reuse, and operational efficiency, according to Collibra. The underlying issue is not just access friction, but governance that remains tied to platforms instead of to the full lifecycle of data use.
At a glance
What this is: This is an editorial on how siloed data and fragmented governance limit organizational speed, trust, and decision quality.
Why it matters: It matters to IAM practitioners because the same governance patterns that unblock data sharing also shape access, accountability, and lifecycle control across NHI, autonomous, and human identity programmes.
By the numbers:
- 75% of CDOs say improving data culture and addressing legacy systems are top priorities.
- 72% of organisations have experienced or suspect they have experienced a breach of non-human identities, 46% confirmed and 26% suspected.
👉 Read Collibra's analysis of how data sharing breaks internal silos
Context
Data sharing fails when governance is fragmented, because control ends up attached to a specific platform instead of to the data itself. In practice, that creates silos, inconsistent ownership, and access friction that slows both analytics and operational decision-making.
For IAM and governance teams, the lesson is broader than data management. The same lifecycle discipline that governs non-human identities, access reviews, and privileged access also determines whether humans and systems can reuse trusted data without creating unmanaged exceptions. Collibra's framing is typical of a mature enterprise problem: the issue is structural, not just cultural.
When generative AI starts surfacing flawed or disconnected data at scale, weak governance stops being an efficiency issue and becomes a risk amplifier. That makes data sharing less about opening the floodgates and more about applying consistent governance across every producer, consumer, and workflow.
Key questions
Q: How should organisations reduce data silos without losing governance control?
A: Start by centralising ownership, policy, and metadata for high-value datasets, then apply those controls consistently across platforms. The goal is not unrestricted access. It is reusable access with clear stewardship, lineage, and approval paths so that legitimate users do not need to create shadow workarounds.
Q: Why do siloed data environments make governance slower and less reliable?
A: Because the same dataset ends up governed differently in each platform, which creates inconsistent ownership, duplicated policy decisions, and fragmented audit evidence. That slows legitimate use and makes security teams more likely to default to blanket restriction instead of risk-based access.
Q: What do teams get wrong about data sharing in regulated environments?
A: They often treat data sharing as a permission problem when it is really a governance design problem. If business context, ownership, and quality do not travel with the data, access can be granted but trust is still missing, so the organisation keeps rebuilding control manually.
Q: How can security and data teams tell whether governance is improving data confidence?
A: Look for faster legitimate access, fewer manual exceptions, clearer dataset ownership, and lower dependence on ad hoc interpretation. If users can discover and reuse governed data without repeated clarification, the programme is improving trust rather than just moving records around.
Technical breakdown
Why siloed data governance breaks enterprise access models
Siloed data governance usually emerges when access rules, ownership, and metadata are managed separately in each platform. That creates local optimisations, but it breaks cross-domain visibility, weakens accountability, and makes legitimate access slower than shadow workarounds. The result is not just poor analytics. It is a governance model that cannot keep pace with distributed cloud estates, legacy systems, and AI-driven consumption. When data is treated as platform-bound rather than enterprise-bound, every new use case becomes a custom exception.
Practical implication: map ownership and access policy to the data asset lifecycle, not to each storage platform.
Unified governance and data confidence
Unified governance means decoupling control from individual systems and applying consistent policy across users, sources, and consumption paths. In this model, metadata, context, and policy are central, so teams can understand what data is, who owns it, and how it may be used. That is what turns raw availability into data confidence. Without that shared context, self-service becomes guesswork and security teams default to overrestriction. The governance problem is not access alone, but trust in what is being accessed.
Practical implication: build a central inventory of business context, access rules, and stewardship so self-service can be controlled without being stalled.
Data sharing under generative AI pressure
Generative AI raises the stakes because it consumes data at speed and can surface gaps in quality, lineage, and control that human users might never hit at scale. If the underlying governance is weak, AI does not create the problem, it exposes it faster. That makes data sharing programs inseparable from lifecycle governance, quality controls, and accountability for downstream use. The technical challenge is not only moving data. It is preserving confidence as the number of consumers and decision paths grows.
Practical implication: include AI consumption paths in governance design and monitor whether shared data remains trusted after automated reuse.
NHI Mgmt Group analysis
Fragmented data governance is a lifecycle problem, not a tooling problem. The article describes a common failure mode where ownership, policy, and access are split across platforms, so no one can govern the full path from producer to consumer. That is the same structural weakness that appears in identity programmes when lifecycle controls are tied to systems instead of subjects. Practitioners should treat governance fragmentation as an operating model defect, not a feature gap.
Data sharing only works when trust is separated from the storage layer. Collibra's framing is strongest when it shows that teams do not merely need more access, they need confidence that the data they receive is understood, owned, and fit for use. That aligns with NIST Cybersecurity Framework 2.0 and enterprise governance practice: access is only useful when context travels with it. The implication is that trust must be portable across systems, or shared data will remain politically possible but operationally brittle.
Data confidence is the named concept this article surfaces. It is the point at which business users, security teams, and data stewards can rely on the same governed dataset without re-litigating ownership or quality at each handoff. That concept matters because modern analytics and AI consume data faster than traditional approval chains can adjudicate it. Practitioners should recognise data confidence as a governance outcome, not a technology feature.
Generative AI turns siloed governance into an enterprise-wide amplification mechanism. The article correctly notes that flawed or disconnected data becomes more dangerous when AI surfaces it at scale. That is not a new risk class so much as a multiplication effect on existing data and access weaknesses. The implication for governance leaders is that AI readiness depends on whether the data estate already has consistent ownership, lineage, and stewardship.
Organisations that still think of data sharing as permissioning will keep recreating silos. The real issue is whether the enterprise can maintain consistent control while allowing broad, legitimate use. That is why IAM, IGA, and data governance need to converge around a shared lifecycle model. Practitioners should measure whether governance makes reuse safer, not merely whether it makes access possible.
From our research:
- 72% of organisations have experienced or suspect they have experienced a breach of non-human identities, 46% confirmed and 26% suspected, according to The 2024 ESG Report: Managing Non-Human Identities.
- Two-thirds of enterprises have endured a successful cyberattack resulting from compromised non-human identities, with a quarter encountering multiple attacks, which shows how quickly unmanaged access becomes operational risk.
- That is why the broader control model matters, and the practical starting point is Ultimate Guide to NHIs , Lifecycle Processes for Managing NHIs for lifecycle governance across machine identities.
What this signals
Data confidence will become the governance metric that matters most. Teams that only measure dataset availability will miss the real question, which is whether people and systems can reuse data without rework, exception handling, or second-guessing. The organisations that solve this will be the ones that can combine access, stewardship, and provenance in one operating model.
With 72% of organisations already reporting or suspecting a non-human identity breach, the same pattern that weakens data governance also weakens machine access control. The practical signal is whether your programme can govern reuse without creating unmanaged exceptions across systems and workloads.
Unified governance is the bridge between data sharing and identity control. That bridge matters because AI, analytics, and automated workflows all consume data faster than legacy approval chains can absorb. Practitioners should watch for whether shared-data programmes are expanding control coverage or merely moving risk into more places.
For practitioners
- Inventory data ownership across platforms Create a single stewardship view for high-value datasets, including business owner, technical owner, and approval path. If ownership changes by environment, the dataset is still siloed even if it is technically accessible.
- Decouple access policy from storage platform Define policy once at the enterprise level, then apply it consistently across cloud, legacy, and analytics systems. This reduces local exceptions and makes policy review possible across the full data lifecycle.
- Track trust signals, not only access counts Measure whether users can find, understand, and reuse data without additional manual intervention. Usage growth alone is not enough if teams still depend on ad hoc clarifications or shadow extracts.
- Include AI consumption paths in governance reviews Review whether generative AI and analytics pipelines are pulling from governed datasets with clear lineage and quality controls. If not, the same silo risk is being amplified through automation.
Key takeaways
- Siloed data governance slows collaboration because ownership, policy, and context are split across systems.
- The scale of the governance problem is already visible, with most organisations reporting or suspecting non-human identity breaches.
- Practitioners should treat unified governance as the control model that makes shared data usable without creating new exceptions.
Standards & Framework Alignment
This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.
NIST CSF 2.0, NIST CSF 2.0 and NIST Zero Trust (SP 800-207) set the governance and control requirements practitioners need to meet.
| Framework | Control / Reference | Relevance |
|---|---|---|
| NIST CSF 2.0 | PR.AC-4 | Shared data governance depends on least privilege and consistent access enforcement. |
| NIST CSF 2.0 | GV.OC-1 | The article centers on organisational context, ownership, and control visibility. |
| NIST Zero Trust (SP 800-207) | Unified governance supports continuous verification across data consumers and systems. |
Document governance ownership and business context so data sharing stays accountable.
Key terms
- Unified Governance: A governance model that applies consistent policy, ownership, and context across systems instead of tying control to one platform. It matters because reusable data and trusted access depend on the same rules following the asset, not the storage location.
- Data Confidence: The degree to which users and systems can trust that data is accurate, owned, and fit for use at the point of consumption. In practice, it combines lineage, stewardship, and policy enforcement so access can scale without forcing repeated manual validation.
- Data Silos: Isolated pockets of data that are difficult to discover, govern, or share across teams and systems. Silos are not just a storage issue. They create inconsistent ownership, slower access decisions, and weaker accountability across the full data lifecycle.
- Lifecycle Governance: The discipline of managing access, ownership, review, and offboarding across the full life of an asset or identity. For data programmes, it means treating the dataset and its consumers as governed objects that must remain accountable as usage changes.
Deepen your knowledge
NHI governance, agentic AI identity, and machine identity lifecycle are core topics in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are responsible for identity security strategy or NHI governance in your organisation, it is worth exploring.
This post draws on content published by Collibra: The organizational immune system, how data sharing cures internal silos. Read the original.
Published by the NHIMG editorial team on 2025-10-15.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org