Subscribe to the Non-Human & AI Identity Journal
Home FAQ Governance, Ownership & Risk What do organisations get wrong about unstructured data…
Governance, Ownership & Risk

What do organisations get wrong about unstructured data access?

← Back to all FAQ
By NHI Mgmt Group Editorial Team Updated June 25, 2026 Domain: Governance, Ownership & Risk

They often certify user accounts without tracing those accounts to the files, repositories, and collaboration spaces they can actually reach. That leaves sensitive data exposed even when application access looks clean on paper. Effective governance needs data-level visibility, not just login-level control.

Why This Matters for Security Teams

Unstructured data is where governance assumptions break first. Security teams often focus on user provisioning, application entitlements, and folder-level permissions, yet the real exposure sits in files, shared drives, chat exports, wiki pages, and collaboration spaces that accumulate over time. Once access is granted, it is rarely re-evaluated against the sensitivity of the content itself. That gap is why data-level visibility matters more than clean login records, especially in environments where service accounts, shared links, and inherited permissions spread access far beyond intent. NHI Mgmt Group notes that only 5.7% of organisations have full visibility into their service accounts in the Ultimate Guide to NHIs. In practice, many security teams discover unstructured data overexposure only after a search index, shared workspace, or stale access path has already been abused, rather than through intentional review.

How It Works in Practice

The practical mistake is treating unstructured data access like a simple IAM problem. A user or agent may be properly authenticated and still reach hundreds of sensitive files through inherited permissions, embedded links, synced folders, or collaboration defaults. That is why effective governance has to combine identity controls with content discovery, classification, and continuous entitlement review. NHI Mgmt Group’s Ultimate Guide to NHIs — Key Research and Survey Results shows how often organisations lose visibility once credentials are issued, and the same pattern appears in file stores where access spreads faster than review cycles. Practitioners typically need to map three layers together:
  • Who can authenticate, including users, service accounts, and other NHIs.
  • What repositories, shares, and collaboration spaces they can actually reach.
  • What sensitive data is present in those locations and whether it is being copied, exported, or shared externally.
Current guidance from the OWASP Non-Human Identity Top 10 aligns with this view: identity controls are necessary, but not sufficient, when credentials or tokens can be reused across many unstructured data surfaces. The operational fix is to inventory the storage systems first, then correlate permissions, activity, and content sensitivity, and finally tighten sharing defaults and revoke stale access. These controls tend to break down in fast-moving collaboration environments where external sharing, auto-sync clients, and inherited workspace permissions make access drift faster than review processes can keep up.

Common Variations and Edge Cases

Tighter data access control often increases operational overhead, requiring organisations to balance visibility gains against productivity and change management. That tradeoff becomes sharper when unstructured data is spread across multiple cloud tenants, regional file stores, or third-party collaboration tools. There is no universal standard for this yet, but current guidance suggests prioritising the highest-risk repositories first rather than attempting full coverage in one pass. A few edge cases matter:
  • Shared links can bypass normal folder reasoning, so link auditing matters as much as ACL review.
  • Service accounts may have legitimate bulk-read access for indexing or automation, but they should be constrained and monitored because they can silently widen exposure.
  • Retention and legal hold can preserve sensitive content long after business need has ended, so access review and data lifecycle review must be linked.
The broader risk picture is consistent with NHI Mgmt Group research: 97% of NHIs carry excessive privileges, and 79% of organisations have experienced secrets leaks, both of which amplify unstructured data exposure when access is not tied back to real use cases. For practitioners, the important question is not just whether a user can log in, but whether that identity can still reach the wrong document, in the wrong workspace, for the wrong reason. For deeper context, the Ultimate Guide to NHIs — Key Challenges and Risks is a useful reference point.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST CSF 2.0 and NIST Zero Trust (SP 800-207) set the governance and control requirements practitioners need to meet.

FrameworkControl / ReferenceRelevance
OWASP Non-Human Identity Top 10NHI-01Unstructured data access often stems from overbroad identity reach and stale entitlements.
NIST CSF 2.0PR.AC-4The issue is ineffective access governance across data repositories and collaboration spaces.
NIST Zero Trust (SP 800-207)ID.AMZero Trust requires knowing what data and assets identities can actually reach.

Continuously review access rights to unstructured data and revoke permissions that are no longer justified.

NHIMG Editorial Note
Reviewed and updated by the NHIMG editorial team on June 25, 2026.
NHI Mgmt Group — the #1 independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org