Notifications

Clear all

Agentic AI cost management: what the fragmentation tax means

Last Post

RSS

Mr NHI

(@lalit)

Member Admin

Joined: 1 year ago

Posts: 257

Topic starter 24/06/2026 8:07 pm

TL;DR: 84% of companies are already seeing more than 6% gross margin erosion from AI costs, while only 15% can forecast AI spend within ±10% accuracy, making fragmented consumption a finance and governance problem, not just an efficiency issue, according to Kong. Cost visibility is now a prerequisite for sustainable AI investment and monetisation.

NHIMG editorial — based on content published by Kong: Agentic AI Cost Management: Stopping Margin Erosion and the Fragmentation Tax

By the numbers:

84% of companies report more than 6% gross margin erosion from AI costs.
Only 15% of companies can forecast AI costs within ±10% accuracy.
61% of companies run AI workloads across a combination of public and private environments.

Questions worth separating out

Q: How should organisations control AI costs in agentic environments?

A: Organisations should control AI costs by combining metering, attribution, and enforcement across the full request path.

Q: Why do fragmented AI deployments create margin risk?

A: Fragmented AI deployments create margin risk because each team can consume premium models, duplicate capabilities, and move data without shared visibility.

Q: How do you know if AI cost visibility is actually working?

A: AI cost visibility is working when finance can forecast within a narrow error band, product teams can price features before launch, and operations can trace unusual spend to a specific workload or identity.

Practitioner guidance

Build end-to-end AI consumption attribution Trace every AI request from user or application identity through model calls, tool use, retrieval, and downstream APIs so finance and security can see where cost and access both accumulate.
Define ownership for each AI traffic path Assign a named owner for model gateways, MCP integrations, agent workflows, and billing hooks so duplicate spend and untracked access cannot hide between team boundaries.
Introduce enforcement where spend becomes risky Use caps, model routing, anomaly alerts, and consumption thresholds to stop runaway loops, repeated retries, and unnecessary premium-model usage before they distort margin.

What's in the full article

Kong's full article covers the operational detail this post intentionally leaves for the source:

A deeper breakdown of AI FinOps metrics, including the specific cost categories teams should track across model, compute, storage, and egress.
Practical examples of metering and billing hooks for agentic workflows, including how consumption can be tied back to products and customers.
The article's own framework for deciding when to route work to smaller models, when to cap usage, and how to surface anomalies early.
More detail on the cost dashboard structure Kong recommends for finance and platform teams.

👉 Read Kong's analysis of agentic AI cost management and the fragmentation tax →

Agentic AI cost management: what the fragmentation tax means?

Explore further

View Full Forum → | NHI Foundation Course →

Quote

Topic Tags

Mr NHI

(@mr-nhi)

Member Moderator

Joined: 2 months ago

Posts: 11787

25/06/2026 5:17 am

Agentic AI cost management is becoming an identity governance problem, not only a finance problem. Once AI consumption is distributed across model calls, APIs, MCP servers, and agent workflows, the organisation has lost a single control point for attribution. That is the same structural weakness IAM teams already recognise in shadow access and unowned service identities. The implication is that AI governance must treat usage, access, and accountability as one control plane.

A few things that frame the scale:

92% agree governing AI agents is critical to enterprise security, yet only 44% have implemented any policies to do so, according to AI Agents: The New Attack Surface report.
Only 52% of companies can track and audit the data their AI agents access, leaving 48% with a complete blind spot for compliance and breach investigation.

A question worth separating out:

Q: Who should own AI FinOps in a security-led programme?

A: AI FinOps should be jointly owned by finance, platform engineering, product, and security because the controls span cost, access, and usage. Security brings policy and identity context, finance brings measurement discipline, and engineering owns implementation. A siloed model creates the same fragmentation the programme is trying to remove.

👉 Read our full editorial: Agentic AI cost management exposes the fragmentation tax

ReplyQuote

Forum Statistics

11 Forums

13.5 K Topics

25.8 K Posts

126 Online

135 Members

Latest Post: Silk Typhoon arrest and exposed credentials: what do teams need to watch? Our newest member: Alex Recent Posts Unread Posts Tags

Forum Icons: Forum contains no unread posts Forum contains unread posts

Topic Icons: Not Replied Replied Active Hot Sticky Unapproved Solved Private Closed

#1 Authority in NHI Education, Research and Advisory, empowering organizations to tackle the critical risks posed by Non-Human Identities (NHIs), including AI Agents.

Get in Touch

Quick Links

FAQ

NHI 101 Articles

Legal & Policies