Kafka ACLs control basic access to brokers and topics, while an event gateway can enforce identity-aware policy, validate schemas, mask data, rate limit traffic, and tie auditability to the caller. They solve different layers of the problem, so teams often need both, but only a gateway can govern the event itself.
Why This Matters for Security Teams
Kafka ACLs and an event gateway are often discussed as if they are interchangeable controls, but they sit at different layers of the trust model. Kafka ACLs answer a narrow question: who can connect, produce, consume, or alter a broker-backed resource. An event gateway answers a broader one: should this caller be allowed to publish this event, with this schema, containing this data, under these conditions. That distinction matters when the caller is an NHI, because static broker permissions do not express event-level intent or content-sensitive policy.
For teams managing secrets, service accounts, and machine-to-machine traffic, this gap is not theoretical. NHIs outnumber human identities by 25x to 50x in modern enterprises, and NHI governance failures regularly show up as overbroad access, leaked secrets, and difficult-to-audit event flows. NHI Mgmt Group’s Ultimate Guide to NHIs — What are Non-Human Identities is explicit that visibility and rotation problems are widespread, which is why broker-only controls are rarely enough. The broader risk picture also aligns with the NIST Cybersecurity Framework 2.0, which pushes organisations toward stronger identification, monitoring, and access governance across the full lifecycle.
In practice, many security teams encounter uncontrolled event publishing only after a service account has already been over-permissioned and data has already moved through trusted pipelines.
How It Works in Practice
Kafka ACLs are useful for coarse transport and broker governance. They can prevent a principal from reading a topic, writing to a cluster, or performing administrative actions. That makes them foundational, but limited. They do not inspect payloads, evaluate business context, or decide whether a specific message should be masked, rejected, delayed, or tagged for audit.
An event gateway sits in front of the broker or event stream and evaluates the request before the event is accepted. In practice, that usually means four layers of control working together:
- Identity validation for the caller, often tied to workload identity rather than a static secret.
- Policy evaluation at request time, using context such as topic name, schema version, source system, and destination.
- Content controls such as schema validation, field-level masking, or sensitive-data filtering.
- Operational controls such as rate limiting, replay protection, and audit logging tied to the caller.
This is why event gateways are closer to Zero Trust enforcement than simple ACLs. They can apply decisions dynamically, which is important when the producer is an automated workload whose behavior changes with code, model output, or upstream triggers. NHI Mgmt Group’s research on NHI governance highlights how frequently organisations lose visibility into service accounts, and that visibility gap is exactly where broker-only permissions become brittle. Current guidance suggests pairing Kafka ACLs with gateway policy rather than treating them as substitutes. This approach also maps well to the identity and access emphasis in the NIST Cybersecurity Framework 2.0.
These controls tend to break down when producers bypass the gateway and connect directly to brokers, because the policy decision disappears at the point where the event is actually created.
Common Variations and Edge Cases
Tighter gateway enforcement often increases latency, operational complexity, and schema management overhead, so organisations need to balance stronger governance against throughput and developer friction. That tradeoff becomes especially visible in streaming environments where high-volume producers, legacy services, and partner integrations all share the same event backbone.
There is no universal standard for this yet, but current guidance suggests a layered model. Use Kafka ACLs to constrain broker-level reachability and administrative actions. Use an event gateway when the organisation needs identity-aware policy, field-level controls, tenant isolation, or auditable decisions tied to the caller. If the environment already uses strict network segmentation, teams sometimes assume ACLs are sufficient. They are not, because segmentation stops at the network boundary and does not inspect the event itself.
Edge cases include direct broker clients, batch reprocessors, and internal tooling that was never designed to traverse a gateway. Those flows often require exception handling, but exceptions should be narrow and time-bound. The same principle appears in NHI governance research from Ultimate Guide to NHIs — What are Non-Human Identities: visibility, rotation, and access reduction only work when policy is applied at the point of use, not just at the perimeter.
Standards & Framework Alignment
This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.
OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST CSF 2.0 and NIST Zero Trust (SP 800-207) set the governance and control requirements practitioners need to meet.
| Framework | Control / Reference | Relevance |
|---|---|---|
| OWASP Non-Human Identity Top 10 | NHI-01 | Kafka ACL gaps often stem from weak NHI access scoping and overprivileged service accounts. |
| NIST CSF 2.0 | PR.AA-1 | Identity verification and access control are central when comparing ACLs with event gateways. |
| NIST Zero Trust (SP 800-207) | GV3 | Event gateways implement request-time decisions aligned to Zero Trust principles. |
Map each producer and consumer NHI to least privilege and remove standing broker access where possible.
Related resources from NHI Mgmt Group
- What is the difference between privilege reduction and secret rotation?
- What is the difference between a rules-based secret scanner and a hybrid scanner?
- What is the difference between code scanning and runtime identity monitoring?
- What is the difference between zero trust for users and zero trust for NHIs?