Subscribe to the Non-Human & AI Identity Journal

Notifications
Clear all

SGLang deserialization flaws: what IAM and AI teams need to know


(@nhi-mgmt-group)
Member Moderator
Joined: 1 year ago
Posts: 4368
Topic starter  

TL;DR: Three unsafe deserialization flaws in SGLang, including two unauthenticated remote code execution paths that trigger when exposed multimodal or disaggregation features accept network input, plus a third crash-dump replay issue tied to malicious .pkl files, were found by Orca Security. The broader lesson is that AI serving frameworks still treat untrusted bytes as trusted control flow, which makes runtime trust boundaries the real security control.

NHIMG editorial — based on content published by Orca Security: SGLang unsafe deserialization vulnerabilities in AI serving frameworks

Questions worth separating out

Q: What breaks when AI serving frameworks deserialize untrusted network data?

A: The trust boundary collapses before the application can validate the request.

Q: Why do AI serving brokers create hidden NHI risk in Kubernetes and cloud environments?

A: Because they are privileged non-human identities that often listen on internal ports and handle sensitive model traffic.

Q: How do security teams know whether an inference stack is exposed to deserialization abuse?

A: Check for any path that accepts external or semi-trusted .pkl, pickle, or equivalent serialized objects, especially in network brokers, replay scripts, job queues, and admin utilities.

Practitioner guidance

  • Remove pickle from externally reachable AI paths Replace pickle-based network deserialization with schema-validated formats such as JSON, msgpack, or Protocol Buffers wherever input can cross a trust boundary.
  • Constrain broker reachability to trusted interfaces Bind internal brokers to localhost or tightly segmented private networks, then enforce firewall rules so only known internal clients can reach the port.
  • Audit every replay and debugging utility for file trust Treat crash-dump replay scripts and offline loaders as privileged execution paths, and require provenance checks before any .pkl artifact is opened.

What's in the full article

Orca Security's full research covers the operational detail this post intentionally leaves for the source:

  • Line-by-line attack flow for CVE-2026-3059 and CVE-2026-3060, including the exact broker and receiver code paths
  • Proposed patch analysis showing the localhost-binding and msgpack migration approach in more depth
  • Detection guidance with process- and network-level indicators that help distinguish exploitation from normal inference traffic
  • Disclosure timeline, affected versions, and the unmerged patch status for teams tracking remediation exposure

👉 Read Orca Security's analysis of SGLang pickle-based RCE paths and AI workload exposure →

SGLang deserialization flaws: what IAM and AI teams need to know?

Explore further

View Full Forum →  |  NHI Foundation Course →



   
Quote
Share: