TL;DR: System prompt leakage, prompt reverse-engineering, and RAG manipulation are emerging as major LLM security risks in 2025, with Gartner projecting that by 2027 half of enterprise GenAI models will be industry- or function-specific, according to Lasso Security. Guardrails alone are not enough because security has to move into externalised controls, context-aware access, and compliance-ready architecture.
NHIMG editorial — based on content published by Lasso Security: LLM Security Predictions: What's Coming Over the Horizon in 2025?
By the numbers:
- Gartner predicts that by 2027, half of GenAI models that enterprises use will be designed for specific industries or business functions.
- One study shows that most RAG attacks settle around a 40% success rate, which can rise to 60% if ambiguous answers are counted as successful attacks.
- Lasso Security says its RapidClassifier can run custom security policies in under 50 milliseconds.
Questions worth separating out
Q: How should security teams handle system prompts that may contain sensitive data?
A: They should remove credentials, internal rules, and hidden routing logic from prompts and place them in governed external systems.
Q: Why do LLM guardrails fail when attackers can reverse-engineer prompts?
A: Guardrails fail because they rely on the model to preserve policy secrecy.
Q: What do security teams get wrong about RAG risk?
A: They often focus on the model and ignore the retrieval layer.
Practitioner guidance
- Separate secrets from prompts Move credentials, connection strings, and internal rules out of system prompts and into secure vaults or external control planes.
- Externalise security enforcement Use external policy systems for allow, deny, and data-filtering decisions so the LLM is not the sole gatekeeper.
- Test for prompt reverse-engineering Red team the application with prompt injection, behavioural probing, and retrieval manipulation scenarios.
What's in the full article
Lasso Security's full research covers the operational detail this post intentionally leaves for the source:
- How RapidClassifier is positioned to enforce policies in under 50 milliseconds across live GenAI interactions.
- The article's detailed breakdown of RAG attack patterns and why retrieval ranking becomes an exploitation point.
- Practical examples of context-based access control decisions for user role, query sensitivity, and retrieved document handling.
- The vendor's implementation framing for separating secrets, connection strings, and internal rules from system prompts.
👉 Read Lasso Security's analysis of system prompt leakage and RAG risk →
System prompt leakage in LLMs: are your guardrails enough?
Explore further