AI & Agentic Security
The attack surface where prompt injection has moved from chatbot curiosity to infrastructure compromise.
AI agents now sign payloads, call APIs, move money, and act on production systems — through x402 payments, AP2 authorization, MCP servers connecting agents to production workflows. The blast radius of a compromised agent is not a wrong answer; it’s a wrong action that can’t be undone. Traditional application security wasn’t built for systems that reason, chain tool calls across trust boundaries, and act on context that shifts every interaction. We audit the systems making the actions, and the infrastructure they connect to.
[AI SECURITY]
[Fig. 01]
TEAM
Senior Researchers. Every Engagement.
Hexens security researchers are CTF champions, bug bounty leaderboard veterans, and engineers who’ve spent careers breaking systems that weren’t supposed to break.
No junior bench, no rotation, no learning on your codebase.








Credentials earned, not collected.
TOOLING
Security Engineers X Frontier AI
Rigorous, line-by-line review — extended by frontier AI as a force multiplier. The engineer brings the judgment. The model removes the ceiling on what that judgment can reach.
Deeper analysis of individual findings.
More adversarial test cases per surface.
Broader code path exploration.
METHOD
Two Independent Teams. In Parallel.
Two senior security teams run against the same target in parallel, pairing manual review with frontier AI as a force multiplier. Where findings overlap, you have confirmation. Where they diverge, you’ve caught what a single-team audit would have missed.
Beyond scope by default.
Engagements are exclusive.
Retesting, included.
OUTCOME
Findings that hold up to a post-mortem.
The audits that matter are the ones still defensible after something goes wrong. None of ours have been tested that way.
- $120 BLN+In digital assets protected
- Zeropost-audit exploits across 300+ engagements
- 91%client retention rate
- 90%of reports contain critical or high-severity findings
Coverage that neither security engineers nor frontier AI could deliver alone.
AI Agent Security Audit
Decision boundaries, authorization logic, input validation, output constraints, failure modes. How agents behave under adversarial conditions: manipulated inputs, unexpected state transitions, edge cases that lead to unauthorized actions or fund loss.
Agentic Commerce & Payment Protocol Security
Payment authorization, signature verification, settlement integrity, receipt validation, trust boundaries between agents and service providers. x402, AP2, ACP, and the custom payment rails built on top of them. When software pays for software at machine speed, the failure window is measured in seconds.
MCP Server & Tool Integration Security
Tool poisoning, cross-tool hijacking, privilege escalation, shadow servers, supply chain compromise through malicious packages. The OWASP MCP Top 10 categories, tested in production deployments — not synthetic environments.
LLM Security Assessment
Prompt injection (direct and indirect), system prompt extraction, output manipulation, data leakage, jailbreak vectors. We assess both the model interaction layer and the application logic processing model outputs.
MLOps Pipeline Threat Analysis
Training data integrity, model serving infrastructure, feature stores, model versioning, access controls across the ML lifecycle. We identify where an attacker compromises model behavior by manipulating the pipeline, not just the model.
AI Red Teaming
Adversarial simulation against AI systems under realistic attack conditions. Threat actors attempting to manipulate behavior, extract sensitive data, bypass safety controls, or exploit AI systems for lateral access into connected infrastructure. Informed by real-world attack patterns, not synthetic benchmarks.
The agentic AI attack surface is expanding faster than the security industry can respond.
Hexens engineers track these developments in real time - through original vulnerability research, active participation in the security community, and continuous adversarial testing of the same tools and frameworks our clients deploy.
[01]
Top 10
OWASP published a dedicated Top 10 for Agentic AI in December 2025 - memory poisoning, tool misuse, and privilege compromise lead the list.
[02]
+270%
MCP-related vulnerabilities grew in a single quarter.
[03]
93%
Of AI agent frameworks rely on unscoped API keys.
[04]
84%
Of organizations doubt they can pass a compliance audit focused on agent behavior.
[05]
$3–5 T
The agentic commerce market is projected to mediate this much in global commerce by 2030.
[06]
~45%
AI-generated code contains security flaws approximately this often.
[42]
[Fig. 02]


