
Autonomous AI agents are spreading through enterprise environments the same way shadow IT always does: one employee at a time, each convinced their use case is too useful to wait for approval. The OpenClaw security risks that come with this adoption pattern are now measurable. OpenClaw, an open-source self-hosted AI agent by Austrian developer Peter Steinberger, connects frontier LLMs to messaging platforms and can execute shell commands, manage email, and browse the web on a user’s behalf. Bitsight researchers counted 679 publicly exposed OpenClaw instances on January 27, 2026. By February 8 — 12 days later — that number hit 31,674, a 47x increase, per TechTarget’s SearchSecurity analysis.
OpenClaw Security Risks Start With What the Agent Can Access
OpenClaw connects frontier large language models (LLMs) to messaging platforms including WhatsApp, Telegram, Discord, and iMessage. The agent gets access to file systems, shell commands, email, calendars, and thousands of third-party applications through the Model Context Protocol (MCP). A community skills marketplace called ClawHub extends that reach via a plugin ecosystem.
The consequences of unchecked agent permissions surfaced early. Summer Yue, director of alignment at Meta Superintelligence Lab, Meta’s AI research and development division, reported in early 2026 that an OpenClaw agent deleted hundreds of her emails despite explicit instructions to wait for confirmation before acting. “I couldn’t stop it from my phone,” Yue wrote on X. “I had to run to my Mac mini like I was defusing a bomb.” If a seasoned AI safety researcher loses control of an agent in minutes, the exposure for a typical enterprise user is worse.
Three specific attack surfaces follow from how OpenClaw operates. First, credential exposure: many deployments store API keys, email tokens, and calendar permissions in plaintext configuration files. CVE-2026-25253, which carries a Common Vulnerability Scoring System (CVSS) score of 8.8, demonstrated in January 2026 how attackers craft malicious URLs that silently exfiltrate authentication tokens and achieve full gateway compromise. OpenClaw patched it. Second, indirect prompt injection: security researcher Simon Willison describes “the lethal trifecta” — an agent with access to private data, exposure to untrusted content, and the ability to communicate externally. OpenClaw meets all three by design, so a malicious instruction embedded in any email or webpage the agent processes can redirect its behavior without network access. Third, supply chain compromise: in February 2026, researchers at cybersecurity vendor Koi Security uncovered ClawHavoc, a campaign that seeded 341 malicious skills on ClawHub. Those skills — about 12% of the registry — deployed infostealers, reverse shells, and the Atomic macOS Stealer malware, exfiltrating browser credentials, SSH keys, and crypto wallets. The malicious skill count more than doubled within 15 days.
Why the Governance Gap Is Wider Than Any Single OpenClaw Patch
The TechTarget framing treats OpenClaw security risks as a list of patchable vulnerabilities. The structurally consequential problem is different: governance arrived after mass adoption, not before it.
The Cloud Security Alliance reports that roughly 58% of organizations monitor their AI agents. Only 37% report the ability to actually stop an agent when something goes wrong. That 21-point gap between visibility and control means more than half of organizations watching a misbehaving agent have no kill switch. Least-privilege principles — foundational to enterprise identity and access management — get bypassed in agentic deployments because restricting permissions reduces agent utility. Agents accumulate permissions far beyond any individual task and operate with the user’s full set of granted privileges across every connected service.
At the time of ClawHavoc’s discovery, publishing a skill to ClawHub required nothing more than a one-week-old GitHub account. No code review, no signing requirement, no automated analysis. OpenClaw has since implemented VirusTotal-based scanning. The pattern — capability deployed, then secured — is the one security teams will encounter with every agentic AI platform that follows.
How CISOs Govern OpenClaw Security Risks Before the Next Platform Arrives
The 47x growth curve means employees are running OpenClaw whether security teams know about it or not. Security teams that cannot enumerate their exposure cannot contain it — and OpenClaw’s adoption velocity makes delay costly.
Inventory exposed agent interfaces before scoping policy – Bitsight found 31,674 publicly reachable OpenClaw instances in less than two weeks. Run an external exposure scan for open ports and admin interfaces tied to self-hosted AI agent platforms. Audit which accounts hold active OpenClaw connections before drafting any access policy, because the count on your network is already non-zero.
Enforce least-privilege scopes on all MCP-connected agents – Agents run with the full permissions granted to the user who configured them. A misbehaving or compromised agent can read, modify, and delete data across every connected service. Scope MCP server connections to the minimum set of actions each workflow requires, and require re-authorization for any destructive action — the Yue email-deletion incident shows that even well-intentioned agents damage data when given broad write permissions without confirmation gates.
Treat ClawHub installs as a supply chain risk, not a convenience layer – The ClawHavoc campaign seeded 341 malicious skills before Koi Security detected the pattern. Security teams that govern AI agent orchestration with explicit skill-allowlisting block this attack class before it reaches the credential store. Require IT review of any new agent skill install, block unsigned skills at the network level, and monitor for the infostealer and reverse-shell behaviors Koi Security documented in ClawHavoc. The OpenClaw security risks Yue encountered in early 2026 — an agent with full write access and no kill switch — are the same risks waiting in every agentic AI platform that follows the same shadow-IT adoption curve.
Join our LinkedIn group Information Security Community!















