CertiK at IDAI Summit 2026: AI Adoption & Digital Asset Cybersecurity

公司动态 ·公告 ·
CertiK at IDAI Summit 2026: AI Adoption & Digital Asset Cybersecurity

CertiK recently joined IDAI Summit 2026 for their AI Adoption & Digital Asset Cybersecurity event, where we covered the security risks emerging at the intersection of AI and Web3. As AI agents take on more autonomous roles in managing digital assets and executing on-chain transactions, the attack surface is expanding in ways that traditional security practices weren't built to handle.

AI Adoption Is Outpacing Security

The incidents speak for themselves. In early 2026, CertiK's security research team identified a gap in the Openclaw ecosystem, an open-source AI agent platform. By late January, 12% of all ClawHub skills were malicious: 341 out of 2,857. By mid-February, that number had grown to over 824 malicious skills bundled with 1,184 malicious packages. Without proper runtime permissions and sandboxing, a single missed review can compromise an entire host.

Openclaw's explosive growth accumulated massive security debt, racking up over 280 GitHub Security Advisories and 100 CVEs between November 2025 and March 2026.

Then there was the Lobstar Wilde incident. An AI agent transferred tokens worth up to $450,000 to a stranger on X. A session crash had wiped the agent's memory, causing it to forget what it owned, misread a social media post as a legitimate request, and sign an irreversible on-chain transaction.

AI agents are being deployed with real economic authority before the security frameworks needed to govern them have matured.

Structural Weaknesses of AI Agent Boundaries

To protect Web3 ecosystems, we must move past treating these incidents as isolated software bugs. The historical vulnerabilities uncovered in early 2026 expose three common architectural blind spots inherent to LLM-driven execution environments:

1. Indirection Gaps (Validation vs. Execution Divergence)

A critical threat pattern in agent systems is the divergence between what the security policy layer validates and what the system environment ultimately executes. For example, OpenClaw's pre-approved command system (safeBins) utilized strict string matching to block high-risk flags like --compress-program.

However, because GNU coreutils natively accepts short command abbreviations, attackers used variations like --compress-prog or --compress-p to easily bypass exact-match deny lists. The validation layer saw a safe string, but the underlying shell resolved it into the prohibited flag.

2. Fragile Multi-Channel Identity Binding

When an agent is granted control over financial keys, identifying the genuine initiator of a transaction is paramount. Integrating an agent with multiple messaging platforms (such as Slack, Telegram, or Discord) introduces intense structural friction.

The primary flaw was relying on mutable identity attributes—such as Telegram @username strings or Google Chat email aliases—for access control. Because these handles can be changed, deleted, or recycled within domains, malicious actors could claim expired handles to seamlessly hijack the agent's full execution pipeline.

Furthermore, multi-modal integration routinely conflates distinct privilege levels, such as failing to isolate open direct-messaging (DM) contexts from administrative console commands.

3. State and Memory Poisoning

Unlike traditional applications that reset at the end of a session, autonomous agents maintain continuity via persistent data layers. This creates a massive vector for long-term, indirect contamination.

Attackers use untrusted external feeds—such as web scrapers, inbound emails, or webhook payloads—to introduce hostile natural language instructions.

Rather than triggering an immediate crash, these payloads instruct the model to flush malicious guidance directly into its long-term core memory files, such as SOUL.md, HEARTBEAT.md, or MEMORY.md.

Because these files are automatically appended to the system prompt in subsequent turns, the agent becomes permanently compromised, executing delayed malicious transfers or silently leaking sensitive keys.

CertiK’s AI Auditor

AI Auditor is CertiK's audit infrastructure, built to handle baseline detection and monitoring so human auditors can focus on the higher-order problems that require their judgment.

It operates on a MultiScanner architecture: multiple specialized models running in parallel, each optimized for different vulnerability classes. Rather than relying on static training data, it draws from a continuously evolving Knowledge Base built from real-world exploits and audit findings. Security insights adapt as the threat landscape does.

The workflow is fast. Teams connect their repository, define scan scope, run the scan, review findings through a structured triage workflow, and export results to share with their team or auditors, all within hours. AI Auditor supports Solidity, Move, and Rust, with severity classification to surface what matters first and custom scan scope controls to manage costs.

In evaluations against 35 real-world Web3 security incidents from 2026, none of which were used in model training, AI Auditor achieved an 88.6% cumulative exact hit rate with low noise. It's built on CertiK's methodology from over 5,000 audit engagements and was deployed internally by our own audit teams before public release.

Security as Infrastructure: Practical Framework for Web3 Teams

Security in Web3 can no longer function as a checkpoint at the end of development. It needs to be embedded where decisions are made. Deploying an asset-enabled AI agent is equivalent to hiring a highly privileged employee; it demands strict access governance.

To mitigate these structural risks, Web3 teams developing or integrating agentic workflows are highly encouraged to consider the following architectural best practices:

  • Ditch Blanket Tool Policies: Avoid using emergency allow-all settings for tools like exec, file_write, or browser automation, which render sandboxes entirely ineffective. Scope tools strictly to target directories and explicit command white-lists.
  • Activate Active Log Redaction: Ensure that diagnostic masking constraints (such as logging.redactSensitive) remain hardcoded to "on" during production. This prevents raw private keys, API credentials, or transaction data from being written directly into local logs, where they can be exfiltrated via log poisoning.
  • Establish Human-In-The-Loop Multi-Sigs: AI decision-making outputs are inherently probabilistic rather than deterministic. For highly critical, irreversible actions—such as structural configuration updates or on-chain fund transfers—implement mandatory multi-signature interception or manual human approval barriers to mitigate model reasoning failures.

The pace of AI adoption in blockchain will only accelerate from here. The question for every team building in this space is whether their security posture is keeping up.

FAQs

What is CertiK AI Auditor?

AI Auditor is CertiK's audit infrastructure that uses multiple AI models in parallel to detect vulnerabilities in smart contracts and blockchain systems. It's built to provide fast, high-signal security analysis at development speed.

How accurate is CertiK AI Auditor?

In evaluations against 35 real-world Web3 security incidents from 2026, AI Auditor achieved an 88.6% cumulative exact hit rate while maintaining low false positive rates.

What was the Openclaw security incident?

Openclaw is an open-source AI agent platform. In early 2026, CertiK researchers found that 12% of its ClawHub skills were malicious by late January. By mid-February, that had grown to over 824 malicious skills bundled with 1,184 malicious packages.

What was the Lobstar Wilde incident?

In February 2026, an AI agent lost its session memory and transferred tokens worth up to $450,000 to a stranger on X after misreading a social media post as a legitimate transfer request. The on-chain transaction was irreversible.

Why is AI security a growing concern in Web3?

AI agents are increasingly being deployed with the ability to manage and transact digital assets autonomously. Without adequate security frameworks, gaps in memory management, plugin integrity, and permission controls can lead to significant financial losses.

相关博客

Inside the CertiK x XDC Fireside Chat: Trade Finance, Tokenization, and AI Security
新的 · 政策脉搏

Inside the CertiK x XDC Fireside Chat: Trade Finance, Tokenization, and AI Security

CertiK Co-Founder and CEO, Rongui Gu, and XDC Foundation’s Billy Sebell discuss trade finance, tokenization, AI-driven cybersecurity risks, and the infrastructure needed for institutional blockchain adoption.

CertiK and CoinW Discuss AI, Cybersecurity, and the Future of Secure Crypto Trading
新的 · 政策脉搏

CertiK and CoinW Discuss AI, Cybersecurity, and the Future of Secure Crypto Trading

CertiK Co-Founder and CEO, Rongui Gu, and CoinW’s Marketing Director Manfred Chew discuss exchange security, AI-driven cyber threats, transparency, and the future of user trust in Web3.

The Rise of the Agent Economy, Part 2:  Security Deep Dive into EIP-8004, EIP-8183, Hooks, and Evaluators

The Rise of the Agent Economy, Part 2: Security Deep Dive into EIP-8004, EIP-8183, Hooks, and Evaluators

Standards like EIP-8004 and EIP-8183 are strong foundations, but as we build more complex layers, such as scoring systems, hooks, and AI evaluators, new risks are introduced.