23,000 Vulns in 1,000 OSS Projects: Your AI Supply Chain Is Already Compromised

AI supply chain security is not a tooling adoption problem. It is the next non-human identity problem — and every AI agent running in production today is a package consumer with no runtime identity, no dependency governance, and no containment layer when a vulnerable library executes.

Last week, Anthropic disclosed findings from Claude Mythos Preview — its most capable frontier model — deployed through Project Glasswing, a controlled access program with roughly 50 participating organizations. The model was tasked with scanning open-source software at scale. In a timeframe measured in weeks, it identified 23,000+ potential vulnerabilities across more than 1,000 OSS projects. Of 1,900 findings reviewed by external security firms, 1,726 were confirmed real — with over 1,000 rated high or critical severity. Anthropic projects the final tally will reach 3,900 confirmed critical/high findings, and 6,200 severe vulnerabilities as scans continue.

That number is staggering on its own. But the real story isn't the count — it's the context. The open-source projects Claude Mythos scanned are not obscure libraries. They are the packages your AI agents are built on: the npm ecosystem, Python tooling like LangChain (847 million cumulative downloads), LangGraph, LiteLLM, and LangFlow — the exact frameworks enterprise AI agents depend on today. The 23,000 figure is not an abstract audit of legacy code. It is a snapshot of the attack surface that every enterprise AI agent is executing against, right now, in production.

And at the same moment Mythos was scanning for known-pattern vulnerabilities, adversaries were running a different playbook: weaponizing the supply chain itself. The TanStack npm worm (May 11, 2026) compromised 42 packages with 518+ million cumulative downloads in under 7 minutes, using legitimate OIDC-attested provenance that passed every SLSA Build Level 3 check. The TrapDoor campaign (May 22, 2026) spread credential-stealing malware across 34 packages on npm, PyPI, and Crates.io — and specifically targeted LangChain, browser-use, and LangFlow with poisoned CLAUDE.md files designed to trick AI assistants into running exfiltration payloads. This is not future risk. It is current-quarter breach data.

23K

Potential vulnerabilities found by Claude Mythos across 1,000+ OSS projects

1,726

Confirmed real vulnerabilities — 1,000+ rated high or critical severity

518M+

Cumulative downloads of packages hit in the TanStack npm worm alone

6,200

Severe vulnerabilities projected as Mythos scans continue across more projects

What the Anthropic Mythos Finding Actually Means

🔍

Claude Mythos Preview: A General-Purpose Model That Can Outcode Human Security Researchers

Project Glasswing · ~50 organizations · Coordinated Vulnerability Disclosure · 90-day window

Claude Mythos Preview is not a specialized vulnerability scanner. It is a general-purpose frontier model that Anthropic has concluded has reached a level of coding capability where it can surpass all but the most skilled human security researchers at finding and exploiting software vulnerabilities — often entirely autonomously.

Under Project Glasswing, Anthropic gave controlled access to Mythos to approximately 50 organizations to scan their own and third-party OSS code. The results were extraordinary in scale: Mozilla found 271 Firefox vulnerabilities using Mythos. Palo Alto Networks identified dozens of flaws in their own codebase. Even widely audited projects like curl received findings — though in curl's case, only one low-severity vulnerability was detected, a testament to how deeply curl's maintainers already invest in security.

Anthropic has noted that the surge in AI-powered vulnerability discovery is adding pressure to an already-overloaded security ecosystem. With 1,100+ unverified findings reported to vendors and only 75 critical/high issues patched so far — and 65 security advisories published — the patch queue is growing faster than the remediation rate. Your OSS dependencies have a non-trivial probability of containing unpatched, AI-discovered vulnerabilities right now, and the timeline for exploitation is compressing as the same model capabilities become available to adversaries.

Sources

How AI Agents Amplify OSS Supply Chain Risk

🤖

Agents Don't Just Depend on OSS — They Autonomously Pull, Execute, and Chain It

LangChain · LangGraph · LiteLLM · MCP servers · npm AI tooling — all exposed at runtime

Traditional software supply chain risk is passive: you ship an application with a dependency, and if the dependency is vulnerable, an attacker can exploit it. AI agents make the risk dynamic and recursive. A production AI agent using LangChain or LangGraph autonomously executes tool calls against npm packages, Python libraries, and MCP servers at runtime. It doesn't just ship with dependencies — it actively pulls and chains them mid-execution, often with no human in the loop.

The LangDrained research (Cyera, 2026) found three distinct vulnerability classes in LangChain/LangGraph — one critical (CVE-2025-68664, "LangGrinch") and two high-severity — exposing filesystem files, environment secrets, and conversation history to any attacker who can influence agent inputs. LangChain has 847 million total downloads. Every enterprise using it as their AI agent framework is running vulnerable package execution without a runtime governance layer that can constrain what those packages are allowed to do.

The attack surface is compounded by MCP servers: Trend Micro found 1,467 MCP servers exposed to the internet with zero authentication by April 2026, and an unsafe defaults vulnerability across LiteLLM, LangChain, and LangFlow MCP implementations triggered remote command execution in 10 separate packages. When an AI agent calls an MCP tool, it is executing a package invocation with no runtime identity check.

No SBOM coverage for agentic instructions: A poisoned skill definition does not trigger a CVE and never appears in a software bill of materials. No mainstream security scanner has a detection category for malicious instructions embedded in agent skill definitions — the exact vector the TrapDoor campaign exploited with poisoned CLAUDE.md files.
SLSA attestation is breakable by build identity compromise: The TanStack worm produced validly-attested SLSA Build Level 3 provenance for malicious packages — the first documented case of an npm worm doing so. Every automated supply chain check passed.
Autonomous execution with no per-call scoping: Most AI agent frameworks grant tool access at framework initialization, not per-call. A compromised dependency package inherits the full tool scope of the agent — there is no least-privilege boundary at the package level without an external governance layer.

Three Supply Chain Incidents That Reframed the Threat Model

🔴

Mini Shai-Hulud / TanStack npm Worm — May 11, 2026

170+ packages · 518M+ cumulative downloads · OIDC token extracted from GitHub Actions runner memory · SLSA Level 3 bypassed

On May 11, 2026 between 19:20 and 19:26 UTC, the threat group TeamPCP published 84 malicious versions across 42 @tanstack/* npm packages. The attack vector: a fork-based "Pwn Request" that triggered a pull_request_target workflow, combined with GitHub Actions cache poisoning. Attacker-controlled binaries extracted an OIDC token from the GitHub Actions runner's process memory, then used it to publish malicious npm packages signed with TanStack's legitimate identity.

Within hours, the worm propagated to Mistral AI, UiPath, Guardrails AI, and dozens of other maintainers. @tanstack/react-router alone receives over 12.7 million weekly downloads. The malicious versions carried legitimate Sigstore attestations — correctly attesting that packages were built and published by the official release workflow. Every automated supply chain check passed. This is the new threat model: the attack doesn't compromise the package contents in a way that scanners detect. It compromises the build identity used to sign the package — so every downstream SLSA and Sigstore verification returns green while malicious code runs.

Sources

🔴

TrapDoor — 34 Packages, AI Frameworks Specifically Targeted — May 22, 2026

21 npm · 7 PyPI · 6 Crates.io · LangChain, browser-use, LangFlow targeted with poisoned CLAUDE.md

The TrapDoor campaign, attributed to the GitHub account ddjidd564, began May 22, 2026. It spread credential-stealing malware across 34 packages — 21 npm, 7 PyPI, 6 Crates.io — masquerading as developer utilities and build helpers. Targeted data: SSH keys, AWS credentials, GitHub tokens, browser login databases, environment variables, and API keys.

What distinguishes TrapDoor is its AI-specific attack vector: the campaign opened pull requests against LangChain, browser-use, and LangFlow with malicious .cursorrules and CLAUDE.md files containing hidden instructions designed to trick AI coding assistants into running a "security scan" that would actually execute secret discovery and exfiltration. An AI agent that reads its own context files — standard behavior for all major AI coding tools — becomes the exfiltration vector. The attacker's infrastructure described the operation as a "Universal AI Agent Extraction Framework."

Sources

🔴

LiteLLM CI/CD Poisoning and UNC6426 nx npm Attack — Q1 2026

Build pipeline compromised · AI framework packages weaponized · AWS Admin access in 72 hours

In March 2026, a poisoned Trivy security scanner in the LiteLLM CI/CD pipeline exfiltrated the project's PyPI publish token. The attacker used it to publish malicious LiteLLM 1.82.7 and 1.82.8 — harvesting SSH keys, GCP/AWS/Azure credentials, kubeconfigs, and database passwords from every developer who installed the package. LiteLLM is the most widely used multi-provider LLM routing library in enterprise AI stacks.

That same month, UNC6426 exploited the nx npm supply chain attack to gain AWS Admin access within 72 hours — demonstrating that the time from a compromised package to full cloud infrastructure access is now measured in hours, not days. Every AI agent that installs unverified packages from infected registries is a potential credential exfiltration pathway into cloud infrastructure.

Sources

Why Runtime Identity Is the Containment Layer

The supply chain security industry's current response centers on SBOM generation, registry scanning, SLSA provenance verification, and pre-deployment static analysis. All of these are necessary. None of them stop execution-time exploitation.

The TanStack worm proved that SLSA Level 3 attestation can be correctly issued for malicious packages. The TrapDoor campaign proved that hidden instructions in agent context files can turn the AI agent itself into the exfiltration vector. The Mythos findings proved that the 23,000 vulnerabilities already present in OSS packages are not being remediated at the rate they're being discovered — and AI-powered attackers will find the same vulnerabilities Mythos found, if they haven't already.

What static analysis cannot do: enforce a runtime identity boundary around each agent's package execution. If a compromised npm package executes inside an AI agent that has no runtime identity scoping its tool access, file system permissions, and network egress, then the compromised package inherits the full execution context of the agent. This is not a scanning problem. It is a governance architecture problem.

How RuntimeAI Governs the AI Supply Chain Attack Surface

RuntimeAI's Identity + Zero-Trust + Defence-in-Depth platform addresses the AI supply chain threat at the execution layer — the only layer where supply chain attacks actually manifest as enterprise risk.

🛡️

KYA (Know Your Agent) — Runtime Identity for Every Agent Workload

Cryptographic agent identity · Scope enforcement · Unknown agents get no access

KYA provides cryptographic workload identity for every AI agent running in your environment. Before any agent can invoke a tool, call an MCP server, or access enterprise data, it must present a verified identity with a declared scope. Unknown agents — including agents that have been compromised by a malicious dependency package and are attempting to escalate scope — receive no access, not just a warning.

This is the containment layer that supply chain attacks cannot bypass: even if a malicious npm package executes inside an agent's dependency tree, it cannot invoke enterprise tools without the agent's declared identity and scoped permissions. The compromised package runs — but it cannot reach your enterprise data, APIs, or infrastructure without clearing the KYA identity gate.

🔌

AI Integration Fabric — Policy-Gated Tool Invocation at Every Package Boundary

Every tool call governed · No direct package access · Behavioral baseline per invocation type

RuntimeAI's AI Integration Fabric sits between AI agents and every enterprise tool they invoke. No agent — and no package executing inside an agent — can call a tool, API, database, or MCP server directly. Every invocation passes through the Integration Fabric, where policy enforces what the agent is allowed to call, with what parameters, and with what scope.

This means that when a compromised LangChain dependency attempts to call a filesystem tool, an AWS API, or an outbound webhook, the Integration Fabric evaluates the call against the agent's declared policy. An invocation that falls outside the agent's declared tool scope — which is exactly what a supply chain exploit would produce — is blocked before it executes. The invocation is logged in the Audit Black Box regardless of outcome.

📋

Audit Black Box — Tamper-Proof Record of Every Package Execution

Immutable · Court-admissible · Package-level call tracing · Forensics-ready

Every tool call, MCP server invocation, and package execution event is recorded in the Audit Black Box — an immutable, tamper-proof audit trail that survives log destruction, agent termination, and infrastructure compromise. When a TanStack-style worm runs inside an AI agent, the Audit Black Box records every downstream action: what tools were called, what parameters were passed, what data was accessed, and what network egress was attempted.

For forensic response: when Mythos or an attacker discovers a vulnerability in a package your agent depends on, you can determine the blast radius — which agents ran the vulnerable package, what they did, and what data they touched. For compliance: regulators increasingly require demonstrable evidence that AI systems operated within declared boundaries. The Audit Black Box generates that evidence automatically, per-invocation, with no manual instrumentation.

📜

Policy Engine — Scoped Tool Access and OSS Package Governance

Rego/OPA policy-as-code · Per-agent tool scoping · Package invocation allowlisting · Zero-day blast radius reduction

RuntimeAI's Policy Engine (built on Rego/OPA) allows security teams to declare exactly which tools each agent is allowed to call, which packages are allowed to execute within each agent's context, and what network destinations are reachable. This is policy-as-code applied to the AI supply chain: instead of scanning packages and hoping the scanner catches every vulnerability, you enforce a behavioral boundary around what the agent — and its entire dependency tree — is allowed to do.

When Claude Mythos finds 23,000 vulnerabilities in OSS packages — or when an attacker exploits one before a patch ships — the Policy Engine limits the blast radius to what the agent's policy allows. A vulnerable deserialization flaw in LangChain Core (CVE-2025-68664 "LangGrinch") cannot be exploited to read filesystem files if the agent's policy doesn't permit filesystem tool calls. Zero-day supply chain vulnerabilities become contained exploits, not enterprise breaches.

Kill Switch: Stopping Supply Chain Execution Mid-Flight

⚡ RuntimeAI Kill Switch — 3-Level Supply Chain Response

Behavioral monitoring at the package execution layer — every tool call and package invocation is monitored against the agent's declared behavioral baseline. When a dependency package begins calling tools outside the agent's normal pattern — a classic sign of supply chain compromise — the anomaly is flagged in real time, not in the next SIEM aggregation cycle.

Human-in-the-loop notification — when L1 flags a supply chain anomaly with high confidence (e.g., a package calling network endpoints outside declared scope), a security operator receives immediate context: which agent, which package, what invocation was attempted, and what policy it violated. Human decision — not autonomous action alone.

Policy-driven kill action, mid-execution — for high-severity supply chain exploit patterns, L3 terminates the agent execution, revokes the workload identity, and blocks the compromised package from invoking further tool calls. This happens before data exfiltrates, before the network connection completes, during the exploit — not after the breach report is filed.

SBOM scanners tell you what packages you have. RuntimeAI Kill Switch stops what they do when they're compromised.

RuntimeAI's Supply Chain Containment Stack

KYA (Know Your Agent) — cryptographic workload identity; unknown or compromised agents get no enterprise access
AI Integration Fabric — policy-gated tool invocation; no package executes enterprise calls without declared scope
Audit Black Box — tamper-proof, per-invocation record of every package execution; blast radius forensics in minutes
Policy Engine (Rego/OPA) — per-agent package invocation allowlisting; zero-day blast radius bounded by declared policy
PII Shield — PII and secrets redacted inline before any tool call output is returned to the agent; supply chain exfiltration of structured data blocked at the boundary
Kill Switch — L1 monitor → L2 human alert → L3 mid-execution kill, before data leaves the perimeter

The Mythos finding is a forcing function. When an AI model can discover 23,000 vulnerabilities in 1,000 OSS projects in weeks — vulnerabilities that have existed undetected for years — the patch queue will never clear fast enough. The containment architecture has to be runtime-first: govern what agents can do with the packages they run, not just what packages they're allowed to install.

Security, control, and governance at the AI execution layer. Independent of whether your dependency scanner caught the vulnerability first.

AI Supply Chain OSS Vulnerabilities Anthropic Mythos LangChain Security npm Security TanStack Worm Runtime Identity AI Agent Governance Zero Trust Defence in Depth KYA RuntimeAI

Govern the AI Supply Chain at Runtime

KYA identity, AI Integration Fabric, Audit Black Box, and Kill Switch — containment at the execution layer, not just the scanner.

Or visit → www.runtimeai.io/trial

23,000 Vulns in 1,000 OSS Projects.Your AI Supply Chain Is Already Compromised. Here's What Runtime Identity Changes.

What the Anthropic Mythos Finding Actually Means

Sources

How AI Agents Amplify OSS Supply Chain Risk

Three Supply Chain Incidents That Reframed the Threat Model

Sources

Sources

Sources

Why Runtime Identity Is the Containment Layer

How RuntimeAI Governs the AI Supply Chain Attack Surface

Kill Switch: Stopping Supply Chain Execution Mid-Flight

⚡ RuntimeAI Kill Switch — 3-Level Supply Chain Response

RuntimeAI's Supply Chain Containment Stack

Govern the AI Supply Chain at Runtime

23,000 Vulns in 1,000 OSS Projects.
Your AI Supply Chain Is Already Compromised. Here's What Runtime Identity Changes.