What Happened
Willy Tarreau, Lead Software Developer at HAProxy and longtime Linux kernel contributor, posted a striking data point on the kernel security mailing list: AI-generated vulnerability reports have increased the volume of incoming security disclosures by roughly 10-25x over the past two years. The list went from 2-3 reports per week two years ago, to roughly 10 per week last year, to 5-10 reports per day in early 2026. Fridays and Tuesdays appear to be peak days, possibly correlating with researcher work schedules or automated batch jobs.
The observation was quoted by Simon Willison, who has been tracking the downstream effects of LLMs on open source infrastructure. Tarreau's note is notable because it comes from someone directly processing these reports — not a researcher extrapolating from survey data.
Technical Deep Dive
Tarreau makes two distinct points that are worth separating:
Report Quality Is Holding Up
Unlike earlier waves of AI-generated noise in bug trackers, Tarreau says most of these reports are technically correct. This is a meaningful distinction. LLM-assisted static analysis and fuzzing tools — such as combinations of semgrep, CodeQL, and GPT-4-class models used for triage — are apparently producing actionable findings, not just hallucinated CVEs. The kernel team has had to bring in additional maintainers specifically to handle throughput, not to filter junk.
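To make the shape of such a pipeline concrete, here is a minimal sketch of the pattern: run a conventional static analyzer, then let an LLM act as a triage filter on its findings. Everything in it is an illustrative assumption rather than anything Tarreau described; it assumes semgrep is installed and an OpenAI API key is configured, and the `run_semgrep` and `triage_finding` helpers are hypothetical names.

```python
import json
import subprocess
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def run_semgrep(path: str) -> list[dict]:
    """Run semgrep with JSON output and return its raw findings."""
    # '--config auto' pulls rules from the semgrep registry; swap in a
    # local ruleset for offline use.
    out = subprocess.run(
        ["semgrep", "--config", "auto", "--json", path],
        capture_output=True, text=True,
    )
    return json.loads(out.stdout).get("results", [])

def triage_finding(finding: dict) -> str:
    """Ask an LLM whether a static-analysis hit looks like a real bug."""
    snippet = finding.get("extra", {}).get("lines", "")
    prompt = (
        "You are triaging static-analysis output for a C codebase.\n"
        f"Rule: {finding.get('check_id')}\n"
        f"Code:\n{snippet}\n"
        "Answer 'likely-real' or 'likely-noise' with one sentence of reasoning."
    )
    resp = client.chat.completions.create(
        model="gpt-4o",  # placeholder; any capable model works
        messages=[{"role": "user", "content": prompt}],
    )
    return resp.choices[0].message.content

for f in run_semgrep("drivers/net/"):
    print(f["check_id"], "->", triage_finding(f))
```

The design point worth noting: the LLM never generates the finding, it only classifies output from a deterministic scanner, which is one plausible reason these reports are technically correct more often than pure-LLM bug reports.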
Duplicate Reports Are a New Problem
A second emergent pattern: duplicate reports of the same vulnerability from different researchers using different tools. This is a coordination failure that didn't exist at previous report volumes. When two researchers independently feed similar kernel subsystem code to different AI-assisted scanners, they may surface the same memory safety issue via slightly different code paths. The result is redundant triage work for maintainers who must deduplicate findings before assigning CVEs.
The tooling landscape driving this likely includes:
- LLM-assisted code review: tools like GitHub Copilot, GPT-4o, or Claude being used to manually audit kernel subsystems for classic vulnerability classes (use-after-free, integer overflow, race conditions)
- AI-enhanced fuzzers: projects integrating LLMs with syzkaller or libFuzzer to generate more targeted input corpora
- Automated static analysis pipelines: CodeQL queries or semgrep rules generated or refined by LLMs to catch Linux-specific patterns
There is no single tool identified by Tarreau — the phrase "possibly slightly different tools" suggests the volume is coming from a broad ecosystem of researchers independently leveraging AI assistance rather than one coordinated project.
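As an illustration of the fuzzing pattern above, a minimal sketch of using an LLM to seed a libFuzzer-style corpus might look like the following. The prompt, model name, and target description are assumptions for illustration; real integrations (syzkaller in particular) are considerably more involved.

```python
import pathlib
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set

CORPUS_DIR = pathlib.Path("corpus")  # libFuzzer reads seed inputs from a directory
CORPUS_DIR.mkdir(exist_ok=True)

# Ask the model for inputs likely to stress a hypothetical parser under test.
resp = client.chat.completions.create(
    model="gpt-4o",  # placeholder model name
    messages=[{
        "role": "user",
        "content": (
            "Generate 5 short, adversarial example inputs for a C function that "
            "parses netlink-style length-prefixed records. Output one input per "
            "line, hex-encoded, with no other text."
        ),
    }],
)

for i, line in enumerate(resp.choices[0].message.content.splitlines()):
    line = line.strip()
    try:
        seed = bytes.fromhex(line)
    except ValueError:
        continue  # skip any prose the model mixed into its answer
    (CORPUS_DIR / f"llm_seed_{i:03d}").write_bytes(seed)
```

A harness would then be run as `./fuzz_target corpus/`, letting the fuzzer mutate the LLM-provided seeds rather than starting from an empty corpus.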
Scale Context
The Linux kernel receives on the order of 10,000+ commits per merge window. Its security surface — spanning drivers, filesystems, network stack, and memory management — is enormous. Going from 2-3 valid security reports per week to 5-10 per day is a significant operational load change, not a rounding error.
Who Should Care
Security researchers using AI tools to audit open source code should be aware that duplicate disclosure is now a real coordination problem. Before submitting a kernel security report, checking whether a similar issue has been recently disclosed — even informally — reduces maintainer load.
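As a concrete (hedged) example of that check: lore.kernel.org is served by public-inbox, whose search endpoint accepts a `q=` query. The sketch below assumes its `x=A` Atom output parameter and uses an illustrative query string; verify both against the live archive before relying on them.

```python
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

def search_lore(query: str, list_name: str = "all") -> list[str]:
    """Search a lore.kernel.org archive and return matching message titles.

    Assumes public-inbox's search syntax ('q=') and Atom output ('x=A');
    these are stated assumptions, not guaranteed parameters.
    """
    url = (
        f"https://lore.kernel.org/{list_name}/"
        f"?q={urllib.parse.quote(query)}&x=A"
    )
    with urllib.request.urlopen(url, timeout=30) as resp:
        feed = ET.parse(resp)
    ns = {"atom": "http://www.w3.org/2005/Atom"}
    return [e.findtext("atom:title", namespaces=ns) or ""
            for e in feed.getroot().findall("atom:entry", ns)]

# Illustrative query: look for prior reports in the same area before filing.
for title in search_lore("use-after-free nf_tables"):
    print(title)
```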
Open source maintainers of high-profile C/C++ codebases (not just the kernel — projects like OpenSSL, glibc, or QEMU face similar exposure) should anticipate a structural increase in inbound security reports. Triage pipelines that assume low volume will break.
Security tool developers building LLM-assisted scanners should consider deduplication and cross-researcher coordination features. A tool that checks public vulnerability databases and recent mailing list archives before generating a report would reduce noise.
Enterprise security teams evaluating AI-assisted code auditing tools can treat the kernel security list as a leading indicator: the tooling is producing real findings at scale, which supports the case for internal adoption on proprietary codebases.
What To Do This Week
- If you maintain a C/C++ open source project: audit your security report intake process. Add a deduplication step and consider a public disclosure tracker (even a simple GitHub Security Advisory draft) so researchers can check for prior art before submitting.
- If you're using AI tools for vulnerability research: before filing a report, run a search against lore.kernel.org and the oss-security mailing list to check for recent duplicate findings (the search sketch above shows one way to automate this).
- If you're evaluating AI security tooling: Tarreau's data point is useful evidence for internal ROI arguments. Valid kernel CVEs are among the hardest to find; if AI tools are surfacing them at 5-10/day, the signal-to-noise ratio on less complex codebases is likely higher.
- If you're building AI-assisted scanners: add a pre-submission check against the NVD and relevant project mailing list archives using embeddings-based similarity search before surfacing a finding to the user, as sketched below.
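A minimal sketch of that similarity check, assuming the OpenAI embeddings API and an in-memory list of prior report summaries (a real tool would populate this from NVD and archive data, and the 0.85 threshold is an arbitrary illustrative choice):

```python
import numpy as np
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set

# Stand-in for summaries pulled from the NVD and mailing list archives.
PRIOR_REPORTS = [
    "use-after-free in netfilter nf_tables set element deactivation",
    "integer overflow in ext4 extent handling leading to OOB write",
]

def embed(texts: list[str]) -> np.ndarray:
    resp = client.embeddings.create(model="text-embedding-3-small", input=texts)
    return np.array([d.embedding for d in resp.data])

def is_probable_duplicate(new_summary: str, threshold: float = 0.85) -> bool:
    """Flag a finding when its cosine similarity to any prior report is high."""
    vecs = embed(PRIOR_REPORTS + [new_summary])
    prior, new = vecs[:-1], vecs[-1]
    sims = prior @ new / (np.linalg.norm(prior, axis=1) * np.linalg.norm(new))
    return bool(sims.max() >= threshold)

if is_probable_duplicate("UAF in nf_tables when deactivating set elements"):
    print("Likely duplicate: check archives before filing.")
```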
The broader implication is structural: AI assistance is compressing the time and expertise required to find valid vulnerabilities in mature, well-audited codebases. The Linux kernel security list is one of the most rigorous in open source. The fact that it's being flooded with correct reports suggests the same dynamic is playing out quietly across hundreds of less-staffed projects.