
TL;DR
Filippo Valsorda argues that LLMs have ended the era of treating security researchers with kid gloves. When anyone can discover vulnerabilities with an AI, the old coordinated disclosure model breaks down.
Filippo Valsorda, the cryptography engineer behind the Go cryptography standard library and age encryption, published an essay this week arguing that vulnerability reports no longer deserve special treatment. The reason: LLMs can now find vulnerabilities "as good as almost any security researcher."
The post sparked a 170+ comment discussion on Hacker News, with maintainers, security researchers, and vendors all weighing in on whether the old coordinated disclosure model still makes sense in 2026.
Valsorda's thesis is straightforward. Vulnerability reports historically received privileged treatment because security researchers provided two scarce resources:
In exchange, maintainers offered responsiveness and public credit.
Both sides of this bargain have eroded. LLMs can now perform vulnerability discovery at scale. The insight isn't scarce anymore. And confidentiality matters less when any attacker can run the same LLM analysis independently and find the same bugs.
"The insight is not scarce and precious anymore," Valsorda writes. If an AI found the vulnerability, there's no reason to assume attackers haven't already found it too.
The practical conclusion: maintainers should prioritize rapid triage and remediation over courteous researcher communication. Implement LLM-based scanning in your CI/CD pipeline. Reserve special treatment only for unusually severe cases or highly-trusted sources.
The thread surfaces both validation of the problem and pushback on the solution.
Maintainers confirm the spam problem is real. One maintainer of a vulnerability disclosure program reports that submissions went from 5 per month to 5 per day since January 2026. "These are clearly AI-generated and extremely low quality (albeit well-written). The rules of the program aren't read." They're considering shutting down the program entirely.
Dependabot fatigue compounds the issue. Several developers describe getting 100+ vulnerability alerts per week, mostly for dev dependencies or issues that don't affect their actual attack surface. "Half of them for dev dependencies," one writes. The signal-to-noise ratio has collapsed.
ReDoS is the poster child for broken scoring. Multiple commenters point to regex denial-of-service vulnerabilities that get marked as 10/10 severity despite being in build-time code that never sees untrusted input. "We got 116 github dependabot alerts this week. Half of them for dev dependencies."
The payment friction idea emerges. One commenter suggests requiring a small payment to submit vulnerability reports, refunded on valid findings. This triggers immediate pushback: "Why would anyone pay money to have a chance of being arrested?" The legal risks of security research already create friction - adding financial friction could discourage legitimate researchers entirely.
Supply chain concerns complicate the dev-dependency dismissal. Several commenters note that dev dependencies are still attack vectors - SolarWinds was compromised through its build tooling. "Developer's machines and cicd systems are high value targets." Dismissing dev dependency alerts entirely isn't risk-free.
Some question the AI capability claim. Not everyone agrees that LLMs can find vulnerabilities as well as skilled researchers. The counterargument: AI-generated reports are mostly garbage, suggesting the discovery capability isn't actually that strong. Valsorda's framing may overstate where we are today while being correct about the trajectory.
Read the full discussion at Hacker News.
Get the weekly deep dive
Tutorials on Claude Code, AI agents, and dev tools - delivered free every week.
From the archive
Jun 23, 2026 • 7 min read
Jun 23, 2026 • 8 min read
Jun 23, 2026 • 8 min read
Jun 23, 2026 • 8 min read
If you maintain open source software or run a vulnerability disclosure program, this shift creates practical problems:
Triage becomes the bottleneck. When anyone can generate plausible-looking vulnerability reports, filtering real issues from AI-generated noise becomes the core challenge. Quality scoring, source reputation, and automated validation become more important than manual review of every submission.
The AI-found vulnerability paradox. If AI can find a bug, assume adversaries have already found it. This changes disclosure timelines - you may want to patch faster and skip the courtesy dance.
Bug bounty economics shift. Programs that pay per valid bug create incentives for volume submissions. Expect more platforms to adopt filtering mechanisms like video reproduction requirements, reputation gating, or even the controversial payment friction model.
Run your own scans. If LLMs can find your vulnerabilities, you should be running those scans yourself before researchers (or attackers) do. Integrate security scanning into CI/CD rather than relying on external reports.
Dev dependency alerts still matter, sometimes. Don't dismiss all dev dependency vulnerabilities, but do context-aware triage. A ReDoS in your test framework is different from malicious code in a build tool.
Valsorda's essay is part of a larger pattern: AI commoditizing expertise-based workflows. Security research joins code review, penetration testing, and other traditionally specialized domains where AI tools are compressing the skill curve.
This doesn't mean security researchers are obsolete. The hardest vulnerabilities - novel attack classes, complex chains, hardware-level exploits - still require human expertise. But the long tail of straightforward vulnerability discovery is increasingly automatable.
For maintainers, this means the volume of incoming reports will keep growing while the average quality drops. The workflows designed for a world of scarce, thoughtful security researchers need to adapt to a world of abundant, mechanical scanning.
The old model assumed vulnerability reporters were partners deserving special treatment. The new model may need to assume they're noise until proven otherwise - and design systems accordingly.
Read next
New research from MIT reveals that LLMs identify speakers by writing style, not by tags - meaning attackers who sound like the system effectively become the system. The findings explain why prompt injection remains unsolved.
7 min readModern LLMs now use MoE routing, mixed attention variants, and fused vision encoders. The simple transformer stack is gone - here's what replaced it and why it matters for developers.
6 min readNew benchmark data shows GPT-5.5 hallucinates 86% of the time when it does not know the answer - versus 28% for the open-weights GLM-5.2. The numbers challenge the assumption that bigger models equal more reliable output.
6 min readTechnical content at the intersection of AI and development. Building with AI agents, Claude Code, and modern dev tools - then showing you exactly how it works.
DeepSeek's reasoning-first model built for agents. First model to integrate thinking directly into tool use. Ships along...
View ToolConstrained generation library for LLMs. Uses finite state machines to mask invalid tokens during generation. Guarantees...
View ToolThe easiest way to run LLMs locally. One command to pull and run any model. OpenAI-compatible API. 52M+ monthly download...
View ToolDesktop app for discovering, downloading, and running local LLMs. Clean chat UI, OpenAI-compatible API server, and autom...
View ToolTurn product knowledge into browser QA plans, executable checklists, and release reports.
View AppDocument API key ownership, rotation context, and integration notes without storing secrets.
View AppGenerate polished food photos, social posts, specials, menus, and flyers from a menu item.
View AppA complete, citation-backed Claude Code course with setup, prompting systems, MCP, CI, security, cost controls, and capstone workflows.
ai-developmentInstall the dd CLI and scaffold your first AI-powered app in under a minute.
Getting StartedConfigure Claude Code for maximum productivity -- CLAUDE.md, sub-agents, MCP servers, and autonomous workflows.
AI Agents
Modern LLMs now use MoE routing, mixed attention variants, and fused vision encoders. The simple transformer stack is go...

Unsloth's dynamic quantization makes GLM-5.2 runnable on a 256GB Mac or a 24GB GPU with CPU offloading. Here is the hard...

Baidu releases Unlimited OCR, an open-source vision-language model that parses 100+ page documents in a single pass with...

A new paper shows a 3B parameter model hitting 94.3 on AIME26 and 96.1% on LeetCode contests - matching or exceeding mod...

Switzerland's fully open foundation model promises transparent training data and EU compliance. The HN crowd has questio...

New research from MIT reveals that LLMs identify speakers by writing style, not by tags - meaning attackers who sound li...

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.