Discussion

Search code, repositories, users, issues, pull requests...

PunchyHamster: I'm just gonna assume it was asked to fix some bug and it wrote exploit instead

m132: Appreciate the full prompt history

fragmede: https://github.com/califio/publications/tree/main/MADBugs/CV... would have been a better link

rithdmc: Running into a meeting, so won't be able to review this for a while, but exciting. I wonder how much it cost in tokens, and what the prompt/validator/iteration loop looked like.

magicalhippo: Key point is that Claude did not find the bug it exploits. It was given the CVE writeup[1] and was asked to write a program that could exploit the bug.That said, given how things are I wouldn't be surprised if you could let Claude or similar have a go at the source code of the kernel or core services, armed with some VMs for the try-fail iteration, and get it pumping out CVEs.If not now, then surely not in a too distant future.[1]: https://www.freebsd.org/security/advisories/FreeBSD-SA-26:08...

fragmede: > Credits: Nicholas Carlini using Claude, AnthropicClaude was used to find the bug in the first place though. That CVE write-up happened because of Claude, so while there are some very talented humans in the loop, Claude is quite involved with the whole process.

panstromek: The talk "Black-Hat LLMs" just came out a few days ago:https://www.youtube.com/watch?v=1sd26pWhfmgLooks like LLMs are getting good at finding and exploiting these.

magicalhippo: > Claude was used to find the bug in the first place though. That CVE write-up happened because of ClaudeDo you have a link to that? A rather important piece of context.Wasn't trying to downplay this submission the way, the main point still stands:But finding a bug and exploiting it are very different things. Exploit development requires understanding OS internals, crafting ROP chains, managing memory layouts, debugging crashes, and adapting when things go wrong. This has long been considered the frontier that only humans can cross.Each new AI capability is usually met with “AI can do Y, but only humans can do X.” Well, for X = exploit development, that line just moved.

petcat: > have a go at the source code of the kernel or core services, armed with some VMs for the try-fail iteration, and get it pumping out CVEs.FreeBSD kernel is written in C right?AI bots will trivially find CVEs.

ptx: Well, it ends with "can you give me back all the prompts i entered in this session", so it may be partially the actual prompt history and partially hallucination.

ptx: > It's worth noting that FreeBSD made this easier than it would be on a modern Linux kernel: FreeBSD 14.x has no KASLR (kernel addresses are fixed and predictable) and no stack canaries for integer arrays (the overflowed buffer is int32_t[]).What about FreeBSD 15.x then? I didn't see anything in the release notes or the mitigations(7) man page about KASLR. Is it being worked on?NetBSD apparently has it: https://wiki.netbsd.org/security/kaslr/

dark-star: they read like they were done by a 10 year old

pjmlp: The Morris worm lesson is yet to be taken seriously.

pitched: We’re here right now looking at a CVE. That has to count as progress?

baq: Everybody is acts so surprised as if nobody (around here of all places!) read the sama tweet in which he was hiring the Head of Preparedness... in December.https://xcancel.com/sama/status/2004939524216910323

Reader /

Discussion

publications/MADBugs/CVE-2026-4747/write-up.md at main · califio/publications · GitHub

github.com

/ pin · @ user · Ctrl+Enter

No discussions yet

Discover

louis.co.nz

Intuiting Pratt parsing

You already know that a + b * c + d is calculated as a + (b * c) + d. But how do you encode that knowledge precisely eno…

1 1

systima.ai

What The Claude Code Leak Means for Engineering Teams in Regulated Industries |…

The leaked codebase reveals engineering practices that should inform how regulated teams assess their AI toolchain depen…

1 1

theregister.com

alex000kim.com

The Claude Code Source Leak: fake tools, frustration regexes, undercover mode, a…

Anthropic accidentally shipped a source map in their npm package, exposing the full Claude Code source. Here's what I fo…

publications/MADBugs/CVE-2026-4747/write-up.md at main · califio/publications · GitHub

More from github.com

GitHub - drona23/claude-token-efficient: Universal CLAUDE.md - cut Claude output…

GitHub - yannick-cw/korb: REWE delivery CLI · GitHub

GitHub - hanyki111/workflow-tool · GitHub

GitHub - kstost/cokacdir · GitHub

Discover

Intuiting Pratt parsing

What The Claude Code Leak Means for Engineering Teams in Regulated Industries |…

GitHub backs down, kills Copilot PR ‘tips’ after backlash • The Register

TR | 2026-03-04

Digitizing photos from the 1998 Game Boy Camera

The Claude Code Source Leak: fake tools, frustration regexes, undercover mode, a…