The technique Anthropic uses was demonstrated by Nicholas Carlini in a talk he gave two weeks ago, and it's very simple: when asking LLMs to review code, ask them to focus their review on one file per session. Here is the video with the timestamp (watch through to ~5:30; they show two different ways of prompting Claude).
IMO the big "innovation" being shown by Mythos is the effectiveness of prompting LLMs to look for security vulnerabilities by focusing on specific files one at a time, and of automating this prompting with a simple script.
Prompting Mythos to focus on a single file per session is why I suspect it cost Anthropic $20k to find some of the bugs in these codebases. I know the same technique is effective with Opus 4.6 and GPT 5.4 because I've been using it on my own code. If you just ask the agent to review your PR with a low-effort prompt, it is not exhaustive: it will not actually read each changed file and look at how it interacts with the system as a whole. If the entire session is dedicated to reviewing the changes in a single file, the LLM will do much more work reviewing it.
Edit: I changed my phrasing. It's not about restricting the model's entire context to one file, but about focusing it on one file while still allowing it to look at how other files interact with that file.
Instead of asking the model, "Here's this codebase, report any vulnerability," you ask, "Here's this codebase, report any vulnerability in module\main.c."
The model can still explore references and other files inside the codebase, but you start a new context/session for each file in the codebase.
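The automation described above can be sketched as a small loop: one fresh session per file, with the same focused prompt each time. This is a minimal sketch, not the actual script; the `agent-cli` command and its `--new-session` flag are placeholders for whatever CLI or API starts a fresh LLM session.

```python
import subprocess
from pathlib import Path

# One focused prompt per file; the model may still read neighbors.
REVIEW_PROMPT = (
    "Here's this codebase. Report any security vulnerability in {file}. "
    "You may read other files to see how they interact with it."
)

def per_file_prompts(repo_root, extensions=(".c", ".h")):
    """Yield (path, prompt) pairs, one per source file.

    Each prompt is intended to run in its own fresh session, so the
    model's entire effort stays focused on that single file.
    """
    root = Path(repo_root)
    for path in sorted(root.rglob("*")):
        if path.suffix in extensions:
            yield path, REVIEW_PROMPT.format(file=path.relative_to(root))

def review(repo_root):
    # Placeholder invocation: substitute your agent's real CLI here.
    for path, prompt in per_file_prompts(repo_root):
        subprocess.run(["agent-cli", "--new-session", prompt], cwd=repo_root)
```

The key design choice is that the loop, not the model, owns the iteration over files: the model never has to budget its own attention across the whole codebase.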
Honestly, that's the only way I've ever been able to trust the output. Once you go beyond the scope of one file it really degrades. But within a single file I've seen amazing results.
Are you not supposed to include as many _preconditions_ (in the form of test cases or function constraints, like "assert" macros in C) as you can in your prompt, describing the input for a particular program file, before asking the AI to analyze the file?
Please read my reply to one of the authors of Angr, a binary analysis tool. Here is an excerpt:
> A "brute-force" algorithm (an exhaustive search, in other words) is the easiest way to find an answer to almost any engineering problem. But it often must be optimized before being computed. The optimization may be done by an AI agent based on neural nets, or a learning Mealy machine.
> It would be interesting to know which is more efficient: neural nets or a learning Mealy machine.
...Then I describe what a learning Mealy machine is. And then:
> Some interesting engineering (and scientific) problems are:
> - finding an input for a program that hacks it;
> - finding machine code for a controller of a bipedal robot, which makes it able to work in factories.
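For reference, a Mealy machine is a finite-state machine whose output depends on both the current state and the input symbol. A minimal sketch of the plain (non-learning) structure, with an illustrative transition table that is my own example, not from the thread:

```python
class MealyMachine:
    """Finite-state machine whose output is a function of (state, input)."""

    def __init__(self, initial, transitions):
        # transitions maps (state, symbol) -> (next_state, output)
        self.state = initial
        self.transitions = transitions

    def step(self, symbol):
        self.state, output = self.transitions[(self.state, symbol)]
        return output

    def run(self, symbols):
        return [self.step(s) for s in symbols]

# Example table: output 1 exactly when the input bit differs from the
# previous one (state remembers the last bit seen; s0 = last bit was 0).
edge_detector = MealyMachine("s0", {
    ("s0", 0): ("s0", 0), ("s0", 1): ("s1", 1),
    ("s1", 0): ("s0", 1), ("s1", 1): ("s1", 0),
})
```

A *learning* Mealy machine, as discussed in the thread, would infer a transition table like this one from queries against the target system (for example via Angluin-style L* learning) rather than having it written by hand.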
https://youtu.be/1sd26pWhfmg?t=204
https://youtu.be/1sd26pWhfmg?t=273