They can work really well if you put sufficient upfront engineering into your ar...

pron · 2026-04-13T16:34:05 1776098045

They don't work really well even on relatively small things and even with a virtually impractical upfront engineering: https://news.ycombinator.com/item?id=47752626

They just make a lot of mistakes that compound and they don't identify. They currently need to be very closely supervised if you want the codebase to continue to evolve for any significant amount of time. They do work well when you detect their mistakes and tell them to revert.

latentsea · 2026-04-15T01:34:39 1776216879

We've done the impractical upfront engineering, and they're working well for us :)

pron · 2026-04-17T15:16:03 1776438963

Since Anthropic weren't able to make them work even for something as simple and familiar as a C compiler then I would guess that:

1. You're supervising the agents closely, or

2. Your projects are very simple - simpler even than a C compiler, or,

3. They're not really working well; the catastrophic problems just haven't surfaced yet.