>can't meaningfully see and interact with the page like the end user will
Isn't this a great use case for LLM tests? Have a "computer use agent" and describe the test declaratively: "load the page, then navigate to bar, expect foo to happen". You don't need the LLM to generate a Puppeteer (or similar) test that's coupled to the specific DOM; you just describe what should happen.
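A minimal sketch of what that could look like: the spec format, the `UITest` dataclass, and the prompt rendering are all hypothetical, not any real framework's API. The point is that the test stays declarative and the agent decides how to interact with the page.

```python
# Hypothetical sketch of a declarative UI test handed to a computer-use
# agent. Nothing here is a real library; it only illustrates the idea of
# describing *what* should happen rather than scripting DOM selectors.
from dataclasses import dataclass, field

@dataclass
class UITest:
    name: str
    steps: list[str] = field(default_factory=list)  # plain-language actions
    expectation: str = ""                            # plain-language check

nav_test = UITest(
    name="navigate to bar",
    steps=["load the page", "navigate to bar"],
    expectation="foo happens",
)

def to_prompt(test: UITest) -> str:
    """Render the spec as an instruction for the agent.
    No selectors, no Puppeteer -- the agent figures out the clicks."""
    steps = "; then ".join(test.steps)
    return f"{steps}. Verify that {test.expectation}. Answer PASS or FAIL."

print(to_prompt(nav_test))
```

The same spec keeps working after a redesign, since nothing in it references the DOM.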
Depends. Does it represent end users well enough? Does it hit the same edge cases as a million users would (especially given the poor behavioral variety of heavily post-trained models)? Does it generalize?
I got an agent to use the Windows UIA with success as a feedback loop, and it took the code from not working very well to basically done overnight. But without the MCP providing good feedback and tagged/ID'd buttons and so on, the computer use was just garbage.