Hacker News

Are they even sure the AI accessed the content that second time? LLMs are really good at making up shit. I have tested this by asking various LLMs to scrape data from my websites while watching the access logs. Many times they don't, and just rely on some sort of existing data or spout a bunch of BS. Gemini is especially bad about this. I have not used Copilot myself, but my experience with other AI makes me curious about this.


This is it. M365 uses RAG on your enterprise data that you allow it to access. It's not actually accessing the files directly in the cases he provided. It's working as intended.


If this is indeed how Copilot is architected, then it needs clear documentation -- that it is a non-audited data store.

But how then did MS "fix" this bug? Did they stop pre-ingesting, indexing, and caching the content? I doubt that.

Pushing (defaulting) organizations to feed all their data to Copilot and then not providing an audit trail of data access on that replica data store -- feels like a fundamental gap that should be caught by a security 101 checklist.


How would you audit that?


If that's the case, then as noted in the article, the 'as intended' is probably violating liability requirements around various things.


Correct. It is precisely the fact that a user can ask about someone's medical history (or whatever else) without that access being reported that would put any heavily audited system in violation. LLM summaries break the compliance model.


You control what it can and can't see. If you include PII and medical records, that's your fault, not MS's.


That’s fair - unless they’re marketing the bot as compliant.



