Your Bluesky Posts Are Probably In A Bunch of AI Datasets Now [404 Media]
After a machine learning librarian released and then deleted a dataset of one million Bluesky posts, several other, even bigger datasets have appeared in its place—including one of almost 300 million non-anonymized posts.
You're viewing a single thread.
Well yeah they are public? Lemmy is indexed by Google. I imagine everything on here is as well.