Skip Navigation

Someone Made a Dataset of One Million Bluesky Posts for 'Machine Learning Research'

www.404media.co Someone Made a Dataset of One Million Bluesky Posts for 'Machine Learning Research'

A Hugging Face employee made a huge dataset of Bluesky posts, and it’s already very popular.

Bluesky may have said it won't use user data to train generative AI, but someone else just published a dataset of million Bluesky posts for "machine learning research". Already very popular dataset, your data may be scraped

Without paywall

5