A lawsuit claims OpenAI stole 'massive amounts of personal data,' including medical records and information about children, to train ChatGPT
The lawsuit alleges OpenAI crawled the web to amass huge amounts of data without people's permission.
Scraping social media posts and Reddit posts doesn’t sound like stealing; they’re public posts.
I doubt it’s only about some Reddit posts. The scraping was done on the whole web, capturing everything it could. So besides stealing data and presenting it as its own, it seems to have collected some even more problematic data which wasn’t properly protected.
But that really isn't OpenAI's fault. Whoever was in charge of securing the patients' data really fucked up.
If it was unsecured, it's basically public. Whoever put that data on a publicly accessible server is at fault.
Just because something is posted online doesn't mean it can be taken and resold. Copyright law prevents that. Of course, copyright law and generative AI is a new gray area.
This is not just scraping, though. It is also using that data to create other content and potentially to republish that data (we have no way of knowing whether ChatGPT will spit out any of it, or where it took what it spits out from).
The expectation that social media data will be read by anybody is fair, but the fact is that the data has been written to be read, not to be resold and published elsewhere too.
It is similar for blog articles. My blog is public and anybody can read it, but that data is not there to be repackaged and sold. The fact that something is public does not mean I can do whatever I want with it.
I could read your blog post and write my own blog post, using yours as inspiration. I could quote your post, add a link back to your blog post and even add affiliate links to my blog post. I could be hired to do something like that for the whole day