OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling's Harry Potter series

www.businessinsider.com OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling's Harry Potter series

A new research paper laid out ways in which AI developers should try and avoid showing LLMs have been trained on copyrighted material.

OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling's Harry Potter series::A new research paper laid out ways in which AI developers should try and avoid showing LLMs have been trained on copyrighted material.

268

AI Copyright @lemm.ee BitOneZero @ .world @lemmy.world 1y ago

OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling's Harry Potter series

www.businessinsider.com /openais-latest-chatgpt-version-hides-training-on-copyrighted-material-2023-8

You're viewing a single thread.

268 comments

Why are people defending a massive corporation that admits it is attempting to create something that will give them unparalleled power if they are successful?
- Mostly because fuck corporations trying to milk their copyright. I have no particular love for OpenAI (though I do like their product), but I do have great distain for already-successful corporations that would hold back the progress of humanity because they didn't get paid (again).
  
  But OpenAI will do the same?
  
  Perhaps, and when that happens I would be equally disdainful towards them.
  
  In the United States there was a judgement made the other day saying that works created soley by AI are not copyright-able. So that that would put a speed bumb there.
  I may have misunderstood what you though.
  
  Huh. Doesn't this means technically AI cannot do copyright infringement.
  
  Nah, it would mean that you cannot copyright a work created by an AI, such as a piece of art.
  
  E.g. if you tell it to draw you a donkey carting avocados, the picture can be used by anyone from what I understand.
  
  you cannot copyright a work created by an AI, such as a piece of art.
  
  That's what I said. Copyright infringement is when there is another copyrightable object that is copy of first object. AI is not witin copyright area. You can't copyright it, but also you can't be sued for copyright infringement too.
  
  if you tell it to draw you a donkey carting avocados, the picture can be used by anyone from what I understand.
  
  Yes. Same for Public Domain, but PD is another status. PD applies only to copyrightable work.
  
  Yeah, they might not copyright it, but after it becomes the 'one true AI', it will be at the hands of Microsoft, so please do not act friendly towards them.
  
  It will turn on you just like every private company has.
  
  (don't mean specifically you, but everyone generally)
  
  It's like argument "but new politicians will steal more" that I hear in Russia from people who protect Putin
  
  It's literally not, wtf.
  
  Do not let any private entity to get overwhelming majority on anything period.
  
  But do not kid yourself that Microsoft will let OpenAI do anything for public once it gets big enough.
  
  OpenAI is open only in name after they rolled back all the promises of being for everyone.
  
  That's my entire point. It's not who, but how long.
  
  Also Microsoft plays both sides here. OpenAI vs copyright is wrong question. There's more: both are status-quo. Both are for keeping corporate ownership of ideas.
  
  There's a massive difference though between corporations milking copyright and authors/musicians/artists wanting their copyright respected. All I see here is a corporation milking copyrighted works by creative individuals.
- Because everyone learns from books, it's stupid.
  
  An LLM is not a person, it is a product. It doesn't matter that it "learns" like a human - at the end of the day, it is a product created by a corporation that used other people's work, with the capacity to disrupt the market that those folks' work competes in.
  
  How are we going to make ai, if it can't learn?
  
  First, we don’t have to make AI.
  
  Second, it’s not about it being unable to learn, it’s about the fact that they aren’t paying the people who are teaching it.
  
  Then give the AI a library card, feel better?
  
  The reasoning that claims training a generative model is infringing IP would still mean a robot going into a library with a card it has to optically read all the books there to create the same generative model would still be infringing IP.
  
  Humans can judge information make decisions on it and adapt it. AI mostly just looks at what is statistically what is most likely based on training data. If 1 piece of data exists, it will copy, not paraphrase. Example was from I think copilot where it just printed out the code and comments from an old game verbatim. I think Quake2. It isn't intelligence, it is statistical copying.
  
  Well, mathematics cannot be copyrighted. In most countries at least.
- Because ultimately, it's about the truth of things, and not what team is winning or losing.
- AI is the new fan boy following since it became official that nfts are all fucking scams. They need a new technological God to push to feel superior to everyone else...
  
  Are you ok? You seem upset
- The dream would be that they manage to make their own glorious free & open source version, so that after a brief spike in corporate profit as they fire all their writers and artists, suddenly nobody needs those corps anymore because EVERYONE gets access to the same tools - if everyone has the ability to churn out massive content without hiring anyone, that theoretically favors those who never had the capital to hire people to begin with, far more than those who did the hiring.
  
  Of course, this stance doesn't really have an answer for any of the other problems involved in the tech, not the least of which is that there's bigger issues at play than just "content".
- Leftists hating on AI while dreaming of post-scarcity will never not be funny

268 comments