A new research paper laid out ways in which AI developers should try and avoid showing LLMs have been trained on copyrighted material.
OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling's Harry Potter series::A new research paper laid out ways in which AI developers should try and avoid showing LLMs have been trained on copyrighted material.
Why are people defending a massive corporation that admits it is attempting to create something that will give them unparalleled power if they are successful?
Mostly because fuck corporations trying to milk their copyright. I have no particular love for OpenAI (though I do like their product), but I do have great distain for already-successful corporations that would hold back the progress of humanity because they didn't get paid (again).
In the United States there was a judgement made the other day saying that works created soley by AI are not copyright-able. So that that would put a speed bumb there.
I may have misunderstood what you though.
you cannot copyright a work created by an AI, such as a piece of art.
That's what I said. Copyright infringement is when there is another copyrightable object that is copy of first object. AI is not witin copyright area. You can't copyright it, but also you can't be sued for copyright infringement too.
if you tell it to draw you a donkey carting avocados, the picture can be used by anyone from what I understand.
Yes. Same for Public Domain, but PD is another status. PD applies only to copyrightable work.
Yeah, they might not copyright it, but after it becomes the 'one true AI', it will be at the hands of Microsoft, so please do not act friendly towards them.
It will turn on you just like every private company has.
(don't mean specifically you, but everyone generally)
That's my entire point. It's not who, but how long.
Also Microsoft plays both sides here. OpenAI vs copyright is wrong question. There's more: both are status-quo. Both are for keeping corporate ownership of ideas.
There's a massive difference though between corporations milking copyright and authors/musicians/artists wanting their copyright respected. All I see here is a corporation milking copyrighted works by creative individuals.
An LLM is not a person, it is a product. It doesn't matter that it "learns" like a human - at the end of the day, it is a product created by a corporation that used other people's work, with the capacity to disrupt the market that those folks' work competes in.
The reasoning that claims training a generative model is infringing IP would still mean a robot going into a library with a card it has to optically read all the books there to create the same generative model would still be infringing IP.
Humans can judge information make decisions on it and adapt it. AI mostly just looks at what is statistically what is most likely based on training data. If 1 piece of data exists, it will copy, not paraphrase. Example was from I think copilot where it just printed out the code and comments from an old game verbatim. I think Quake2. It isn't intelligence, it is statistical copying.
AI is the new fan boy following since it became official that nfts are all fucking scams. They need a new technological God to push to feel superior to everyone else...
The dream would be that they manage to make their own glorious free & open source version, so that after a brief spike in corporate profit as they fire all their writers and artists, suddenly nobody needs those corps anymore because EVERYONE gets access to the same tools - if everyone has the ability to churn out massive content without hiring anyone, that theoretically favors those who never had the capital to hire people to begin with, far more than those who did the hiring.
Of course, this stance doesn't really have an answer for any of the other problems involved in the tech, not the least of which is that there's bigger issues at play than just "content".