Skip Navigation
A Nerdy Dystopia @sh.itjust.works Gadg8eer @sh.itjust.works

AI Companies Running Out of Training Data After Burning Through Entire Internet

futurism.com AI Companies Running Out of Training Data After Burning Through Entire Internet

AI companies are swiftly running into a massive problem: there isn't enough data on the internet to train the next generation of models.

AI Companies Running Out of Training Data After Burning Through Entire Internet
2
2 comments
  • ...... the whole, entire internet?!?!

    • That, or at least the public, http part of it. I doubt they would be willing to risk exposing it to the Tor network or that Gemini would be sufficiently large to matter.

      ...maybe the fediverse is changing that. Maybe. I don't know how big we are compared to reddit, but I have to question if we would ever actually matter in LLM training.