I have to admit, PDF parsing being such a hot and profitable topic in computer science was really something I never saw coming.
PDFs? The things you can select text from? And when not, there's decent OCR? And when not, you just ask the person to send you an email or a word doc?
It sounds like LLMs are looking for a new unpolluted source of historical data that they can learn from, and this source exists in the form of old scanned-in paper documents. That's the only reason I can fathom as to why this is such a big thing now.
Every time I try to convert a PDF to epub or something, or OCR one that doesn't actually have selectable text, it turns out shit. I assume the real reason people would want to get LLMs involved is that there is actually a lot of ambiguity in what a correct conversion would be, and there are a lot of PDFs out there.
This is that special blend of Tablet Kid "I don't need to know things I can google them" and Rich Kid "I don't need to do things I can crowdsource them" that makes for that Distinctively VP "I don't know what I'm doing and nobody can tell 👈😎👉"
That was my thought. Young kids fresh out of school are really easy to manipulate into delusions of grandeur, especially when said delusions are offered by the richest person in the world. He's gonna leave them out for the wolves.
Either that or Musk himself is truly so incompetent he thinks these kids are true geniuses. Honestly, with how things are going, that's a fiddy-fiddy chance, because Musk is somehow almost as unbelievably stupid as Trump.
Imagine getting a job like this and now half the nation knows your name...thats terrifying. being an intern may mean you have no idea of the true scope of what they are asking you to do.
Yeah, seems that’s the point. Old enough to competently perform what they’re told, but too young to realize the gravity of the situation and how wrong it is to partake in it.
We know that his dad is an engineering professor at university of Nebraska too. Really calls into question his credentials. I checked the other day and they had already removed his contact info from their website.
For context, this is the guy who figured out how to see what's written on some ancient Greek Scrolls without destroying them. It seems slightly far-fetched that he wouldn't know better.
Yeah, I don’t really get this one. The class clown is the kid who recognizes a function of a tool, correctly at that. Unlike a dipshit lawyer who let it hallucinate bogus case law. Hilarious.