Looking for resources to better understand LLMs
Looking for resources to better understand LLMs
I do not believe that LLMs are intelligent. That being said I have no fundamental understanding of how they work. I hear and often regurgitate things like "language prediction" but I want a more specific grasp of whats going on.
I've read great articles/posts about the environmental impact of LLMs, their dire economic situation, and their dumbing effects on people/companies/products. But the articles I've read that ask questions like "can AI think?" basically just go "well its just language and language isnt the same as thinking so no." I haven't been satisfied with this argument.
I guess I'm looking for something that dives deeper into that type of assertion that "LLMs are just language" with a critical lens. (I am not looking for a comprehensive lesson on technical side LLMs because I am not knowledgeable enough for that, some goldy locks zone would be great). If you guys have any resources you would recommend pls lmk thanks
I highly recommend this:
https://rti.github.io/gptvis
It explains the fundamentals of a transformer network (which all current LLMs are based on) on a super tiny, down to the basics example network, allowing you to understand what is happening within the network step by step, rather than being confronted with theoretical concepts or tonnes of linear algebra.
It's really nice and as hands on, as these things get.