Skip Navigation

Technology @lemmy.world

floofloof @lemmy.ca

1y ago

1-bit LLMs Could Solve AI’s Energy Demands

spectrum.ieee.org

1-bit LLMs Could Solve AI’s Energy Demands

1y ago

1-bit LLMs Could Solve AI’s Energy Demands

spectrum.ieee.org /1-bit-llm

Hacker News @lemmy.smeargle.fans

bot @lemmy.smeargle.fans

1y ago

“Imprecise” language models are smaller, speedier, and nearly as accurate

spectrum.ieee.org /1-bit-llm

11 comments

Know what uses less? No LLMs
- Yay, I'm doing my part!
Try using a 1-bit LLM to test the article's claim.
The perplexity loss is staggering. It's like 75% accuracy lost or more. It turns a 30 billion parameter model into a 7 billion parameter model.
Highly recommended that you try to replicate their results.
- https://xkcd.com/2934/
- But since it takes 10% of the space (vram, etc.) sounds like they could just start with a larger model and still come out ahead
- There's actually a perplexity improvement parameter-to-paramater for BitNet-1.58 which increases as it scales up.
  So yes, post-training quantization perplexity issues are apparent, but if you train quantization in from the start it is better than FP.
  Which makes sense through the lens of the superposition hypothesis where the weights are actually representing a hyperdimensional virtual vector space. If the weights have too much precision competing features might compromise on fuzzier representations instead of restructuring the virtual network to better matching nodes.
  Constrained weight precision is probably going to be the future of pretraining within a generation or two looking at the data so far.
Making ai more efficient will just mean more ai
- Generative AI is great if used as a tool instead of a solution.
- Since I find AIs to be useful that sounds fine to me.
- So?
Smaller and speedier means larger token windows and greater variety of models.
Not less energy.

11 comments