Any way to prune LLMs?
I don't know about pruning, but you could try GGML (llama.cpp). It supports quantization down to 2 bits, so that might make the model small enough.
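To get a rough feel for what lower bit widths buy you, here is a back-of-the-envelope sketch. The function `approx_size_gib` and the 7-billion-parameter count are just assumptions for illustration, not anything from llama.cpp itself. It only counts the weights; real quantized formats also store extra data such as per-block scales, so actual files will be somewhat larger.

```python
# Rough estimate of weight storage at different bit widths.
# Hypothetical helper, not part of GGML or llama.cpp.

def approx_size_gib(n_params: float, bits_per_weight: float) -> float:
    """Return the approximate size in GiB of n_params weights
    stored at bits_per_weight bits each (weights only, no overhead)."""
    total_bytes = n_params * bits_per_weight / 8
    return total_bytes / 2**30


if __name__ == "__main__":
    n_params = 7e9  # assumed model size, for illustration only
    for bits in (16, 8, 4, 2):
        size = approx_size_gib(n_params, bits)
        print(f"{bits:>2}-bit: ~{size:.2f} GiB")
```

So going from 16-bit to 2-bit weights cuts the weight storage by roughly a factor of eight, though quality usually drops as you go lower.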
minipasila @lemmy.fmhy.ml