trave @ trave @lemmy.sdf.org

Posts

1
Comments

3
Joined

5 mo. ago

2d ago

what's the best model these days I could fit in 128gb ram?

oh I didn't realize I could use llamacpp with openwebui. I recall reading something about how ollama was somehow becoming less FOSS so I'm inclined to use llamacpp. Plus I want to be able to more easily use sharded ggufs. You have a guide for setting up llamacpp with openwebui?

I somehow hadn't heard of GLM 4.5 Air, I'll take a look thanks!

2d ago

what's the best model these days I could fit in 128gb ram?

Jump

some coding yeah but also want one that's just good 'general purpose' chat.

Not sure how much context... from what I've heard models kinda break down at super large context anyway? Though I'd love to have as large of a functional context as possible. I guess it's somewhat a tradeoff in ram usage as the context all gets loaded into memory?