High-Speed Large Language Model Serving on PCs with Consumer-Grade GPUs

github.com

GitHub - SJTU-IPADS/PowerInfer: High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
