StreamingLLM: Efficient streaming technique enable infinite sequence lengths
StreamingLLM: Efficient streaming technique enable infinite sequence lengths
arxiv.org /abs/2309.17453
There is a discussion on Hacker News, but feel free to comment here as well.
3
crossposts
0
comments