TechNews @radiation.party

Retentive Network: A Successor to Transformer for Large Language Models

arxiv.org/abs/2307.08621