[Paper] Learning to Generate Better Than Your LLM
[Paper] Learning to Generate Better Than Your LLM
arxiv.org /abs/2306.11816
I was looking through papers that combine LLMs and RL and this was pretty fascinating and the citations are perfect for continuing my search.
0
comments