Outperforming larger language models with less training data and smaller models
Outperforming larger language models with less training data and smaller models

blog.research.google
Distilling step-by-step: Outperforming larger language models with less training data and smaller model sizes
