Transformers struggle with generalizing tasks beyond pre-training data
arxiv.org/abs/2311.00871
There is a discussion on Hacker News, but feel free to comment here as well.
That and Starscream is being very uncooperative