Skip Navigation
Hacker News @lemmy.smeargle.fans

RedPajama v2 Open Dataset with 30T Tokens for Training LLMs

together.ai

RedPajama-Data-v2: an Open Dataset with 30 Trillion Tokens for Training Large Language Models — Together AI

0 comments

No comments