Free Open-Source Artificial Intelligence @lemmy.world Blaed @lemmy.world 2y ago

How to Train / Fine-Tune LLMs on Your Own Dataset

Hello everyone - I have some YouTube resources I'm excited to share with you today.

Below you will find two videos on how to train/fine-tune commercially viable free and open-source AI/LLMs on your own dataset (Falcon & Llama-2).

I will soon be going through both of these workflows and sharing my results here.

AI Jason's Falcon-7B Training & Fine-Tuning Tutorial

https://www.youtube.com/watch?v=Q9zv369Ggfk&t=223s&ab_channel=AIJason

PromptEngineering's Llama-2-7B Training & Fine-Tuning Tutorial

https://www.youtube.com/watch?v=LslC2nKEEGU&ab_channel=PromptEngineering

Most of the work is done in Google Colab - a cloud collaboration notebook that can run code. If you use the free tier, it may take longer to follow along - but the workflow is there.

Depending on your dataset and goals, it might be worth considering an upgrade to your plan when you're ready to train a final version.

If you end up publishing a model, don't be afraid to share them here!

Post your datasets too!

Update [9/23/23] This guide may be outdated! Check out the resources on the side bar or visit the HyperTech Workshop for an aggregated linkhub of resources you can explore (from training, tuning, deploying, and serving your on LLM at all stages see the 📺 YouTube section for those tutorials and amazing content creators!).

You're viewing a single thread.

2 comments

@Blaed @fosai Amazing job! I will take a look once I get it running on my local machine. Tutorial look lit 🔥
- GL, HF, and happy devving!