bellman-7b - a Swedish llama2 finetune

neph1010@alien.top · 2 years ago

bellman-7b - a Swedish llama2 finetune

ZookeepergameCool173@alien.top · 2 years ago

Try to fine tune a 13b model instead, which has a way better command of Swedish than the 7B. And in my experience tends to have less issues with becoming repetitive etc.

neph1010@alien.top · 2 years ago

I will. I also like 13b models. They seem like the perfect balance for us gpu starved people. But I’d rather fail some on 7b models first, since it’s quicker to iterate on them.

fetballe@alien.top · 2 years ago

Thanks! Can you also make a 13B version?

Acceptable_Can5509@alien.top · 2 years ago

Can you share the colab so others can look at how it was done?

neph1010@alien.top · 2 years ago

I used the colab template from this post: https://maximelabonne.substack.com/p/fine-tune-your-own-llama-2-model-in-a-colab-notebook-df9823a04a32

https://colab.research.google.com/drive/1PEQyJO1-f6j0S_XJ8DV50NkpzasXkrzd?usp=sharing

Specifically because it could be run on the free tier. But that’s not possible for any llama2 models, just some.