higgsfield_ai@alien.topB to Machine Learning@academy.gardenEnglish · 2 years ago

[P] Higgsfield.AI – Anyone can train Llama 70B or Mistral for free

1

[P] Higgsfield.AI – Anyone can train Llama 70B or Mistral for free

higgsfield_ai@alien.topB to Machine Learning@academy.gardenEnglish · 2 years ago

https://higgsfield.ai
We have a massive GPU cluster and developed our own infrastructure to manage the cluster and train massive models.

There’s how it works:

You upload the dataset with preconfigured format into HuggingFaсe [1].
Choose your LLM (e.g. LLaMa 70B, Mistral 7B)
Place your submission into the queue
Wait for it to get trained.
Then you get your trained model there on HuggingFace.

Essentially, why would we want to do it?

We already have an experience with training big LLMs.
We could achieve near-perfect infrastructure performance for training.
Sometimes GPUs have just nothing to train.

Thus we thought it would be cool if we could utilize our GPU cluster 100%. And give back to Open Source community (already built an e2e distributed training framework [2]).

This is in an early stage, so you can expect some bugs.

Any thoughts, opinions, or ideas are quite welcome!

[1]: https://github.com/higgsfield-ai/higgsfield/blob/main/tutori…

[2]: https://github.com/higgsfield-ai/higgsfield

Chat

higgsfield_ai@alien.topOPB
link
fedilink
English
arrow-up
1·
2 years ago
From our experience, to get a very good results you need

High quality dataset. It’s worth to spend more time on data cleaning. It’s way better to have a smaller dataset with high quality points than a huge dataset with garbage.

You need to fully finetune it.

Machine Learning@academy.garden

machinelearning@academy.garden

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !machinelearning@academy.garden

Community Rules:

Be nice. No offensive behavior, insults or attacks: we encourage a diverse community in which members feel safe and have a voice.
Make your post clear and comprehensive: posts that lack insight or effort will be removed. (ex: questions which are easily googled)
Beginner or career related questions go elsewhere. This community is focused in discussion of research and new projects that advance the state-of-the-art.
Limit self-promotion. Comments and posts should be first and foremost about topics of interest to ML observers and practitioners. Limited self-promotion is tolerated, but the sub is not here as merely a source for free advertisement. Such posts will be removed at the discretion of the mods.

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

1 user / day
1 user / week
1 user / month
1 user / 6 months
1 local subscriber
1 subscriber
786 Posts
3.03K Comments
Modlog

mods:
communick@academy.garden