Obsidian: Worlds first 3B multi-modal opensource LLM.

dogesator@alien.top · 1 year ago

Obsidian: Worlds first 3B multi-modal opensource LLM.

InTheTransition@alien.top · 1 year ago

What was the task? Just curious about what I can use mini models for

toothpastespiders@alien.top · 1 year ago

Creating alpaca formatted json data from big blocks of text that often have a lot of garbage in it. The untrained orca 3b model wasn’t able to stick to the format if I provided it as an example in the instructions. But it did great with it after training on a small dataset of about 100 examples or so.

It’s still a bit early to call it a total success since I’ve only ran it through a handful of tests on similar blocks of text. But just the fact that it’s grabbing facts from the text and correctly formulating prompts around it is really impressive to me. 13b trained on the same data set is, unsurprisingly, still quite a bit better. But 3b’s still doing far far better than I would have thought possible. It’d be really cool to get a little scraping pipe going with next to no resource use.