GitHub link: https://github.com/desik1998/MathWithLLMs

Although LLMs can handle many tasks such as coding and science questions, they often fail at math without a calculator (including state-of-the-art models).

Our intuition for why models cannot do math is that examples on the internet look like a x b = c and do not show the procedure humans follow. For example, when asked to compute 123 x 45, a human applies digit-wise multiplication with carries, computes a partial product for each digit, and then adds the resulting numbers. On the internet, however, the procedure is not shown; only the final value is written. So when an LLM is trained on a x b = c, it has to reverse-engineer the multiplication algorithm.
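The digit-wise procedure described above can be sketched in a few lines of Python (illustrative only; the project's actual instruction format lives in the linked repo):

```python
def long_multiplication_steps(a: int, b: int) -> list[str]:
    """Expand a * b into the digit-wise partial products a human writes out."""
    steps = []
    total = 0
    # Multiply a by each digit of b (right to left), shifted by its place value.
    for place, digit in enumerate(reversed(str(b))):
        partial = a * int(digit) * 10 ** place
        steps.append(f"{a} x {digit} x 10^{place} = {partial}")
        total += partial
    steps.append(f"sum of partials = {total}")
    return steps

for line in long_multiplication_steps(123, 45):
    print(line)
# 123 x 5 x 10^0 = 615
# 123 x 4 x 10^1 = 4920
# sum of partials = 5535
```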

Most of the existing literature gives instructions to the LLM instead of showing the procedure, and we think this may not be the best way to teach an LLM.

What does this project do?

This project aims to show that LLMs can learn math when trained on a step-by-step procedure, similar to how humans do it, breaking the notion that LLMs cannot do math without a calculator. To illustrate this, the project showcases how LLMs can learn multiplication; the rationale for choosing multiplication is that GPT-4 cannot multiply numbers with more than 3 digits. Instead of teaching the model multiplication as 23 * 34 = 782, we teach it to do digit-wise multiplication, compute a partial product for each digit, and add the resulting numbers to get the final result.

Instruction tuning: We further fine-tuned OpenAI's GPT-3.5 to teach it math.

Close to 1300 multiplication instructions were created for training and 200 for validation. The examples were generated keeping OpenAI GPT-3.5's 4096-token limit in mind: a 5-digit x 5-digit multiplication generally fits within the limit, but a 6 x 6 one does not. If one number has 6 digits, the other can have at most 4; if one has 7 digits, the other can have at most 3.
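One way to encode those digit-limit constraints when sampling training pairs (the "digit counts sum to at most 10" rule is our reading of the stated limits, not the repo's exact code):

```python
import random

def sample_pair(max_digit_sum: int = 10) -> tuple[int, int]:
    """Sample a multiplication pair whose step-by-step trace should fit
    in the 4096-token context: 5x5 fits, 6x6 does not, a 6-digit number
    pairs with <=4 digits, a 7-digit with <=3 -- i.e. digit counts sum
    to at most 10."""
    d1 = random.randint(1, 7)                      # digits in the first number
    d2 = random.randint(1, min(7, max_digit_sum - d1))  # digits in the second
    a = random.randint(10 ** (d1 - 1), 10 ** d1 - 1)
    b = random.randint(10 ** (d2 - 1), 10 ** d2 - 1)
    return a, b

print(sample_pair())  # e.g. (48213, 907)
```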

Also, instead of using * for multiplication and + for addition, different operators <<*>> and <<<+>>> are used. The rationale is that the familiar * and + might tap into the network's existing weights, which jump directly to the result of a multiplication in one step rather than following the step-by-step procedure.
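A sketch of how a training instruction might be rendered with the custom operators (the exact template in the repo may differ):

```python
MUL_OP = "<<*>>"    # replaces *, so the model can't reuse its one-step multiplication weights
ADD_OP = "<<<+>>>"  # replaces +

def build_instruction(a: int, b: int) -> str:
    """Render a step-by-step multiplication trace using the custom operators."""
    lines = [f"{a} {MUL_OP} {b}"]
    partials = []
    for place, digit in enumerate(reversed(str(b))):
        partial = a * int(digit) * 10 ** place
        partials.append(partial)
        lines.append(f"{a} {MUL_OP} {digit} (place {place}) = {partial}")
    # Add the shifted partial products to get the final answer.
    lines.append(f" {ADD_OP} ".join(str(p) for p in partials) + f" = {a * b}")
    return "\n".join(lines)

print(build_instruction(23, 34))
```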

Sample Instruction

The overall training/validation loss drops close to 0 within 0.1 epochs.

Results

Benchmarking was done on 200 test cases, each with two randomly generated numbers. Of the 200 samples tested, the multiplication was correct in all but 3 cases, an overall accuracy of 98.5%. (We're also looking for feedback from the community on how to test this more rigorously.)
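The reported figure follows directly from the counts:

```python
def accuracy(results: list[bool]) -> float:
    """Fraction of test cases the model got right."""
    return sum(results) / len(results)

# 197 of the 200 benchmark multiplications were correct, 3 were not:
print(f"{accuracy([True] * 197 + [False] * 3):.1%}")  # 98.5%
```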

Future Improvements

  • Reach out to the AI and open-source communities to improve this proposal or identify any flaws.
  • Repeat the fine-tuning process with open-source LLMs.
  • Figure out the smallest LLM that can do math accurately when trained in a procedural manner (a 10-year-old kid can do math). Check this for both standard and distilled models.

Requesting Feedback from the AI Community!

  • undefdev@alien.top · 1 year ago

    I don’t understand the motivation behind this.

    Fine, you’ve run an experiment out of curiosity and gotten the result, but why would you want to fine-tune more language models on this?

    It’s not like we need models that are almost as good at things computers are excellent at, while using orders of magnitude more resources.

    It would be way more useful to train tiny models to predict when a calculator should be used.

    • wishtrepreneur@alien.top · 1 year ago

      It’s not like we need models that are almost as good at things computers are excellent at, while using orders of magnitude more resources.

      One of the arguments for learning math (calculus, linear algebra, etc.) in school is that it supposedly helps with critical thinking, logical reasoning, etc.

      If this can be tested in LLMs, then it gives weight to that argument, because let’s face it, 99% of the population doesn’t use anything more complicated than exponential equations in their everyday lives.

      • undefdev@alien.top · 1 year ago

        Calculus, linear algebra, and mathematics in general are a good idea. Arithmetic probably is not. To me that’s like training LLMs to count up to high numbers correctly. I’m arguing that instead of reading a book on “the first 10^12 natural numbers”, one should read a book on linear algebra.

    • Desik_1998@alien.top (OP) · 1 year ago

      I don’t understand the motivation behind this. It’s not like we need models that are almost as good at things computers are excellent at, while using orders of magnitude more resources. It would be way more useful to train tiny models to predict when a calculator should be used

      I’m 100% with you that we should use calculators directly instead of LLMs for math; we humans use calculators too. But the thing is, many claim LLMs simply cannot do math. This project is just to prove that claim wrong: when an LLM learns the process, especially in the case of math, it can do it. I mean, if we consider the LLM to be a brain, it should ideally be able to master math similar to how the human brain does, right?

      Fine, you’ve ran an experiment out of curiosity and you got the result, but why would you want to finetune more language models on this?

      I mentioned this in the GitHub repo but missed it in the Reddit post. Here is the rationale: when initially tested with in-context learning on GPT-4, the model was able to follow the procedure shown in the 1-shot prompt but sometimes failed on the overall result due to errors in the addition step. And given this is a procedural technique, n-shot prompting would require even more tokens. Given these limitations, it was evident that fine-tuning would solve these issues. The model also learned the procedure quickly: the overall training and validation loss went close to 0 within 0.1 epochs.

      • BackwardsPuzzleBox@alien.top · 1 year ago

        I mean if we consider the LLM to be the brain, it should ideally be able to master Math similar to how human brain does right

        Except it’s not a brain. It’s, at best, a tiny portion of a brain’s processing, hence the move toward multi-modal models.

        It’s a bit like trying to do poetry with your visual cortex, or math using your autonomic system. I mean, more strength to you, but while you can use a hammer on a screw, it doesn’t mean you should.

        • Desik_1998@alien.top (OP) · 1 year ago

          Yes, I agree that multimodal is the way to go, but the current text-based LLMs are also building a world view.