Hi. So I am a bit new to NLP and ML as a whole and I am looking to create a text classification model. I have tried it with DeBERTa and the results are decent (about 70%), but I need more accuracy. Are generative models a better alternative, or should I stick to smaller models like BERT, or maybe even non-NN classifiers and work on better dataset quality?
- Maybe it’s overkill, idk, but if you want higher accuracy, it’s an option
- You can just list examples from your dataset and let the LLM complete the last one
- Example:

  ```
  # Classify text (a) advertisement (b) poetry (c) information

  Ignore real-time AI and customers will do the same to you. Our vector database is AI-ready and proven at scale.
  Class: (a)

  I find no peace, and all my war is done. I fear and hope. I burn and freeze like ice. I fly above the wind, yet can I not arise
  Class: (b)

  YOUR BEST COMES OUT OF THE BLUE. EXPLORE BOISE STATE
  Class: (a)

  Two-month ramp closure: northbound OR 99W onto OR 217 north. Starts May 31. Oregon Department of Transportation. OR 217 AUXILIARY LANES
  Class: (c)

  Staying healthy. Staying active. We have it all right here. IN YOUR PRIME LEARN MORE LIVING YOUR BEST LIFE
  Class: (a)

  Go further, FASTER. Take the world's premier English-proficiency test in less than 2 hours!
  Class: (a)

  A rhinoceros beetle is a living thing. Rhinoceros beetles grow and respond to their environment. They need food and water.
  Class: (c)

  Our vice runs beyond all that old men saw, And far authentically above our laws, And scorning virtues safe and golden mean, Sits uncontrolled upon the high extreme.
  Class: (b)

  {your text here}
  Class: ({generate one token})
  ```

- I don’t know about the task you have in mind specifically, but you can do just about anything with a 13B llama model. Picking a fine-tune doesn’t matter if you use examples instead of instructions. 7B Mistral seems to do fine with this example (even GPT-2 can do some classification), but in-context learning is remarkably better at 13B, picking up a lot more nuance
- My classification task is to classify a given essay as AI-generated or human-generated. And I need the answer to be between 0 and 1 (both included), with 1 being AI-generated and 0 being human-generated.
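The few-shot pattern above is easy to assemble programmatically: join (text, label) pairs and leave the final `Class: (` open so the model's next token is the predicted class letter. A minimal sketch (the header, example pairs, and query below are placeholders, not real data):

```python
def build_prompt(task_header, examples, query):
    """Assemble a few-shot classification prompt.

    examples: list of (text, label) pairs, e.g. [("some ad copy", "a"), ...].
    The returned prompt ends with an open "Class: (" so that a causal LM's
    next generated token is the class letter.
    """
    parts = [task_header, ""]
    for text, label in examples:
        parts.append(text)
        parts.append(f"Class: ({label})")
        parts.append("")  # blank line between examples
    parts.append(query)
    parts.append("Class: (")
    return "\n".join(parts)


# Toy usage with placeholder examples:
prompt = build_prompt(
    "# Classify text (a) advertisement (b) poetry (c) information",
    [("Buy now and save big!", "a"),
     ("Shall I compare thee to a summer's day?", "b")],
    "Rhinoceros beetles grow and respond to their environment.",
)
print(prompt.endswith("Class: ("))  # feed this prompt to the LM, read one token
```

The string can then be passed to whatever model you're testing (Mistral, llama, even GPT-2) with generation capped at one token.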
- Few-shot examples are a good idea for most classification tasks, but I don’t think generative LLMs can pick up on the more intricate semantic patterns needed to differentiate AI-generated from human-generated text with just a few examples. I’ll try it out and let you know!
- Btw, do you think fine-tuning would be better?
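On the 0-to-1 requirement: with a generative model you don't have to settle for a hard label. If the model is prompted to answer "0" (human) or "1" (AI), you can read the logits it assigns to those two candidate tokens and softmax over just that pair, which yields a score guaranteed to lie in [0, 1]. A toy sketch of the normalization step (the logit values are made up, not from a real model):

```python
import math

def score_from_logits(logit_human, logit_ai):
    """Softmax over the two candidate answer tokens '0' and '1'.

    Returns P(AI-generated), which always lies in [0, 1].
    """
    m = max(logit_human, logit_ai)  # subtract the max for numerical stability
    e_human = math.exp(logit_human - m)
    e_ai = math.exp(logit_ai - m)
    return e_ai / (e_human + e_ai)


# Illustrative logits only, not real model output:
print(round(score_from_logits(2.0, 3.5), 3))  # → 0.818, leans AI-generated
```

With equal logits the score is exactly 0.5, so the value behaves like the graded confidence the task asks for.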
- +1, when in doubt, LLM it out.
- You could also ask for explanations, so when it gets it wrong, you can work on modifying your prompts/examples to get better performance.
- Potentially you wouldn’t want to do this if:
  - Your classification problem is very unusual/cannot be explained by a prompt
  - You want to be able to run this extremely fast or on a ton of data
  - You want to learn non-LLM deep learning/NLP (in which case I would’ve suggested basically some form of fine-tuning BERT)
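And if the non-LLM route wins out (the OP also floated non-NN classifiers), a classical baseline is cheap to try before any fine-tuning: TF-IDF features plus logistic regression already produces a probability in [0, 1] via `predict_proba`. A minimal sketch assuming scikit-learn is available; the four texts below are placeholder toy data, which real essays would replace:

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Placeholder training data: label 1 = AI-generated, 0 = human-written.
texts = [
    "As an AI language model, I can provide a structured overview.",
    "In conclusion, the aforementioned factors collectively demonstrate this.",
    "honestly i just winged the essay the night before lol",
    "My grandmother's kitchen always smelled of burnt toast.",
]
labels = [1, 1, 0, 0]

clf = make_pipeline(TfidfVectorizer(), LogisticRegression())
clf.fit(texts, labels)

# predict_proba returns [P(label 0), P(label 1)] per input; take column 1.
score = clf.predict_proba(["The factors collectively demonstrate a trend."])[0, 1]
print(0.0 <= score <= 1.0)  # score is the required 0-to-1 AI-generated value
```

It won't match a fine-tuned DeBERTa on a hard task like AI-text detection, but it runs in seconds and gives a floor to beat while you iterate on dataset quality.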