I’m looking for suggestions for a transformer model that I can fine-tune for a text classification task. Due to hardware constraints, the model has to be fairly small: something on the order of a 50 MB weight file.
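For a sense of scale, here is a back-of-the-envelope sketch of what a 50 MB budget means in parameters. It assumes the weight file is dominated by raw weights (no optimizer state, negligible metadata) and that 1 MB is 1,000,000 bytes; the helper name `max_params` is made up for illustration.

```python
# Rough parameter budget for a weight file of a given size.
# Assumption: the file is almost entirely raw weights, 1 MB = 1,000,000 bytes.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "int8": 1}

def max_params(file_size_mb, dtype="float32"):
    """Approximate number of parameters that fit in file_size_mb megabytes."""
    return int(file_size_mb * 1_000_000 // BYTES_PER_PARAM[dtype])

for dtype in BYTES_PER_PARAM:
    print(f"{dtype}: ~{max_params(50, dtype) / 1e6:.1f}M parameters")
# float32: ~12.5M parameters
# float16: ~25.0M parameters
# int8: ~50.0M parameters
```

So at full precision the budget is roughly 12.5 million parameters, and storing weights in half precision or 8-bit form would stretch that further.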
While not a transformer, what about “Gaussian naive Bayes”? It’s not the best classifier around, but for some tasks it’s good enough. I used it to build a small search-term classifier that assigns e-commerce search terms to a category or tag.
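To give an idea of how small such a model can be, here is a minimal, dependency-free sketch of Gaussian naive Bayes over bag-of-words counts. It is not the classifier described above: the class name `TinyGaussianNB`, the `var_smoothing` value, and the toy search terms and categories are all made up for illustration. In practice you would more likely reach for scikit-learn's `GaussianNB`.

```python
import math
from collections import defaultdict

def tokenize(text):
    return text.lower().split()

class TinyGaussianNB:
    """Gaussian naive Bayes over word-count vectors (illustrative sketch)."""

    def __init__(self, var_smoothing=1e-2):
        # Assumed value; added to every variance so none is zero.
        self.var_smoothing = var_smoothing

    def _vectorize(self, text):
        vec = [0.0] * len(self.vocab)
        for tok in tokenize(text):
            idx = self.vocab.get(tok)
            if idx is not None:  # words unseen in training are ignored
                vec[idx] += 1.0
        return vec

    def fit(self, texts, labels):
        # Vocabulary: one feature per distinct training token.
        self.vocab = {}
        for t in texts:
            for tok in tokenize(t):
                self.vocab.setdefault(tok, len(self.vocab))
        by_class = defaultdict(list)
        for t, y in zip(texts, labels):
            by_class[y].append(self._vectorize(t))
        # Per class: log prior, per-feature mean and variance.
        self.stats = {}
        for y, rows in by_class.items():
            n = len(rows)
            cols = list(zip(*rows))
            means = [sum(c) / n for c in cols]
            variances = [
                sum((x - m) ** 2 for x in c) / n + self.var_smoothing
                for c, m in zip(cols, means)
            ]
            self.stats[y] = (math.log(n / len(texts)), means, variances)
        return self

    def predict(self, text):
        x = self._vectorize(text)
        best, best_score = None, -math.inf
        for y, (log_prior, means, variances) in self.stats.items():
            # Sum of log prior and per-feature Gaussian log-likelihoods.
            score = log_prior
            for xi, m, v in zip(x, means, variances):
                score += -0.5 * math.log(2 * math.pi * v) - (xi - m) ** 2 / (2 * v)
            if score > best_score:
                best, best_score = y, score
        return best

# Hypothetical training data: search terms and their categories.
train = [
    ("red running shoes", "footwear"),
    ("leather boots for men", "footwear"),
    ("kids sandals", "footwear"),
    ("wireless bluetooth headphones", "electronics"),
    ("usb c charger", "electronics"),
    ("4k smart tv", "electronics"),
]
model = TinyGaussianNB().fit([t for t, _ in train], [y for _, y in train])
print(model.predict("running shoes"))      # footwear
print(model.predict("bluetooth charger"))  # electronics
```

The whole fitted model is just a vocabulary plus a mean and a variance per word per class, so it stays far below a 50 MB budget for any reasonably sized vocabulary.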