[R] ConvNets Match Vision Transformers at Scale

psyyduck@alien.top · 2 years ago

[R] ConvNets Match Vision Transformers at Scale

GFrings@alien.top · 2 years ago

Has there been a study that performed a deep dive into the opposite end of the spectrum? There are myriad edge applications out there which cannot rely on training a large model and pruning it down for deployment. I wonder which architectures are most suited to learning at small scales.

currentscurrents@alien.top · 2 years ago

Generally, models with stronger inductive biases (like CNNs) work better at small scales - as long as those biases are correct for the kind of data you’re working with.