jxjq@alien.topBtoLocalLLaMA@poweruser.forum•Could multiple 7b models outperform 70b models?English
1·
1 year agoDoes this use of mixture-of-experts mean that multiple 70b models would perform ?better than multiple 7b models
Does this use of mixture-of-experts mean that multiple 70b models would perform ?better than multiple 7b models
Thank you for sharing, I understand now