Ok-Equipment9840@alien.top to Machine Learning@academy.garden • [D] How large an LLM can I train from scratch on a single A100 GPU with 80Gb memory? • 1 year ago
Depends on how many tokens you have?