ACreativeNerd@alien.top to Machine Learning@academy.garden • [D] Three things I think should get more attention in large language models · 1 year ago
Could someone explain how and why log(exp(x) + 1) works?