wen_mars@alien.topBtoMachine Learning@academy.garden•[D] Difference between CUDA and Tensor CoresEnglish
1·
1 year agoAnd recently tensor cores have started appearing with 8 bit float/int as well, which gives them a huge advantage in inference throughput. The memory bandwidth limitation can be mitigated by increasing the batch size.
If you multiply two 16-bit numbers the result can overflow the range that can be represented by 16 bits.