If it is truly memorizing the ENTIRE set of training data, then is it not lossless data compression that is much more efficient than any known compression algorithms?
It has to be lossy compression aka it doesn’t remember its ENTIRE set of training data, word for word.
If it is truly memorizing the ENTIRE set of training data, then is it not lossless data compression that is much more efficient than any known compression algorithms?
It has to be lossy compression aka it doesn’t remember its ENTIRE set of training data, word for word.