Often when I read ML papers, the authors compare their results against a benchmark (e.g. using RMSE, accuracy, …) and claim “our new method improves on it by X%”. Nobody performs a significance test to check whether the new method Y actually outperforms benchmark Z. Is there a reason for that? Especially when results are broken down further, e.g. into an analysis of individual classes in object classification, this seems important to me. Or am I overlooking something?
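
For concreteness, here is a minimal sketch of the kind of test I mean (my own illustration with placeholder data, not taken from any particular paper): an exact McNemar test comparing a new classifier Y against a benchmark Z on the same test set, using only the discordant examples where exactly one of the two is correct.

```python
# Sketch: exact McNemar test for paired classifier accuracy.
# y_true, pred_y (new method Y) and pred_z (benchmark Z) are placeholder arrays.
import numpy as np
from scipy.stats import binomtest

rng = np.random.default_rng(0)
y_true = rng.integers(0, 2, size=500)                           # placeholder labels
pred_y = np.where(rng.random(500) < 0.85, y_true, 1 - y_true)   # ~85% accurate
pred_z = np.where(rng.random(500) < 0.80, y_true, 1 - y_true)   # ~80% accurate

correct_y = pred_y == y_true
correct_z = pred_z == y_true

# Discordant pairs: examples where exactly one method is correct.
b = int(np.sum(correct_y & ~correct_z))   # Y right, Z wrong
c = int(np.sum(~correct_y & correct_z))   # Y wrong, Z right

# Under H0 (equal error rates) the discordant outcomes split 50/50,
# i.e. b ~ Binomial(b + c, 0.5); this is the exact McNemar test.
result = binomtest(b, b + c, 0.5)
print(f"accuracy Y={correct_y.mean():.3f}, Z={correct_z.mean():.3f}, "
      f"b={b}, c={c}, p={result.pvalue:.4f}")
```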

  • isparavanje@alien.top · 10 months ago
    You’re right. This is likely one of the reasons why ML has a reproducibility crisis, together with other effects like data leakage (see: https://reproducible.cs.princeton.edu/).

    Sometimes, indeed, results are so different that the improvement is obviously statistically significant, even by eye, which is uncommon in the natural sciences. Even then, however, the researchers should state clearly that they believe this to be the case and give some evidence for it.
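
    As a concrete illustration of what that evidence could look like for a regression metric like RMSE (a sketch with simulated residuals, not data from the linked study), a paired bootstrap over the same held-out examples gives a confidence interval for the difference between the two methods:

    ```python
    # Sketch: paired bootstrap test for the RMSE difference between a new
    # method Y and a benchmark Z, evaluated on the same held-out examples.
    # err_y and err_z are placeholder per-example residuals.
    import numpy as np

    rng = np.random.default_rng(42)
    n = 1000
    err_z = rng.normal(0, 1.00, size=n)   # placeholder residuals, benchmark Z
    err_y = rng.normal(0, 0.95, size=n)   # placeholder residuals, new method Y

    def rmse(errors):
        return np.sqrt(np.mean(errors ** 2))

    observed_diff = rmse(err_z) - rmse(err_y)   # > 0 means Y looks better

    # Resample examples (paired) to get a bootstrap distribution of the difference.
    boot_diffs = np.empty(10_000)
    for i in range(boot_diffs.size):
        idx = rng.integers(0, n, size=n)
        boot_diffs[i] = rmse(err_z[idx]) - rmse(err_y[idx])

    ci_low, ci_high = np.percentile(boot_diffs, [2.5, 97.5])
    print(f"RMSE diff = {observed_diff:.4f}, "
          f"95% bootstrap CI = [{ci_low:.4f}, {ci_high:.4f}]")
    # If the interval excludes 0, the improvement is unlikely to be sampling noise.
    ```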