[D] Why are ML model outputs not tested regarding statistical significance?

Tigmib@alien.top · 1 year ago

[D] Why are ML model outputs not tested regarding statistical significance?

bethebunny@alien.top · 1 year ago

While it’s not super common in academia, it’s actually really useful in industry. I use statistical bootstrapping – poisson resampling of the input dataset – to train many runs on financial fraud models and estimate variance of my experiments as a function of sampling bias.

Having a measure of the variance of your results is critical when you’re deciding whether to ship models whose decisions have direct financial impact :P

Lanky_Product4249@alien.top · 1 year ago

Does it actually work? I.e. if you construct a 95% confidence interval with that variance, are your model predictions within the interval 95% of the time?