Hello,
I have a question that may seem trivial, but it confused me because I found different approaches to computing test-set metrics in different papers.
Let's take recall as an example: in the code released with published papers, I found two different ways of calculating it on the test set. Some compute the recall for every image and then take the mean over all images; others simply apply the formula once, as if each pixel were an element of its own. What is the correct way to do this?
Thanks in advance!
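As long as the specific method they use is documented (and, ideally, their ground truth and results are available), I would say it doesn't really matter. AFAIK there is no single correct method for scoring most ML models. Just keep in mind that the two approaches generally give different numbers, so only compare results that were computed the same way.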
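To make the difference between the two approaches from the question concrete, here is a minimal sketch in plain Python. It assumes binary masks stored as nested lists of 0/1, and the function names (`recall_counts`, `per_image_mean_recall`, `pooled_pixel_recall`) are made up for this illustration, not taken from any paper's code. Skipping images that have no positive pixels in the per-image average is also just one possible convention.

```python
import math


def recall_counts(pred, truth):
    """Count true positives and false negatives for one binary mask pair."""
    tp = fn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if t:  # only ground-truth positive pixels matter for recall
                if p:
                    tp += 1
                else:
                    fn += 1
    return tp, fn


def per_image_mean_recall(preds, truths):
    """Recall computed per image, then averaged over images."""
    scores = []
    for pred, truth in zip(preds, truths):
        tp, fn = recall_counts(pred, truth)
        if tp + fn > 0:  # assumption: skip images with no positive pixels
            scores.append(tp / (tp + fn))
    return sum(scores) / len(scores) if scores else math.nan


def pooled_pixel_recall(preds, truths):
    """Recall computed once over all pixels of all images pooled together."""
    tp_total = fn_total = 0
    for pred, truth in zip(preds, truths):
        tp, fn = recall_counts(pred, truth)
        tp_total += tp
        fn_total += fn
    denom = tp_total + fn_total
    return tp_total / denom if denom else math.nan


# Image A: 1 positive pixel, found. Image B: 9 positive pixels, 3 found.
truths = [
    [[1, 0], [0, 0]],
    [[1, 1, 1], [1, 1, 1], [1, 1, 1]],
]
preds = [
    [[1, 0], [0, 0]],
    [[1, 1, 1], [0, 0, 0], [0, 0, 0]],
]

print(round(per_image_mean_recall(preds, truths), 3))  # 0.667
print(round(pooled_pixel_recall(preds, truths), 3))    # 0.4
```

The per-image mean gives every image the same weight, so the small object in image A counts as much as the large one in image B. The pooled version gives every pixel the same weight, so images with many positive pixels dominate the score.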