caret model$results outputs SD not se, right? #19

ttimbers · 2020-01-23T05:19:35Z

Here you take RMSESD from the results attribute of a train (model) object and then refer to it as se and standard error in the text. However, SD commonly refers to standard deviation. I think something has to be done to calculate the standard error from the standard deviation. Or is the output of caret misleading?

daviddalpiaz · 2020-01-24T00:02:31Z

That's likely just sloppiness on my part. I believe technically it is the sample standard deviation of the RMSE from each fold.

Although, I suppose on some level, it could be viewed as an estimate of the standard deviation of the RMSE when the model is applied to new data? So in that sense, it's a standard error of the generalization error? I think the caret authors agree with this interpretation. They flip-flop between SD and SE in the documentation for the oneSE selection function here: https://www.rdocumentation.org/packages/caret/versions/6.0-84/topics/oneSE
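To make the SD versus SE distinction concrete, here is a minimal sketch (in Python rather than R, purely for illustration) of the two quantities computed from per-fold RMSE values. The fold values are made up, and dividing by the square root of the number of folds is the usual textbook conversion, which assumes the fold estimates are roughly independent; it is not a claim about what caret reports.

```python
import math
import statistics

# Hypothetical per-fold RMSE values from 5-fold cross-validation
# (made-up numbers, not output from any real model).
fold_rmse = [2.0, 2.5, 3.0, 3.5, 4.0]

k = len(fold_rmse)
mean_rmse = statistics.mean(fold_rmse)

# Sample standard deviation of the fold RMSEs (n - 1 denominator).
# This is the kind of quantity an "SD" column describes.
sd_rmse = statistics.stdev(fold_rmse)

# Standard error of the mean RMSE: SD divided by sqrt(number of folds).
# Assumption: folds are treated as approximately independent.
se_rmse = sd_rmse / math.sqrt(k)

print(f"mean RMSE = {mean_rmse:.3f}, SD = {sd_rmse:.3f}, SE = {se_rmse:.3f}")
```

The SD describes how much the RMSE varies from fold to fold, while the SE describes the uncertainty in the averaged RMSE, so the SE is always the smaller of the two when there is more than one fold.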

I actually have a note to myself from a previous semester to figure out a better way to explain this interchange to my students. I like using caret which outputs things with the SD label, but I also like teaching the one-standard-error rule that is mentioned in ISLR and implemented in caret. This is a pretty timely issue as I start thinking about this for the new semester.
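For readers unfamiliar with the one-standard-error rule, the following is a rough sketch of the idea in Python: find the best mean RMSE, add one standard error to it, and pick the simplest candidate whose RMSE falls under that threshold. The function name `one_se_choice`, the result rows, and the assumption that rows are ordered from simplest to most complex are all hypothetical; this is not caret's `oneSE` implementation.

```python
import math

def one_se_choice(results, k):
    """Pick the simplest candidate within one standard error of the best.

    results: list of dicts with keys 'param', 'rmse', 'rmse_sd',
             assumed to be ordered from simplest to most complex.
    k:       number of resamples (folds) behind each row.
    """
    # Candidate with the lowest mean RMSE.
    best = min(results, key=lambda r: r["rmse"])
    # Convert the reported SD to a standard error before applying the rule.
    threshold = best["rmse"] + best["rmse_sd"] / math.sqrt(k)
    # The first row under the threshold is the simplest acceptable one.
    for row in results:
        if row["rmse"] <= threshold:
            return row

# Hypothetical cross-validation summary (made-up numbers).
results = [
    {"param": 1, "rmse": 3.40, "rmse_sd": 0.50},
    {"param": 2, "rmse": 3.10, "rmse_sd": 0.45},
    {"param": 3, "rmse": 3.00, "rmse_sd": 0.40},
    {"param": 4, "rmse": 3.05, "rmse_sd": 0.42},
]

chosen = one_se_choice(results, k=5)
print(chosen["param"])
```

With these numbers the lowest RMSE belongs to `param` 3, but `param` 2 is simpler and still within one standard error of it, so the rule selects `param` 2.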

Somewhat unrelated and out of curiosity, how did you come across R4SL? It's a very unfinished project of mine, that I don't really tell anyone about except for my UIUC students. (So I'm always curious how it gets discovered.) I'm also in the process of starting to decommission it and replace it with this: https://github.com/daviddalpiaz/bsl (Mostly so I can sort of get a fresh start on it.)
