Error on missing values without missing values #652

Answered by mllg
phisanti asked this question in Q&A
However, I am not sure whether I should use 0 or 1 as the NA substitute. At least the hyperparameter tuning is running normally.

Well, it depends. Here are some good reads on this topic:

  1. https://github.com/dice-group/gerbil/wiki/Precision,-Recall-and-F1-measure
  2. https://stackoverflow.com/questions/44008563/zero-denominator-in-roc-and-precision-recall
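
To make the zero-denominator case concrete, here is a minimal, library-free Python sketch (not mlr3 code). The function name `f1_score` and the convention of returning `None` for an undefined score are assumptions made for illustration only. With F1 = 2·TP / (2·TP + FP + FN), a split that contains no positive observations and receives no positive predictions yields 0/0, which is presumably where the NA comes from even though the data itself has no missing values.

```python
def f1_score(truth, pred, positive=1):
    """Return F1 for the positive class, or None when it is undefined (0/0)."""
    pairs = list(zip(truth, pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    denom = 2 * tp + fp + fn
    if denom == 0:
        # No positives in the truth and none predicted: the score is undefined.
        return None
    return 2 * tp / denom


# A split that contains positives: tp=1, fp=0, fn=1, so F1 is defined.
defined = f1_score([1, 0, 1, 0], [1, 0, 0, 0])

# A split without any positives, and no positive predictions: 0/0.
undefined = f1_score([0, 0, 0], [0, 0, 0])
```

Whether the undefined case should be replaced by 0 or 1 is exactly the judgment call discussed in the links above; the sketch deliberately leaves it as `None`.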

There are (at least) two ways to tackle this problem:

  1. Ensure that each split has a reasonable number of observations of each label. You can do this manually or via stratification (set the column role "stratum" on the task before instantiating the resampling).
  2. Switch from macro to micro aggregation: msr("classif.f1", average = "micro").
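
Both remedies can be sketched in plain Python (again not mlr3 code; all function names here are hypothetical). The sketch assumes that macro aggregation scores each split separately and averages the results, while micro aggregation pools the predictions of all splits before scoring once. It also uses a simple round-robin assignment within each label as a stand-in for stratification.

```python
from collections import defaultdict


def counts(truth, pred, positive=1):
    """Return (tp, fp, fn) for the positive class."""
    pairs = list(zip(truth, pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    return tp, fp, fn


def f1_from_counts(tp, fp, fn):
    """F1 from confusion counts, or None when the denominator is zero."""
    denom = 2 * tp + fp + fn
    return None if denom == 0 else 2 * tp / denom


def stratified_folds(labels, k):
    """Assign indices to k folds round-robin within each label."""
    by_label = defaultdict(list)
    for i, y in enumerate(labels):
        by_label[y].append(i)
    folds = [[] for _ in range(k)]
    for indices in by_label.values():
        for j, i in enumerate(indices):
            folds[j % k].append(i)
    return folds


def macro_f1(folds, truth, pred):
    """Score each fold separately, then average; None if any fold is undefined."""
    scores = [
        f1_from_counts(*counts([truth[i] for i in f], [pred[i] for i in f]))
        for f in folds
    ]
    if any(s is None for s in scores):
        return None
    return sum(scores) / len(scores)


def micro_f1(folds, truth, pred):
    """Pool the confusion counts of all folds, then score once."""
    tp = fp = fn = 0
    for f in folds:
        a, b, c = counts([truth[i] for i in f], [pred[i] for i in f])
        tp, fp, fn = tp + a, fp + b, fn + c
    return f1_from_counts(tp, fp, fn)


truth = [1, 1, 0, 0, 0, 0, 0, 0]
pred = [1, 0, 0, 0, 0, 0, 0, 0]

# Contiguous folds: the second fold has no positives at all.
contiguous = [[0, 1, 2, 3], [4, 5, 6, 7]]
macro_contiguous = macro_f1(contiguous, truth, pred)  # undefined (None)
micro_contiguous = micro_f1(contiguous, truth, pred)  # defined after pooling

# Stratified folds: each fold receives one positive observation.
stratified = stratified_folds(truth, k=2)
macro_stratified = macro_f1(stratified, truth, pred)  # defined in every fold
```

With the contiguous split, the macro score is undefined because one fold has a zero denominator, whereas pooling avoids the problem. With the stratified split, every fold contains a positive observation, so the per-fold scores are all defined.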

Answer selected by pat-s

This discussion was converted from issue #640 on May 29, 2021 16:24.