Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Calculating EER #49

Open
nidhal1231 opened this issue Jul 18, 2019 · 4 comments
Open

Calculating EER #49

nidhal1231 opened this issue Jul 18, 2019 · 4 comments

Comments

@nidhal1231
Copy link

I can't understand well the process of calculating EER .
I would be greatful if someone could explain this to me .

@cbrochtrup
Copy link

cbrochtrup commented Jul 22, 2019

EER is a performance metric for binary classification systems that output class likelihoods. It corresponds to the amount mislabeled samples given equal (or as close to equal as possible) false negatives and false positives. This corresponds to the point in the DET curve where the false positive and miss detection ( false negative ) are equal.

For a different visualization read this.

@nidhal1231
Copy link
Author

@cbrochtrup Thanks a lot

@chrisspen
Copy link

@cbrochtrup That link basically says the EER is a worthless metric for any real work. Is that what you meant to link to?

"For starters, we can dismiss the equal error rate (EER). In all my research, I have yet to encounter a use case where having an equal probability of false accept or false reject was optimal. EER is an interesting academic construct but it’s hard to think of a use case where it matters."

@cbrochtrup
Copy link

cbrochtrup commented Oct 15, 2019

Haha, I can see why you'd think that. My apologies! I don't think EER is useless. EER is the standard metric to compare system performance in the speaker verification literature.

Why does the writer think EER is useless? Because EER is not important in many industry applications such as medical diagnosis or biometric security. For example, if you're building a system to detect cancer you don't want to have equal errors - which is what the EER would give. For detecting cancer you're okay with false alarms but false negatives are dangerous because missing a cancer diagnosis (false negative) could cost a patient their life. The writer is likely a researcher in a field where EER is not the standard metric.

In summary, EER is used in research to compare system performance but in practice, you would change the detection threshold to reduce the most costly type of system errors. Does that make sense? Do you know what I mean by "threshold"?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants