How are missing/ambiguous data handled? #180

wzbillings · 2024-12-09T17:33:46Z

In the documentation for dist.hamming(), it says "The default is handle missing data as in pml". However, the pml function documentation doesn't appear to explain the default way that missing data are handled. I took a look at the code in phylo.R but don't understand it enough to answer my own question.

I see that the default argument is exclude = "none", and when I use phangorn::dist.hamming(), and I get different answers than I do from a naive hamming distance method (e.g. stringdist::stringdist(..., method = "hamming"), which implies that the default is to handle ambiguous states and gaps in some way.

If you have time, I would appreciate a brief explanation of the default way that missing data are handled. In particular, is there a difference in the way ambiguous residues and gaps are handled?

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How are missing/ambiguous data handled? #180

How are missing/ambiguous data handled? #180

wzbillings commented Dec 9, 2024 •

edited

Loading

How are missing/ambiguous data handled? #180

How are missing/ambiguous data handled? #180

Comments

wzbillings commented Dec 9, 2024 • edited Loading

wzbillings commented Dec 9, 2024 •

edited

Loading