Hashes/encodings below the heuristic limit are treated as typos #415
Comments
The challenge is being able to identify that as a hash. How do we tell a hash from an identifier? Right now, we support
|
Yeah, that makes a lot of sense. I'm just starting to use the app and loving it so far. I've only had to whitelist two hashes that have ba in them, so it's not a big deal for me. Maybe some sort of regex or something so I could whitelist |
I think it also causes some problems with Jupyter notebooks.
|
Hello, Git commit hashes tend to run in the range |
@tspearconquest for shorter git commit hashes, we'll need to rely on a heuristic like talked about in #484 because shorter commit hashes could just as easily be words. |
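To illustrate why short hashes are hard to tell apart from words, here is a minimal sketch. It assumes a "possible short hash" is any token of 7 to 12 hex characters; both the bounds and the function name `looks_like_short_hash` are hypothetical and are not taken from typos itself.

```python
import re

# Assumption: 7 to 12 hex characters counts as a "possible short hash".
# These bounds are illustrative only, not the limits typos uses.
HEX_TOKEN = re.compile(r"[0-9a-f]{7,12}")


def looks_like_short_hash(token: str) -> bool:
    """Return True if the whole token consists of 7-12 hex characters."""
    return HEX_TOKEN.fullmatch(token.lower()) is not None


if __name__ == "__main__":
    # "defaced" and "effaced" are ordinary English words made only of hex letters.
    for token in ["1a2b3c4", "defaced", "effaced", "hashing"]:
        print(token, looks_like_short_hash(token))
```

A purely character-based check accepts real words such as "defaced", so it cannot separate them from abbreviated commit hashes without some extra signal.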
How about adding a heuristic "word contains characters preceded by numbers" (where "word" is a whitespace-separated segment, not a case-separated segment)? I don't think I've ever seen an identifier be named |
sha1hash? |
Right, there's a few exceptions (I also remembered there being |
There is also all the this2that and thing4stuff |
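The heuristic proposed above, and the exceptions raised against it, can be sketched roughly as follows. This is only one reading of "word contains characters preceded by numbers" (a letter immediately preceded by a digit); the helper names are hypothetical and this is not how typos implements anything.

```python
import re

# One interpretation of the proposal: a digit immediately followed by a letter.
DIGIT_THEN_LETTER = re.compile(r"[0-9][A-Za-z]")


def digit_before_letter(token: str) -> bool:
    """Return True if the token contains a letter directly preceded by a digit."""
    return DIGIT_THEN_LETTER.search(token) is not None


def flagged_tokens(text: str) -> list[str]:
    """Split on whitespace (not on case) and keep tokens the heuristic would skip."""
    return [t for t in text.split() if digit_before_letter(t)]


if __name__ == "__main__":
    print(flagged_tokens("fix 1a2b3c4 in sha1hash via this2that"))
```

Under this reading the hash-like token is caught, but so are identifiers like sha1hash and this2that, while a hex string with no digits at all is not caught.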
@jplatte I'd probably refine your comment to be "any identifier that exclusively word splits due to numbers and not any other separator (be it case or The next question is the likelihood of a shortened sha having no numbers. I probably didn't bring this up in the other thread talking about heuristics, but I suspect it is better to have something always complain than to have it complain in a way people no longer expect. |
In the case of a hex string, that would be |
FYI #695 provides a new workaround for false positives |
The
I think we can safely close this issue. |
@azzamsa that regex is much too generic, it disables spell-checking for all 12-letter identifiers as well. |
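The over-matching concern can be shown with a small sketch. The pattern below is a hypothetical stand-in for "any 12 alphanumeric characters"; it is not the regex posted in this thread.

```python
import re

# Hypothetical pattern, NOT the one from the thread: any 12-character
# alphanumeric token bounded by word boundaries.
TWELVE_CHARS = re.compile(r"\b[0-9A-Za-z]{12}\b")


def ignored_tokens(text: str) -> list[str]:
    """Return every token that such an ignore pattern would exclude from checking."""
    return TWELVE_CHARS.findall(text)


if __name__ == "__main__":
    print(ignored_tokens("commit 3f9a1c0b7d2e breaks the notification relationship"))
```

Along with the 12-character hash, ordinary 12-letter words such as "notification" and "relationship" match too, so a misspelling of the same length would go unchecked.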
original: #982 |
↑ This |
Example of SHA-256 hash "typo"
Using Seems like defaults might be a little loose |
I can add it to
[default.extend-identifiers]
so it's not a blocker, but I figured you'd like another test case.