Matching with word boundaries #3963

maxaykin · 2024-08-09T14:18:28Z

Checklist

I have read through the manual page (man fzf)
I have searched through the existing issues
For bug reports, I have checked if the bug is reproducible in the latest version of fzf

Output of `fzf --version`

0.54.3 (c423c49)

OS

Linux
macOS
Windows
Etc.

Shell

bash
zsh
fish

Problem / Steps to reproduce

Hello,

recently I started using fzf (mainly with vim which is my primary development editor) and it is really a game-changer for me. Unfortunately, there is a major weakness which I face quite often in my daily work: inability to make fzf query more strict by specifying that its part(s) should be matched against whole word(s), i.e. with word boundary check.

The sequence of actions is usually as follows:

in vim: feeding fzf in fuzzy mode with results from 'ag' to find where a term is used in source code
finding hundreds or even thousands of lines
trying to reduce the number of results by prefixing the pattern with '
there are still too many results (e.g. if I look for a variable "Window" there may be many results like "bottom_window" or "InitWindow()")
switching to other methods like running 'ag -w' in shell

It would be very helpful to have a way to toggle word boundaries check for a part of query, for example by:

prefixing with "
surrounding with '
introducing regular expressions of some kind

Also it would be very nice to keep fuzzy mode still possible for other parts of the same query.

I know there are a number of similar issues requesting features like toggling "case + non-fuzzy" mode or something. It seems the closest one is #2394. Sadly, it looks like such requests are often rejected, so here are some argument in favor.

In my opinion, now fzf is not just a fuzzy finder, its primary goal is to allow a quick selection of one or few items among many others. It provides different modes and ways for doing that. It is a selector as such, it can switch quickly between fuzzy and non-fuzzy mode, it can stick the search pattern to beginning or end of line and so on. Thus, it is very disappointing if none of those modes and switches help and fzf still fails to limit the number of items in the list enough.

Switching to a different source of data (e.g. running ag with '-w') is too long and inconvenient and it may require several iterations of refining the query pipeline.

As a programmer I have had a quick look at the sources of fzf (though, I haven't used Golang before) and it seems the implementation should not be difficult. Please, consider it.

The text was updated successfully, but these errors were encountered:

junegunn · 2024-08-10T00:36:45Z

switching to other methods like running 'ag -w' in shell

Switching to a different source of data (e.g. running ag with '-w') is too long and inconvenient and it may require several iterations of refining the query pipeline.

I think you're looking for change:reload approach explained in https://junegunn.github.io/fzf/tips/ripgrep-integration/

fzf.vim provides this functionality as :RG (all caps) command.

maxaykin · 2024-08-12T09:13:22Z

Thank you for the quick response!

I have already been using reload with a binding for viewing git commits via fzf (for switching between all commits and current branch commits) but I missed the idea that the whole query can be passed to external command and that the reload can be initiated on every change of the query string. This way it is even possible to use regexp in the query. Cool! Though, isn't it to expensive to re-run rg/ag on a large set of files after every single keypress (when it changes the query string)?

Anyway, it still looks to me more natural and convenient (considering the existing switches of the extended search mode) and also quicker (no reload of data) and more unified (taking into account different commands in vim where fzf is used) to add just another switch for enabling word boundary for parts of the search query. I have already done that (a draft version) for myself in a local repository of fzf and so far it looks to be working fine. Probably I will stick to this approach.

junegunn · 2024-08-12T10:15:01Z

isn't it to expensive

It depends. It can be more expensive to load everything in memory and run fuzzy matching algorithm which is likely less efficient than the search algorithm of ripgrep in this particular scenario. Anyway, it works really well, and the performance has never been an issue for me. You should try it out yourself.

maxaykin · 2024-08-12T10:48:43Z

I have just had a first try of it. Yes, it looks to be working well (though, I executed 'ag' instead of 'rg' with fzf#vim#grep2()). And I agree that the performance depends on scenario: the reload seems to be better with separate independent queries while loading everything at start and then just filtering with fzf may be preferable when you do a research on the same source data (code base) and when on each iteration you inspect results in the preview window and change the query to find something else without returning to vim/shell (this is sometimes the case for me).

However, the approach with reload proposed by you does not allow combining part of queries bound to words with parts that should be matched with fuzzy algorithm. And also I doubt that it is usable in cases when the source of data is not a grep-like command. Anyway, thank you for the very good tool and your help!

Just in case if someone (as me) still wants feature "Matching with word boundaries", here is my draft version in a fork repository: maxaykin@38d37d7

junegunn · 2024-08-12T13:14:13Z

FWIW,

There is an example in ADVANCED.md where you can dynamically switch between fzf mode and ripgrep mode.
- https://github.com/junegunn/fzf/blob/master/ADVANCED.md#switching-between-ripgrep-mode-and-fzf-mode-using-a-single-key-binding
You can match literal space by escaping it with a backslash, so I think '\ foo\ can help in some cases.

Close #3963

junegunn · 2024-08-13T02:34:32Z

Just in case if someone (as me) still wants feature "Matching with word boundaries", here is my draft version in a fork repository: maxaykin@38d37d7

No need to duplicate the majority of the code. We can just look at the bonus points to determine if one end is at a word boundary. Also, it better handles non-ASCII characters. See #3967

Close #3963

maxaykin · 2024-08-13T10:23:40Z

No need to duplicate the majority of the code. We can just look at the bonus points to determine if one end is at a word boundary. Also, it better handles non-ASCII characters.

Yes, I suspected that but had not time to dig into the logic (besides, that was the first time I changed Golang source code).
Thank you for your efforts!

Close #3963

junegunn added the feature label Aug 13, 2024

junegunn added a commit that referenced this issue Aug 13, 2024

Implement exact-boundary match type

fd2f463

Close #3963

junegunn mentioned this issue Aug 13, 2024

Implement exact-boundary match type #3967

Merged

junegunn added a commit that referenced this issue Aug 13, 2024

Implement exact-boundary match type

b1b6da2

Close #3963

junegunn added a commit that referenced this issue Aug 16, 2024

Implement exact-boundary match type

9ce81f3

Close #3963

junegunn added a commit that referenced this issue Aug 23, 2024

Implement exact-boundary match type

7852862

Close #3963

junegunn added a commit that referenced this issue Aug 25, 2024

Implement exact-boundary match type

2ccea63

Close #3963

junegunn closed this as completed in #3967 Aug 29, 2024

junegunn closed this as completed in 6a67712 Aug 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Matching with word boundaries #3963

Matching with word boundaries #3963

maxaykin commented Aug 9, 2024 •

edited

Loading

junegunn commented Aug 10, 2024 •

edited

Loading

maxaykin commented Aug 12, 2024

junegunn commented Aug 12, 2024

maxaykin commented Aug 12, 2024

junegunn commented Aug 12, 2024

junegunn commented Aug 13, 2024

maxaykin commented Aug 13, 2024

Matching with word boundaries #3963

Matching with word boundaries #3963

Comments

maxaykin commented Aug 9, 2024 • edited Loading

Checklist

Output of fzf --version

OS

Shell

Problem / Steps to reproduce

junegunn commented Aug 10, 2024 • edited Loading

maxaykin commented Aug 12, 2024

junegunn commented Aug 12, 2024

maxaykin commented Aug 12, 2024

junegunn commented Aug 12, 2024

junegunn commented Aug 13, 2024

maxaykin commented Aug 13, 2024

maxaykin commented Aug 9, 2024 •

edited

Loading

Output of `fzf --version`

junegunn commented Aug 10, 2024 •

edited

Loading