
feat(chat): Adding buffer watcher #610

Open · wants to merge 18 commits into main

Conversation

@bassamsdata (Contributor) commented Jan 7, 2025

Description

This PR introduces a buffer watching mechanism to improve how CodeCompanion handles code context in conversations. Instead of resending entire pinned buffers with each message (which is token-intensive), the watcher tracks and reports only the changes made to referenced buffers.

This is a work in progress PR that aims to optimize token usage and provide better context management in the chat stack.

If the idea behind this PR is sound and it can be merged, I can continue working on it.

TODO

  • Add pre-PR changes to include line numbers in buffer slash commands for LLM context
  • Refactor the buffer watcher mechanism for more robust change tracking
  • Add test coverage
  • Update documentation to reflect new features
  • Modify the prompts to prepare the LLM for the watcher (smaller LLMs need an example of what they'll receive). This needs more thought and probably a separate PR.

Known Issues

  • Change detection sometimes becomes inconsistent after multiple message exchanges (around the 3rd/4th message). Need to verify that the change-tracking logic in chat/init.lua is properly placed and triggered.
  • The reference gets removed from the message after around the third response.
  • The watcher is better now, but it still sends only the most recent changes each time, leaving some changes unreported.
  • When adding a buffer watcher for the first time, or stopping the watcher for a buffer, the buffer is printed in the chat as a duplicate. I'm unsure why this happens. Need help here (see video).
  • Occasionally, after a long conversation, I receive a warning stating "No messages to submit." I'm not certain why this occurs, or whether it's even related to this PR. It looks like it no longer happens.

Related Issue(s)

#575
Currently, pinned buffers, while useful for maintaining context, are inefficient in terms of API token usage as they resend the same code repeatedly. This PR addresses this by implementing a change-tracking system that only sends buffer modifications to the LLM, significantly reducing token consumption while maintaining context awareness.

Technical Details

  • Implements a buffer watcher system in watcher.lua
  • Integrates with existing reference management
  • Tracks and reports buffer changes between chat interactions
  • Optimizes token usage by sending only modified content
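
For readers skimming the thread, here is a minimal sketch of the core idea; the names (`watchers`, `watch`) are illustrative and not necessarily the PR's actual API:

```lua
-- Illustrative sketch only, not the PR's actual code: each watched buffer
-- stores the content that was last sent to the LLM, so later messages can
-- report only what changed since then.
local watchers = {}

local function watch(bufnr)
  watchers[bufnr] = {
    last_sent = vim.api.nvim_buf_get_lines(bufnr, 0, -1, false),
    changedtick = vim.api.nvim_buf_get_changedtick(bufnr),
  }
end

local function unwatch(bufnr)
  watchers[bufnr] = nil
end
```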

Screenshots

Screen.Recording.2025-01-07.at.12.00.52.PM.mov

Checklist

  • I've read the CONTRIBUTING guidelines and have adhered to them in this PR
  • I've updated the README
  • I've run the `make docs` command

@olimorris (Owner) commented Jan 7, 2025

I am very excited about this. No pressure at all though. Appreciate this will require a tonne of work.

I would very much like this to replace pins in the future...watcher feels more intuitive as well.

One thing to think about as you're testing this is a prompt playbook that we can use to test against various models. I expect this will be fine with a gpt-4 or claude-sonnet but could be more challenging for self-hosted LLMs. I'm thinking out loud here, but maybe for some models you can specify to just continue to send the whole buffer with every response (after all, token consumption doesn't matter too much for those users).

Awesome job so far and I'll enjoy checking it out later in the week.

@olimorris (Owner) commented Jan 7, 2025

> [x] Add pre-PR changes to include line numbers in buffer slash commands for LLM context

I added this at the weekend btw.

@bassamsdata (Contributor, Author)

> I am very excited about this. No pressure at all though. Appreciate this will require a tonne of work.

Thank you for the kind words! It will indeed, especially the tracking mechanism.

> I expect this will be fine with a GPT-4 or Claude-Sonnet.

Yes, with Sonnet 3.5 I've had no issues at all. It's smart enough to keep track of line numbers.

> I'm thinking out loud here but maybe for some models, you can specify to just continue to send the whole buffer with every response (after all, token consumption doesn't matter too much for those users).

I've tested with some 32B models on Hugging Face. It worked when I told the LLM to track the line numbers with an exact example. I think some of the prompts will change a bit after this, to include examples once we finalize the tracking mechanism, so the LLM knows exactly what to expect.

I'm building a test to send the same prompts automatically to multiple LLMs since I usually work with Hugging Face models, and they have all kinds of LLMs that can work locally.

If we succeed without sending the whole buffer again and again, we can avoid eating up the LLM's context window, which matters especially for smaller local LLMs with smaller context windows.

@bassamsdata (Contributor, Author)

> I added this at the weekend btw.

Ah, I hadn't noticed, since I've been working from the local branch for the past two days. Thanks a bunch! One TODO item gone.

```lua
timestamp = vim.loop.now(),
reported = false,
})
log:debug("Recording change in buffer %d: lines %d-%d: %s", buf, start_row + 1, end_row + 1, vim.inspect(lines))
```
@olimorris (Owner)

Might be worth changing this line to `log:trace` as it will be writing to the log all the time. I ask users to turn logging to debug in their config when reporting an issue.

@bassamsdata (Contributor, Author)

Noted. I've added extensive debug logging to monitor the process and the changes; in the final version, it will be reduced to `log:trace`.

@olimorris (Owner)

Btw, great move to use vim.api.nvim_buf_get_changedtick(bufnr). I always forget that exists!
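
For context, `changedtick` increments on every modification of a buffer, so comparing it against a stored value is a cheap way to skip buffers that haven't changed. A minimal illustration (not the PR's actual code):

```lua
-- Returns true if the buffer has been modified since `last_tick` was recorded.
local function has_changed(bufnr, last_tick)
  return vim.api.nvim_buf_get_changedtick(bufnr) ~= last_tick
end
```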

@olimorris (Owner)

Regarding the latter issue, is it a storage issue? I.e. is Neovim only sharing the last change, rather than all of the changes since a point in time?

@bassamsdata (Contributor, Author)

I don't think that's the case. The consolidation logic is too aggressive on purpose: it only keeps the last change for a line range. So when multiple changes happen in one range, if line 2 changed and then line 3 changed, the consolidation will send line 3 only.

I'm considering a different approach: keep watching lines only and capture all modified lines on the buffer's FocusLost event, or perhaps a less aggressive consolidation mechanism.
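
To illustrate the failure mode described above (a hypothetical reconstruction, not the PR's actual consolidation code): when two changes touch overlapping or adjacent ranges, the later one simply replaces the earlier one instead of being merged with it.

```lua
-- Hypothetical reconstruction of an over-aggressive consolidation:
-- adjacent/overlapping changes collapse to the most recent one, so an
-- earlier edit (e.g. to line 2) is dropped when line 3 changes afterwards.
local function consolidate(changes)
  local result = {}
  for _, change in ipairs(changes) do
    local last = result[#result]
    if last and change.start_row <= last.end_row + 1 then
      result[#result] = change -- overwrites the previous change entirely
    else
      table.insert(result, change)
    end
  end
  return result
end
```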

@bassamsdata (Contributor, Author)

Current State of the PR:
The watcher seems to be sending the correct data to the LLM. However, please ensure the watcher starts from the second message, not the first one where the buffer is added. (I haven't tested this yet, so I'm unsure whether it works correctly when the watcher starts immediately after the buffer is added.)

This PR is still incomplete, but the basic structure seems solid. Next steps include refining and modifying the prompts, cleaning up the code, and conducting thorough testing (both with written tests and LLM testing).

@olimorris (Owner)

Awesome. Plan on testing this properly at the weekend.

@olimorris (Owner)

Btw, wondered whether oil.nvim is a useful reference for keeping track of buffer changes?

@bassamsdata (Contributor, Author)

When I first saw your comment, I was commuting and started wondering why oil.nvim would watch buffers (I don't use it myself). Then it clicked: oil is a buffer, so of course it's watching for changes!

Honestly, I’m glad I built the buffer watcher without knowing oil.nvim does something similar. It’s satisfying to create something independently without directly borrowing ideas from other plugins.

At first, I struggled with nvim_buf_attach() because handling multiline deletions was tricky, only a single line would register as deleted in Neovim. After some digging on Stack Exchange and Stack Overflow, I learned more about these quirks. My goal was to have the watcher gather detailed data and consolidate it for the LLM.

Last night, I decided to simplify things and ditched nvim_buf_attach(). Instead, I implemented a straightforward solution based on text differences. It turned out great: simple, robust, and capable of sending clear data to the LLM with distinct types like deletion, insertion, and modification. I didn't have time to fully test it then, though I was testing it with the LLM.

Today, I wrote a lot of tests, including some edge cases I thought of: 13 tests in total, spanning about 370 lines of code. Despite that, the core mechanism of the watcher remains simple, at around 130 lines. After all those tests, it has proven to handle those cases. I'm sure it's not perfect and users will likely uncover new edge cases, but that's okay; I'm planning to add more edge cases to the tests as well.
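
As a rough illustration of the diff-based approach described here (the data shapes and names are assumed, not the PR's actual code; Neovim's built-in `vim.diff()` is one way to compute the hunks):

```lua
-- Sketch of a diff-based watcher, assuming `last_sent_lines` was stored
-- when the buffer was last sent. vim.diff() with result_type = "indices"
-- returns hunks of the form {start_a, count_a, start_b, count_b}.
local function get_changes(bufnr, last_sent_lines)
  local current = vim.api.nvim_buf_get_lines(bufnr, 0, -1, false)
  local old_text = table.concat(last_sent_lines, "\n") .. "\n"
  local new_text = table.concat(current, "\n") .. "\n"

  local changes = {}
  for _, hunk in ipairs(vim.diff(old_text, new_text, { result_type = "indices" })) do
    local start_a, count_a, start_b, count_b = unpack(hunk)
    -- classify each hunk into the three types mentioned above
    local kind = (count_a == 0 and "insertion")
      or (count_b == 0 and "deletion")
      or "modification"
    table.insert(changes, {
      type = kind,
      start = start_b,
      lines = count_b > 0 and vim.list_slice(current, start_b, start_b + count_b - 1) or {},
    })
  end
  return changes, current
end
```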

@bassamsdata (Contributor, Author)

I took a quick look at the oil.nvim implementation; the codebase is large and complex. From what I observed, it uses a different approach and incorporates several advanced features, such as:

  1. uv.new_fs_event() to monitor changes in the oil directory path (see the sketch after this list).
  2. Handling events like file additions, deletions, or renames, with multiple buffer checks.
  3. Using numerous Neovim events like TextChanged, CursorMoved, ModeChanged, InsertEnter, BufEnter, and BufUnload to track various changes.
  4. Frequent use of the buffer-local variable vim.bo.modified, which I found particularly interesting.
  5. Extensive use of timers. I experimented with timers in a more complex earlier version of the watcher; it worked well but didn't seem strictly necessary.
  6. It doesn't seem to use the changedtick buffer variable :(
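
For reference, the fs_event mechanism from item 1 looks roughly like this (a paraphrased sketch, not oil.nvim's actual code; `vim.uv` is `vim.loop` on older Neovim versions, and `dir_path` is a placeholder):

```lua
-- Paraphrased sketch of filesystem watching via libuv (not oil.nvim's code).
local dir_path = vim.fn.getcwd() -- placeholder for the watched directory
local handle = vim.uv.new_fs_event()
handle:start(dir_path, {}, vim.schedule_wrap(function(err, filename, events)
  if err then
    return
  end
  -- react to files under dir_path being added, deleted or renamed
end))
```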

Overall, the oil.nvim implementation is significantly more complex, working closely with system events and serving a different purpose. Personally, I tend to favor simpler, easier-to-maintain solutions unless there's a substantial benefit to a more intricate approach.

That said, if you think the oil.nvim implementation would better suit this purpose, I can revisit its code to fully understand the mechanism. It might take some time to go through, as it's a bit complex.

For context, I'm a big fan of Steven's work and regularly use some of his other plugins, like quicker, overseer, and conform. If we encounter edge cases that this watcher doesn't handle, we can consider integrating some of those ideas (e.g., involving vim.bo.modified or certain Neovim events). What do you think?

@bassamsdata (Contributor, Author) commented Jan 11, 2025

  • Models tested with complex modifications:
    • GPT-4o - PASS
    • Sonnet 3.5 - PASS
    • Qwen2.5-Coder-32B - PASS
    • Gemini 2 Flash - PASS
    • llama-3.2-90b-vision-preview - PASS
    • Llama 3.2 11B - PASS
    • Mixtral 8x7B - PASS
    • GPT-4o mini - NOT tested yet
    • Haiku 3.5 - NOT tested yet

@GitMurf (Contributor) commented Jan 12, 2025

I have not reviewed any of the code; I'm on a trip with the family. But I reviewed the conversation above and thought my two cents, in the form of a potentially "dumb" question, might help, as it's the first thing that came to mind when trying to think of a "simple" solution.

What if, when a buffer is added to a chat, it stores the last-modified timestamp of that file? I know this may be an issue for temp buffers that aren't tied to a file, but let's forget about that case for now.

Then you simply check, each time you send a message to the LLM, whether the last-modified timestamp of the file has changed. If so, grab the new content, compare/diff the changes, and send them.

Thoughts?
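
A minimal sketch of what that check could look like (names are illustrative; `vim.uv.fs_stat()` exposes the file's mtime):

```lua
-- Illustrative sketch of the timestamp idea: compare the file's current
-- mtime against the one recorded when the buffer was added to the chat.
local function file_changed_since(path, last_mtime)
  local stat = vim.uv.fs_stat(path)
  if not stat then
    return false -- no backing file (e.g. a temp buffer), as noted above
  end
  return stat.mtime.sec > last_mtime.sec
    or (stat.mtime.sec == last_mtime.sec and stat.mtime.nsec > last_mtime.nsec)
end
```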


```lua
local allowed_pins = {
  "<buf>",
  "<file>",
}

local allowed_watches = {
```
@olimorris (Owner)

Think we should rename this to `allowed_watchers`.

@bassamsdata (Contributor, Author)

done

```diff
@@ -8,12 +8,17 @@ local config = require("codecompanion.config")
 local api = vim.api
 local user_role = config.strategies.chat.roles.user
 local pinned_icon = config.display.chat.icons.pinned_buffer
+local watched_icon = config.display.chat.icons.watched_buffer
```
@olimorris (Owner)

Let's add these icons into a table; then we can iterate over them to remove them on line 253.
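
Something along these lines, presumably (the icon names are taken from the diff above; the stripping logic is assumed):

```lua
-- Sketch of the suggested refactor: collect the icons in one table so
-- they can be stripped in a single loop (exact usage is assumed).
local icons = {
  pinned = config.display.chat.icons.pinned_buffer,
  watched = config.display.chat.icons.watched_buffer,
}

local function strip_icons(line)
  for _, icon in pairs(icons) do
    line = line:gsub(vim.pesc(icon), "")
  end
  return line
end
```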

@bassamsdata (Contributor, Author)

done

@olimorris (Owner)

Just played around with gpt-4o-mini and it works perfectly. My playbook conversation:

```
user: #buffer What does this file do?
llm: This Lua file defines three functions: ...

user: [watches buffer] I've just changed the file. Can you tell what I've done?
llm: You have deleted the `hello_oli` function

user: @editor can you add it back?
llm: [Runs editor tool correctly]

user: Now what have I done?
llm: You have added a new function

user: Can you share with me what you think the file looks like now?
llm: [Shares correct output]

user: @editor can we change hello_oliver to hello_oli?
llm: [Does so correctly]
```

@bassamsdata (Contributor, Author)

> Then you simply check each time you send a message to LLM to see if the last modified timestamp of the file has changed. If so, grab the new content, compare / diff the changes and send those changes.

@GitMurf, this final solution basically does something similar in spirit: it maintains a state of the buffer, recording the current content and the last-sent content, and compares the two states. oil.nvim also does something similar, albeit far more sophisticated: it checks for changes via the b:modified variable, saves the initial buffer into a cache mechanism, and then, when saving, locks the buffer to prevent any additional changes.

@bassamsdata (Contributor, Author)

@olimorris, if there are no further comments, I believe this is ready to merge. Regarding the documentation, I was working with an older version of the README, so I wasn't fully aware of the updated structure and the docs site. Congratulations on that, by the way; it's fantastic!

As for potential future improvements that could be addressed in separate PRs:

  1. I've already written docs for this feature; do you want them on the website, as a PR against the usage/chat buffer section?
  2. We should provide an example to explain how changes will be sent to the LLM when the watcher is activated.
  3. I'm planning to implement a change limit: if the user modifies more than 70-80% of the buffer, it's more efficient to send the updated buffer content instead of the changes (see the sketch below). This is nearly ready; it just needs some final cleanup.
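
A sketch of what that limit could look like (the threshold value and names are illustrative, not from the PR):

```lua
-- Illustrative change limit: if most of the buffer changed, send the whole
-- buffer rather than a long list of individual changes.
local CHANGE_RATIO_LIMIT = 0.75 -- assumed value in the 70-80% range

local function should_send_full_buffer(changed_line_count, total_line_count)
  return total_line_count > 0
    and (changed_line_count / total_line_count) > CHANGE_RATIO_LIMIT
end
```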
