support for unified diff or git diff format

Question

support for unified diff or git diff format

unicomp21 opened this issue a year ago · 13 comments

unicomp21 commented a year ago

otherwise we burn large token counts when doing large refactors?

unicomp21 commented a year ago

yes

👍1

Answer 1 · 2023-07-30T21:26:25.000Z

Hi @unicomp21, if I understand correctly you want an alternative to the "WriteFile" operation that is not writing the whole file every time, but just applies the edits in some diff format?

Answer 2 · 2023-07-30T21:40:49.000Z

I'm kinda wondering if "language server protocol" might be useful here as well? (ie deno lsp, typescript lsp, might also help drive down token counts during refactor operations)

Answer 3 · 2023-07-30T22:21:17.000Z

I understand, maybe there is something there. We have been actually running language servers in these envs in the past, but for other reasons.

Answer 4 · 2023-07-31T12:44:33.000Z

Looking at unified diff in openai evals, looks like we need a different/simpler approach for handling code deltas? I wonder if gpt-4 could supply simple search/replace string? along w/ target filename?

Answer 5 · 2023-07-31T13:06:35.000Z

You can probably start experimenting with it right now - tell it to use RunProcess to modify files by executing commands that apply the diffs. Then we can see if it can handle it somehow.

Answer 6 · 2023-07-31T13:28:22.000Z

Will do, I just applied for developer access to chat plugins, I'm on the waitlist.

I've been experimenting w/ this problem of eliminating cut/paste for a while. Here's an old eval of mine.
openai/evals#771

Starting to think sed w/ search/replace might be the simplest/reliable way for handling deltas.

using the new "custom instructions", this seems to work nicely, gpt-4 appears to be comfortable base64 encoding the deltas.

{
"deltas": [
{
"type": "code_diff",
"search_string": "cHJpbnQoIkhlbGxvLCBXb3JsZCEiKQ==",
"replace_string": "cHJpbnQoIkhlbGxvLCBXb3JsZCEiKQoKcHJpbnQoIkdvb2RieWUhIik=",
"file_path": "main.py"
}
]
}

Answer 7 · 2023-07-31T13:28:49.000Z

Lol, I look forward to a setup w/ zero cut/paste. I'm lazy !!!

Answer 8 · 2023-08-01T12:33:50.000Z

@ValentaTomas how long of a wait does it take on the plugins developer waitlist?

Answer 9 · 2023-08-01T12:47:35.000Z

I don't really know.

We actually don't have and developer access to plugins and developed this with the help of the community by remotely debugging it.

Answer 10 · 2023-08-01T12:50:53.000Z

Not sure I understand, how would I do that?

…

On Tue, Aug 1, 2023 at 7:47 AM Tomas Valenta ***@***.***> wrote: I don't really know. We actually don't have and developer access to plugins and developed this with the help of the community by remotely debugging it. — Reply to this email directly, view it on GitHub <#1 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAEFL7LOWT6GPIRHIDWR2CTXTD3HFANCNFSM6AAAAAA25NXHUE> . You are receiving this because you were mentioned.Message ID: ***@***.***>

Answer 11 · 2023-08-01T12:52:25.000Z

We just talked with some folks and they gave us remote control access to their computer and we were testing the extension that way.

Answer 12 · 2023-08-05T22:36:59.000Z

Looking at unified diff in openai evals, looks like we need a different/simpler approach for handling code deltas? I wonder if gpt-4 could supply simple search/replace string? along w/ target filename?

This seems to be the way cursor.so does it