chore(chat): Refactor prompt hierarchy #834

nirinchev · 2024-09-24T23:42:46Z

Description

This is a bit controversial, so putting it as a draft to get some feedback on the direction. It's preparation work for VSCODE-606 - it unifies all prompts under a common base class, which I'm planning to then extend with capability for counting user/participant message lengths as well as a flag for whether sample docs have been included in the prompt (per TD. While there are less disruptive ways to achieve this, this one felt the cleanest in terms of separation of concerns and extensibility.

It should also set us up for VSCODE-614 as abstracting the base functionality in a common place would allow us to count the tokens and drop history messages that push us past the model token limit.

Checklist

New tests and/or benchmarks are included
Documentation is changed or added
I have signed the MongoDB Contributor License Agreement (https://www.mongodb.com/legal/contributor-agreement)

Motivation and Context

Bugfix
New feature
Dependency update
Misc

Open Questions

Dependents

Types of changes

Backport Needed
Patch (non-breaking change which fixes an issue)
Minor (non-breaking change which adds functionality)
Major (fix or feature that would cause existing functionality to change)

nirinchev · 2024-09-24T23:48:05Z

src/participant/constants.ts

+interface DocsRequestMetadata {
+  intent: 'docs';
+  chatId: string;
+  docsChatbotMessageId?: string;
+}


This is a drive-by - I wanted to more strongly confine the metadata types to ensure we don't have docsChatbotMessageId unless the intent is docs.

nirinchev · 2024-09-25T00:00:35Z

src/participant/prompts/namespace.ts

  }

-  static buildMessages({
+  async buildMessages({


This one is the only one I couldn't map super cleanly and had to override the base class implementation. I have some ideas for how to restructure the abstraction to support these more complex history/prompt rewrites, but those would be more complicated than the current implementation and didn't feel it's justified to pursue until we have more than one use case.

We'll revisit this function and also how the other prompt message building functions handle namespace and connect messages/prompts in:
https://jira.mongodb.org/browse/VSCODE-611
I think the other prompts would benefit from a similar prompt change when the last message was selecting a namespace.

Agreed - I was following up on this PR with telemetry stuff and noticed history scraping could be applied to all messages, not just namespace.

Anemy

lgtm, I like the refactor to get the types more aligned. two some suggestions, I'd like if we can avoid the argument mutation one, not blockers.

src/participant/prompts/namespace.ts

src/participant/prompts/promptBase.ts

Anemy · 2024-09-25T17:49:30Z

src/participant/prompts/namespace.ts

  }

-  static buildMessages({
+  async buildMessages({


We'll revisit this function and also how the other prompt message building functions handle namespace and connect messages/prompts in:
https://jira.mongodb.org/browse/VSCODE-611
I think the other prompts would benefit from a similar prompt change when the last message was selecting a namespace.

Anemy · 2024-09-25T18:10:47Z

src/participant/prompts/schema.ts

 The schema is generated from a sample of documents in the user's collection.
-You must follows these rules.
+You must follow these rules.


Anemy · 2024-09-25T19:55:12Z

src/participant/sampleDocuments.ts

@@ -59,11 +58,14 @@ export async function getStringifiedSampleDocuments({
  }

  const stringifiedDocuments = toJSString(additionToPrompt);
-  promptInputTokens = await model.countTokens(prompt + stringifiedDocuments);
+
+  // TODO: model.countTokens will sometimes return undefined - at least in tests. We should investigate why.


This was something Alena ran into, The model returned by vscode.lm.selectChatModels is always undefined in tests. So there's no model to count the tokens.

Anemy

lgtm! Nice update to share the connect message prompt from the rest of the prompts.

nirinchev added 2 commits September 25, 2024 01:31

Refactor prompt hierarchy

1a02ebb

reformat

c11f4d8

nirinchev commented Sep 25, 2024

View reviewed changes

nirinchev requested review from Anemy and alenakhineika September 25, 2024 00:03

Fix build

955c202

nirinchev marked this pull request as ready for review September 25, 2024 17:13

Anemy approved these changes Sep 25, 2024

View reviewed changes

Anemy reviewed Sep 25, 2024

View reviewed changes

Address feedback, remove connect messages from all prompt builders

87a0112

Anemy reviewed Sep 25, 2024

View reviewed changes

Anemy approved these changes Sep 25, 2024

View reviewed changes

nirinchev added 2 commits September 26, 2024 01:44

Add tests and fix historyMessages not removing the pre-connection prompt

9dedc3b

Remove .only

8f7e6c1

nirinchev merged commit 08fac17 into VSCODE-528-mongodb-copilot Sep 26, 2024
3 checks passed

nirinchev deleted the ni/prompt-refactor branch September 26, 2024 00:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore(chat): Refactor prompt hierarchy #834

chore(chat): Refactor prompt hierarchy #834

nirinchev commented Sep 24, 2024 •

edited

Loading

nirinchev Sep 24, 2024

nirinchev Sep 25, 2024

Anemy Sep 25, 2024

nirinchev Sep 25, 2024

Anemy left a comment •

edited

Loading

Anemy Sep 25, 2024

Anemy Sep 25, 2024

Anemy Sep 25, 2024

Anemy left a comment

chore(chat): Refactor prompt hierarchy #834

chore(chat): Refactor prompt hierarchy #834

Conversation

nirinchev commented Sep 24, 2024 • edited Loading

Description

Checklist

Motivation and Context

Open Questions

Dependents

Types of changes

nirinchev Sep 24, 2024

Choose a reason for hiding this comment

nirinchev Sep 25, 2024

Choose a reason for hiding this comment

Anemy Sep 25, 2024

Choose a reason for hiding this comment

nirinchev Sep 25, 2024

Choose a reason for hiding this comment

Anemy left a comment • edited Loading

Choose a reason for hiding this comment

Anemy Sep 25, 2024

Choose a reason for hiding this comment

Anemy Sep 25, 2024

Choose a reason for hiding this comment

Anemy Sep 25, 2024

Choose a reason for hiding this comment

Anemy left a comment

Choose a reason for hiding this comment

nirinchev commented Sep 24, 2024 •

edited

Loading

Anemy left a comment •

edited

Loading