Complete "computer use" support #216

ErikBjare · 2024-10-22T21:48:08Z

Since Anthropic just announced their computer use stuff (see #50 (comment)), we should just finish ours as we already have the screenshots.

We can take screenshots, we just need to enable acting on them by clicking or making input.

To not burn tons of tokens we should probably put it in a loop where it doesn't stack tons view/interact steps, maybe use a looping subagent or some kind of context-efficient tool-use loop until goal is achieved (we might need some stuff like this generally for automation goals). Should study how they do it.

They run it in a Docker container and stream it in a webapp using VNC. We should make it possible to do it this way with gptme, but I think gptme should be able to control the local system first, and a Docker system second.

Milestones

Can it Tweet?
Can it play Factorio? (prob joke tweet: https://x.com/aphysicist/status/1848802806729228782)
Can it play Doom? (latency if we don't want to suck)
https://manifold.markets/singer/will-ai-automate-guis-by-end-of-202

ErikBjare · 2024-11-01T13:12:29Z

Remaining tasks after merging #225

figure out what is causing the delays
- I seem to have fixed some of the delay by removing a speed limit argument that I accidentally added? Maybe?
- Still delays when starting new terminal windows
- I don't remember seeing these delays on my macOS machine
share the original conversation where gptme wrote most of it as an example
- Add a way to share conversations #32

ErikBjare added the enhancement New feature or request label Oct 22, 2024

ErikBjare mentioned this issue Oct 24, 2024

feat: implement anthropic-style computer tool #225

Merged

8 tasks

ErikBjare closed this as completed in #225 Nov 1, 2024

ErikBjare reopened this Nov 1, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Complete "computer use" support #216

Complete "computer use" support #216

ErikBjare commented Oct 22, 2024 •

edited

Loading

ErikBjare commented Nov 1, 2024

Complete "computer use" support #216

Complete "computer use" support #216

Comments

ErikBjare commented Oct 22, 2024 • edited Loading

Milestones

ErikBjare commented Nov 1, 2024

ErikBjare commented Oct 22, 2024 •

edited

Loading