We were excited to get our hands on it, but quickly realized there was still quite a bit of set-up required to actually spin up a VM and have the model do things. So we wanted to put together an easy way to deploy these OpenAI computer use VMs in an SDK format and open source it
Hopefully this tooling is helpful to other folks building AI agents! Here’s a link to the repo (https://github.com/aditya-nadkarni/spongecake) - please try it out and give us a star. If you have any feedback, add it as a comment to this post! Or if you simply just love spongecake, show support for the delicious treat
Also how do you guys think about multi agent workflows? i.e. having a couple agents take actions in parallel. Wondering if its possible to have two share a vm.
In terms of multi agent workflows - it's something we've been thinking about! We especially think this could be especially helpful when filling out a form to speed things up even more. It's hard for me to think of other use cases though where multiple agents might need to share a vm (as opposed to just spinning up another vm with another agent), but curious to hear your thoughts!
Filling out forms is I think one of the biggest use cases for computer use agents. We're working on some stuff to specifically make that use case faster and more accurate
Regarding monitoring a specific inbox, and auto-responding, you could also set-up an agent to do that, but I would advise having the agent automate more formulaic/simple emails rather than having it use too much of its own judgement when writing the copy for these emails. You can also add in checks for the agent to check in before you before hitting send