FilterHN

Matchlock – Secures AI agent workloads with a Linux-based sandbox

148 points

by jingkai_he

1 month ago

| 18 comments

| github.com

▲

DanMcInerney

1 month ago

Sandboxing is a great security step for agents, just like guardrails are. I can't help but feel like it's all soft defense, though. The real danger comes from the agent being able to read third-party data, get prompt-injected, and then change or exfiltrate sensitive data. A sandbox does not prevent an email-reading agent from reading a malicious email, getting prompt-injected, and then sending the contents of your inbox to a malicious address. It does help in implementing network-layer controls, though, such as applying a policy that says this Linux-based sandbox is only allowed to visit [whitelisted] URLs. This kind of architectural whitelisting is the only hard defense we have for agents at the moment. Unfortunately, it will also hamper their utility if used to the greatest extent possible.
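As a rough illustration of the URL allowlisting described above, here is a minimal Go sketch of a deny-by-default egress check. The function names (`hostAllowed`, `checkEgress`) and the example domains are hypothetical and are not taken from matchlock or any other tool in this thread.

```go
package main

import (
	"fmt"
	"net/url"
	"strings"
)

// hostAllowed reports whether host is an allowlisted domain or a subdomain of one.
// Matching on a leading dot avoids accepting look-alikes such as "github.com.evil.net".
func hostAllowed(host string, allowlist []string) bool {
	host = strings.ToLower(strings.TrimSuffix(host, "."))
	for _, d := range allowlist {
		d = strings.ToLower(d)
		if host == d || strings.HasSuffix(host, "."+d) {
			return true
		}
	}
	return false
}

// checkEgress parses rawURL and returns an error unless its host is allowlisted.
// Anything that fails to parse or has no host is denied.
func checkEgress(rawURL string, allowlist []string) error {
	u, err := url.Parse(rawURL)
	if err != nil {
		return fmt.Errorf("parse %q: %w", rawURL, err)
	}
	if u.Hostname() == "" || !hostAllowed(u.Hostname(), allowlist) {
		return fmt.Errorf("egress to %q denied", u.Hostname())
	}
	return nil
}

func main() {
	// Hypothetical allowlist for illustration only.
	allow := []string{"api.example.com", "github.com"}
	targets := []string{
		"https://api.example.com/v1/messages",
		"https://github.com.evil.net/upload",
	}
	for _, t := range targets {
		if err := checkEgress(t, allow); err != nil {
			fmt.Println("blocked:", t, "-", err)
		} else {
			fmt.Println("allowed:", t)
		}
	}
}
```

A check like this only limits where traffic can go. It does not tell a legitimate request to an allowed host from a malicious one, which is the gap the comment points out.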

▲

jingkai_he

1 month ago

Creator here.

Agreed, sandboxing by itself doesn't solve prompt injection. If the agent can read and send emails, no sandbox can tell a legit send from an exfiltration.

matchlock does have the network-layer controls you mentioned, such as domain whitelisting and secret protection toward designated hosts, so a rogue agent can't just POST your API key to some random endpoint.

The unsafe tool call/HTTP request problem probably needs to be solved at a different layer, possibly through the network interception layer of matchlock or entirely different software.

▲

yencabulator

1 month ago

Huh. You're converting FUSE requests into your own custom protocol (with copy-pasted protocol definition) over vsock. Interesting. Not sure I'd trust it with my data[0], but interesting.

I don't think the current filepath.Join in realfs.go protects the host against a malicious guest, at all. I'm assuming this is configured as Guest --FUSE--> guest-fused (inside VM) --VSOCK--> realfs.

(The Firecracker people have explicitly refused to have virtio-fs, to keep it minimal: https://github.com/firecracker-microvm/firecracker/pull/1351...)

https://github.com/jingkaihe/matchlock/blob/123a4df680fb8cc0...

[0]: Well, I already know I won't trust hanwen/go-fuse with my data, so that part is a bit moot.

▲

ajb

1 month ago

We definitely need a vendor-independent tool like this. Have been reviewing the Claude setup and, despite initially being hopeful since it uses bubblewrap, it's quite problematic:

* The definitions of security config in the documentation of settings.json are unclear. Since it's not open source, you can't check the ground truth.

* The built-in constructs are insufficient to do fully whitelist-based access control (it might be possible with a custom hook).

* Security-related issues go unanswered in the repo, and are automatically closed.

Haven't looked into Copilot as much, but it didn't look great either. Seems like the vendors don't have the incentives to do this properly.

So I'm on the lookout for a better way, and matchlock seems like a contender.

▲

CuriouslyC

1 month ago

There are a lot of options in this space. Armin Ronacher is working on Gondolin (https://github.com/earendil-works/gondolin) for example. I built agentd as a layer in front of this stuff so you can expose secure shell capabilities over the network as a tool rather than baking it into the harness, or running the harness in that environment.

▲

arianvanp

1 month ago

The Claude sandbox is practically useless IMO. It gives read access to everything by default, so it's not deny-by-default.

▲

ushakov

1 month ago

Very cool. If you want cross-platform microVMs, there's an interesting project called libkrun that powers projects like Podman and Colima.

here's a Go binding: https://github.com/mishushakov/libkrun-go

demo (on Mac): https://x.com/mishushakov/status/2020236380572643720

▲

codethief

1 month ago

Since when does libkrun power Podman? Last time I checked, Podman used non-virtualized containers based on `crun`.

▲

codethief

1 month ago

(Though you can certainly configure Podman to use krun[0], which fires up a libkrun VM inside a crun container.)

[0]: https://github.com/containers/crun/blob/main/krun.1

▲

tyfnll

1 month ago

OP may be referring to `podman machine` on macOS, which gives access to containers through a Linux VM via libkrun.

▲

raphinou

1 month ago

I've been happily using a container to run my agents [1]. I tried to evolve it with more advanced features, but it quickly became harder to use, so I went back to a basic container that I just start with a run.sh script. Is a similarly simple setup possible with matchlock?

[1]: https://github.com/asfaload/agents_container

▲

0x696C6961

1 month ago

I use a very similar setup. I initially used nix to manage dev tools, but have since switched to mise and can't recommend it enough: https://mise.jdx.dev/

▲

pmarreck

1 month ago

Does mise use nix underneath, or did you abandon nix entirely?

▲

rsyring

1 month ago

Mise doesn't use nix. I think the OP is stating he replaced nix with mise.

▲

pmarreck

1 month ago

Yeah, I'm just confused why someone would go from a completely deterministic dependency management system back to a dice-rolling one, especially now that LLMs exist and all the top-tier ones are excellent at the Nix language.

I myself am never going to switch to anything else again, unless it's a derivative of the same idea, because it's the only one that makes sense.

▲

cjbarber

1 month ago
