Show HN: Local-First Linux MicroVMs for macOS
213 points
1 month ago
| 22 comments
| shuru.run
| HN
Shuru is a lightweight sandbox that spins up Linux VMs on macOS using Apple's Virtualization.framework. Boots in about a second on Apple Silicon, and everything is ephemeral by default. There's a checkpoint system for when you do want to persist state, and sandboxes run without network access unless you explicitly allow it. Single Rust binary, no dependencies. Built it for sandboxing AI agent code execution, but it works well for anything where you need a disposable Linux environment.
omrimaya
27 days ago
[-]
The design decision I find most interesting here is ephemeral-by-default with opt-in checkpointing, that inversion of the usual "persist everything, clean up manually" model fits agent code execution well. Most sandboxing approaches I've seen treat isolation as the hard problem, but state leakage across runs is the subtler foot-gun when you're executing LLM-generated code repeatedly.

One thing I ran into building agent infrastructure: the boundary between "sandbox that runs code" and "agent that decides what code to run" wants to be a clean HTTP interface, not a library call. Makes it easier to audit what crossed the boundary. Does Shuru expose any hook for streaming stdout back to the caller during execution, or is it strictly "wait for exit, get result"?

reply
srinath693
1 month ago
[-]
The value here isn't 'local VMs'. it's that the defaults are inverted. Everything else defaults to persistent and networked. This defaults to ephemeral and isolated. Small shift, but matters when you don't trust the code that's about to run.
reply
Xlab
1 month ago
[-]
I will steal this to make a local-first version of https://microterm.dev for macOS :)

My idea is to have unified environment across all targets, so the only thing that changes is speed and amount of RAM.

reply
scosman
1 month ago
[-]
How is this running the vm/container? Cloud or something like container2wasm?

Kinda cool I’m on my phone, on an alpine terminal, and genuinely need to ask if it’s running in the browser.

reply
scosman
1 month ago
[-]
checked from desktop: WASM container!
reply
chrisweekly
1 month ago
[-]
iOS Safari stuck in a redirect loop (loading... indicator reaches 90% then hard refresh, repeat till error message)
reply
todotask2
1 month ago
[-]
On on iPhone 13 Pro with iOS Safari 26.3, loading fine.
reply
chrisweekly
1 month ago
[-]
Mine's also a 13pro, iOS 18.7.3's Safari, still getting same error "A problem occurred repeatedly on microterm.dev"
reply
harshdoesdev
1 month ago
[-]
cool, would love to see it!
reply
josephg
1 month ago
[-]
What does local first mean in this context? Does it just mean local? Like, the software runs locally?
reply
harshdoesdev
1 month ago
[-]
yeah, it just means everything runs on your machine. there are services like E2B, sprites.dev and others that give you sandboxes in the cloud. shuru runs VMs locally using Apple's Virtualization.framework, so nothing leaves your Mac.
reply
fulafel
1 month ago
[-]
Seems it only support macOS so for practical purpouses it's local-only.
reply
userbinator
1 month ago
[-]
Unfortunately yes. It's just another stupid marketing buzzword these days.
reply
Xlab
1 month ago
[-]
it's the other way around, everything is in the cloud now (upload your files to us, we are privacy respecting, bla bla)

So it's good that the product actually highlights it is dealing with local hardware only.

reply
josephg
1 month ago
[-]
Yes, but we have a perfectly serviceable term for local software already: "local software".

To me, "local-first software" means something slightly different. The term was coined by this essay[1], which says:

> Local-first ideals include the ability to work offline and collaborate across multiple devices

> This means that while local-first apps keep their data in local storage on each device, it is also necessary for that data to be synchronized across all of the devices on which a user does their work.

But this is clearly not what's going on here. This project is just local software, like we've had forever.

If a fancy new "local first" buzzword makes local-only software seem more sexy, then I suppose I don't want to get too mad about it. I really like local software. But the autist in me likes it when technical terms have a well defined meaning.

[1]: https://www.inkandswitch.com/essay/local-first/

reply
userbinator
1 month ago
[-]
I don't expect "Linux MicroVMs for macOS" to have anything to do with clouds.
reply
allthetime
1 month ago
[-]
"Local First" implies that something is second.
reply
winstonwinston
1 month ago
[-]
To me it implies it is designed to be used local, instead of an afterthought feature.
reply
josephg
1 month ago
[-]
Thats like describing a mobile only game as "mobile first", even when the developers have no plans to ever port it to other platforms. Its a mobile game.

The grep command line utility isn't local first. Its just local software.

reply
7777777phil
1 month ago
[-]
The agent stack is splitting into specialized layers and sandboxing is clearly becoming its own thing. Shuru, E2B, Modal, Firecracker wrappers.

Earlier this month I wrote about how these layers have very different defensibility profiles and why going monolithic is the wrong call: https://philippdubach.com/posts/dont-go-monolithic-the-agent...

EDIT: Spelling

reply
camkego
1 month ago
[-]
It's a good article and seems to mirror my experience doing partial-AI software development. If you are not saving your context for decision making and your conclusions in software architecture (as made between developers and AI) you are losing very valuable context information on software design. Although I'm not sure the article ties closely to the topic of micro VMs.
reply
runako
1 month ago
[-]
How does this compare to Apple container[1]?

I am excited by the innovation happening in the space!

1 - https://github.com/apple/container

reply
harshdoesdev
1 month ago
[-]
apple container is more of a docker-style workflow, OCI images, registries, etc. shuru is just micro VMs with checkpointing, much simpler scope.
reply
jclay
1 month ago
[-]
Has anyone tackled this for Windows? WSL isn’t ideal when shipping a consumer app to a non-developer target audience since it requires some setup.
reply
xrd
1 month ago
[-]
What is the benefit of this over lima, for example?
reply
harshdoesdev
1 month ago
[-]
Lima can do a lot of what shuru does if you set it up for it. the difference is mostly in defaults and how much you have to configure upfront. with shuru you get ephemeral VMs, no networking, and a clean rootfs on every run without touching a config file. shuru run and you're in. Checkpoints and branching are built into the CLI rather than being an experimental feature you have to figure out. Lima is a much bigger and more mature project though. Shuru is something I am building partly to learn and partly because I wanted something with saner defaults for this specific use case.
reply
enneff
1 month ago
[-]
Thanks for doing this. I had basically the same experience with Lima. It is very nice but the defaults are not what I want, and I don't like having to wonder whether I turned off the stuff that I don't want enabled. Better that everything is disabled by default and I selectively turn things on (like networking) as I need them.

I'm gonna give shuru a try. My main concern is being based on Alpine (seemingly the only option?) I may not be able to easily pull in the dependencies for the projects I'm working on, but I'll see how it goes.

reply
harshdoesdev
1 month ago
[-]
glad to hear it, that's exactly the thinking behind it. alpine is the only option right now yeah. what kind of dependencies are you running into issues with? would help me figure out what to prioritize next.
reply
enneff
1 month ago
[-]
I haven't yet - just generally I have found it a bit of a hassle to figure out which packages to install whenever I use a different distro. I'll let you know how it goes!
reply
halostatue
1 month ago
[-]
Disclaimer: I haven't tried this yet.

I would want the equivalent of the trixie-slim Docker image (Debian 13, no documentation). It's ~46 Mb instead of ~4Mb as a Docker image, but gives a reasonably familiar interface.

(This is largely based on some odd experiences with Elixir on Alpine, which is where I am doing most of my work these days.)

reply
BrandiATMuhkuh
1 month ago
[-]
Very cool. Was looking for something like this for a new project of mine. (I'm working on a project that is like a marriage of retool+OpenClaw. It's used by SME to quickly build inhouse apps)
reply
scosman
1 month ago
[-]
This looks amazing. I’ve been wanting virtualization.framework micro VMs for months! Docker is fine, but the overhead isn’t ideal.

I like the defaults (ephemeral, network off). Any thoughts on adding host-mapped directories?

I have a MCP server for ephemeral sandboxes that supports various backends (Docker, E2B, Modal, even WASM). I’ll look at adding this. https://github.com/Kiln-AI/Kilntainers

reply
praveenhm
1 month ago
[-]
How does it compare to Lume. It uses Apple's native Virtualization Framework to run macOS and Linux VMs at near-native speed on Apple Silicon.
reply
harshdoesdev
1 month ago
[-]
lume is a much more full featured VM manager, macOS and Linux VMs, API server, prebuilt images, python SDK etc. shuru is intentionally minimal.
reply
rishabhaiover
1 month ago
[-]
I've noticed claude forks parallel agents on an assigned task. How would they communicate in isolated sandboxes like these? Would it be cleaner and more effective for a harness to orchestrate swarms of agents in a single clean linux environment like OrbStack?
reply
harshdoesdev
1 month ago
[-]
haven't thought about multi-agent communication yet. each sandbox is fully isolated which is the point. checkpoints help a bit here though, you can branch multiple agents from the same checkpoint so they all start from the same state.
reply
rishabhaiover
1 month ago
[-]
I think I made a cursory and incorrect assumption. Given this is backed by Apple's Virtualization, it has POSIX compliance and forks/execs are allowed within the sandbox which can support agent parallelization within a sandbox I believe.

Looks like a great project at surface!

reply
user3939382
1 month ago
[-]
macos also has a native sandbox-exec that can be used to isolate processes with a policy you like.

https://igorstechnoclub.com/sandbox-exec/

reply
sedawkgrep
1 month ago
[-]
Wow I never knew this existed. Thanks for pointing this out!
reply
abhikul0
1 month ago
[-]
reply
sedawkgrep
1 month ago
[-]
Wow I don't know how I missed that...
reply
Cyphase
1 month ago
[-]
Shuru looks cool! I've been working on something with a similar vibe, for Linux hosts first; MicroVMs, default offline, etc. Not ready to release, but I'm starting to dogfood it.
reply
apatheticonion
1 month ago
[-]
How are we going for gpu acceleration in Linux VMs on MacOS?
reply
alexellisuk
1 month ago
[-]
AFAIK that's not possible at the moment. Apple limits the full GPU acceleration for macOS guests.
reply
apatheticonion
1 month ago
[-]
Classic Apple
reply
steve1977
1 month ago
[-]
Let's not call this local-first please. Especially since there is no wherever- else-second. The term first makes no sense here.
reply
JSR_FDED
1 month ago
[-]
This looks awesome. How would you recommend setting up an allowlist for external network communications (for cases where networking is enabled)?
reply
raihansaputra
1 month ago
[-]
man this is cool. this is what i want since i read about fly.io's sprites.

slightly related to this, and i'm not familiar with linux sandboxing/containerization in depth, but any similar turn key solution for linux desktop/server? ideally i have something like sprites/shuru but on my own linux/ubuntu server instead of in the cloud.

reply
alexellisuk
1 month ago
[-]
We built SlicerVM for this in 2022, but not just for sandboxing. It's for servers + API launched VMs (i.e. what we now like to call a 'sandbox'). Feel free to take a look, a lot of our early users are saying things like this.
reply
servercobra
1 month ago
[-]
Same, I really want to use sprites (for me and my whole team) but every time I try to set up, I run into weird issues. Last time I got in some state where trying to launch Claude froze the whole VM every time.
reply
harshdoesdev
1 month ago
[-]
glad you liked it! I am currently exploring options for Linux support. will share an update soon.
reply
Krisso
1 month ago
[-]
Why was using straigt containers not enough?
reply
harshdoesdev
1 month ago
[-]
containers work fine for a lot of this. shuru is just what felt more natural to me. less config overhead and i wanted to learn by building it.
reply
wickrom
1 month ago
[-]
What would you say are a natural use case for Shuru or Lima vs say a full docker like environment? What does the sandbox allow you to do differently? There's additional overhead of the hypervisor. I'm mostly just trying to learn, as yourself.
reply
nsonha
1 month ago
[-]
installing new tools inside container requires you to update the Dockerfile and rebuild, here it seems you can simply run the installation command and create a checkpoint
reply
fulafel
1 month ago
[-]
You can do this with Docker too without Dockerfile or rebuilding. You can treat the container as mutable and just start/stop it, doing changes manually, and make snapshots with docker commit.

You'll forfeit the benefits of reproducible scripted environment of course but Docker does let you do it.

reply
tobyhinloopen
1 month ago
[-]
Neat! I was looking for something like this
reply
harshdoesdev
1 month ago
[-]
thanks! let me know how it goes
reply
akashkahlon
1 month ago
[-]
this is awesome!!
reply
conradev
1 month ago
[-]
Use OrbStack. It’s faster than Virtualization.framework because it has its own hypervisor.
reply
noname120
1 month ago
[-]
Not true, OrbStack uses Virtualization.framework: https://news.ycombinator.com/item?id=36189550
reply
nsonha
1 month ago
[-]
Yes but they also use a custom linux kernel to achieve better performance than plain vz. I'm not technical enough to tell if it's bs, but it boots subsecond
reply
saagarjha
1 month ago
[-]
I don't think they use Virtualization for most launches now
reply
noname120
1 month ago
[-]
I think you are right, I’m not sure that they actually use it for stuff that doesn’t require Rosetta (arm64 stuff)
reply
harshdoesdev
1 month ago
[-]
OrbStack is great but it is solving a different problem. it's a full Docker Desktop replacement. shuru is just a thin layer over Virtualization.framework for spinning up throwaway sandboxes.
reply
JoshTriplett
1 month ago
[-]
OrbStack has some invasive elements inside it trying to provide filesystem integration, and the filesystem they use is not POSIX compliant and causes breakage with some build systems and other software.
reply