"The LLM outputs compact code (~50 tokens) instead of raw Excalidraw JSON (~5,000 tokens)."
I see on the left that the LLM is outputting some instructions to add nodes and edges to the diagram. But what is interpreting those commands and turning them into an Excalidraw file?
Just like you wouldn't use the same table in your system for all users in a multi-tenant application.
Okay, I did so. I realize from your later follow-up comment that you might want something different (like having Chrome itself cache these downloads), but for now I made what you asked for. Here you go:
https://stateofutopia.com/experiments/ephemeralcdn/
It's an ultrafast temporary CDN for one-off experiments like this. Should be lightning fast. By including the script, you can include any file this CDN serves.
Or indeed use some sort of Hugging Face model downloader (if one exists with Xet support).