Silurus/ooxml: Pixel-faithful Office documents, rendered in the browser
35 points
2 hours ago
| 4 comments
| github.com
| HN
jbgt
19 minutes ago
[-]
Interesting because I'm building ooxml-cli right now, for editing pptx, docx, xlsx. At work I had to adapt a pptx to a corporate template and tried via agent. It kept failing so I started building and then it was able to relatively quick and accurate do what I needed. Then I needed it to make tables, add pictures. Recently wanted to get data from an xslx and replace text in a presentation etc.

So the tool is growing and maybe this would be interesting to have as the non LibreOffice dependent viewer...

reply
vlmutolo
22 minutes ago
[-]
Pretty cool, rendering PowerPoint files to an image is probably the only way for LLMs to make sense of them.

Does this work in Cloudflare’s workerd environment? Would be nice to have a cheap serverless render -> LLM (GLM-OCR / PaddleOCR) -> Markdown pipeline for the various MS Office formats.

reply
sublinear
52 minutes ago
[-]
> office-open-xml-viewer
reply
phplovesong
41 minutes ago
[-]
Its kind of sad that the first thing in the repo is a mention that no human was involved in the programming.
reply
llukas
29 minutes ago
[-]
Would author be able to do it otherwise? Is particular tool choice making result worse?

Bit identical/pixel-faithful reproductions are easy to verify…

reply
jstanley
22 minutes ago
[-]
"LLMs are amazing, I'm so much more productive now"

"oh yeah? Show me what you made, you can't, nobody can, it's all just AI psychosis"

"I made a pixel perfect Office document viewer"

"well... I wish you hadn't"

reply
isubkhankulov
31 minutes ago
[-]
Would this project exist otherwise? i doubt it
reply
cyanydeez
31 minutes ago
[-]
which means it probably gets all the halucinated assets correctly and any real world documents wrong.

Still, looks pretty; if it actually has proper testing, could close the gap. Code not being the hard part is a major impediment to good software coming out of these things.

reply