FilterHN

stephantul

2 months ago

Amazing post, I didn’t think this through a lot, but since you are normalizing the vectors and calculating the euclidean distance, you will get the same results using a simple matmul, because euclidean distance over normalized vectors is a linear transform of the cosine distance.

Since you are just interested in the ranking, not the actual distance, you could also consider skipping the sqrt. This gives the same ranking, but will be a little faster.

qingcharles

2 months ago

It's stuff like this I would have loved to know when I was doing game engine dev in the 90s.

mads_quist

2 months ago

I want to do game programming again like it's 1999. No more `npm i` or "accept all cookies" :/ rant off :)

corysama

2 months ago

Go make a game for the Sega Genesis https://mdengine.dev/

Or, the GameBoy Advance https://github.com/GValiente/butano

eru

2 months ago

I was seriously looking into the GameBoy Advance, but the real hardware has gotten quite expensive these days.

I wonder how the latest and greatest Wonderswan is doing in terms of price.

Keyframe

2 months ago

One uses emulator while developing anyways. Try with C64 and VICE and join us at https://csdb.dk/

eru

2 months ago

> One uses emulator while developing anyways.

Yes, but part of the joy is the anticipation of playing on a real device at the end.

> Try with C64 and VICE and join us at https://csdb.dk/

Thanks for the invitation! I used a C64 as my only computer in the late 1990s long past its prime, because my mother got a really good deal on a whole set with printer and disk drives and plenty of disks with software (mostly games, from magazines). However, I was still a bit annoyed by the limitations of the system. I guess, if I had had a Forth disk, I might have felt differently.

In any case, for personal reasons I don't want to explore the C64 more.

But I never had a GameBoy Advance nor a Wonderswan.

2 months ago

Or an alternative for the Sega Genesis https://github.com/Stephane-D/SGDK

Or the Super Nintendo Entertainment System https://github.com/alekmaul/pvsneslib

Or the Gameboy / GBC, Sega Master System, Gamegear, Nintendo Entertainment System https://github.com/gbdk-2020/gbdk-2020

Or the TurboGrafx-16 / PC Engine, Nintendo Entertainment System (alt), Commodore 64, Vic-20, Atari 2600, Atari 7800, Apple II/IIe, or Pet 2001 https://github.com/cc65/cc65

Or the ZX Spectrum, TRS-80, Apple II (alt), Gameboy (alt), Sega Master System (alt), and Game Gear (alt) https://github.com/z88dk/z88dk

Or the Fairchild Channel F https://channelf.se/veswiki/index.php?title=Main_Page

Note: Some are slightly pre-1999 (all these, I have at least successfully made a "Hello World" with)

----------------

If they're really wanting 1999, that's the 5th to 6th generation console range with Sega Saturn, PlayStation, Nintendo 64, and Dreamcast. (on these, only recommendations, no successful compiled software)

Playstation is really challenging and remains so even in 2026. Lots of Modchip and disk swap issues on real hardware. Possibilities: https://www.psx.dev/getting-started and https://github.com/Lameguy64/PSn00bSDK

N64 is less horrible, and there's quite a few resources: https://github.com/DragonMinded/libdragon and https://github.com/command-tab/awesome-n64-development

Sega Saturn is still pretty difficult. However, there is: https://github.com/yaul-org/libyaul?tab=readme-ov-file and https://github.com/ReyeMe/SaturnRingLib plus the old development kits from the 90's are still around https://techdocs.exodusemulator.com/Console/SegaSaturn/Softw...

Dreamcast is similar to the Saturn situation, yet strangely, a little better. There's https://github.com/dreamsdk/dreamsdk/releases and https://github.com/KallistiOS/KallistiOS along with the official SDKs that are still around https://www.sega-dreamcast-info-games-preservation.com/en/re...

doxeddaily

2 months ago

I mean we used dist^2 all the time for comparisons in our game engine back in the 90s (multiple different engines actually).

So it was a known thing...

meindnoch

2 months ago

>you will get the same results using a simple matmul, because euclidean distance over normalized vectors is a linear transform of the cosine distance.

Squared euclidean distance of normalized vectors is an affine transform of their cosine similarity (the cosine of the angle between them).

  EuclideanDistance(x, y) = sqrt(dot(x - y, x - y)) = sqrt(dot(x, x) - 2dot(x, y) + dot(y, y)) = sqrt(2 - 2dot(x, y))
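A quick numerical check of that identity, as a sketch in plain Python. The helper names and the example vectors are made up for illustration. For unit vectors, sorting by descending dot product should give the same order as sorting by ascending Euclidean distance, which is why the sqrt (and the subtraction) can be skipped when only the ranking matters.

```python
import math

def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

def normalize(x):
    """Scale x to unit length."""
    n = math.sqrt(dot(x, x))
    return [a / n for a in x]

def euclidean(x, y):
    d = [a - b for a, b in zip(x, y)]
    return math.sqrt(dot(d, d))

def rank_by_distance(query, candidates):
    """Candidate indices, nearest first, using the full Euclidean distance."""
    return sorted(range(len(candidates)), key=lambda i: euclidean(query, candidates[i]))

def rank_by_dot(query, candidates):
    """Candidate indices, most similar first, using only dot products (no sqrt)."""
    return sorted(range(len(candidates)), key=lambda i: -dot(query, candidates[i]))

# Made-up example vectors, normalized before comparison.
query = normalize([0.2, 0.9, 0.4])
candidates = [normalize(v) for v in ([1.0, 0.0, 0.0], [0.1, 1.0, 0.5], [0.5, 0.5, 0.5])]
print(rank_by_distance(query, candidates) == rank_by_dot(query, candidates))
```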

stephantul

2 months ago

yes, you are right. I realized my mistake afterwards but it was after the edit window.

catlifeonmars

2 months ago

> you could also consider skipping the sqrt.

This is a trick I reach for all the time: it’s cheaper to compare squared distances than completing the Euclidean calculation. For example, to determine whether to stop calculating lerp: x*x+y*y <= epsilon.
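A minimal sketch of that comparison, with a hypothetical helper name. One detail worth spelling out: the squared length has to be compared against the squared tolerance, so the one-liner above only matches the sqrt version if `epsilon` is already a squared value.

```python
import math

def within(dx, dy, tolerance):
    """True if (dx, dy) lies within `tolerance` of the origin, computed without a sqrt."""
    return dx * dx + dy * dy <= tolerance * tolerance

# The sqrt-free test agrees with the straightforward one.
print(within(3.0, 4.0, 5.0), math.hypot(3.0, 4.0) <= 5.0)
print(within(3.0, 4.0, 4.9), math.hypot(3.0, 4.0) <= 4.9)
```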

sph

2 months ago

Every example I thought "yeah, this is cool, but I can see there's space for improvement" — and lo! did the author satisfy my curiosity and improve his technique further.

Bravo, beautiful article! The rest of this blog is at this same level of depth, worth a sub: https://alexharri.com/blog

crazygringo

2 months ago

> I don’t believe I’ve ever seen shape utilized in generated ASCII art, and I think that’s because it’s not really obvious how to consider shape when building an ASCII renderer.

Not to take away from this truly amazing write-up (wow), but there's at least one generator that uses shape:

https://meatfighter.com/ascii-silhouettify/

See particularly the image right above where it says "Note how the algorithm selects the largest characters that fit within the outlines of each colored region."

There's also a description at the bottom of how its algorithm works, if anyone wants to compare.

https://hpjansson.org/chafa/

Instantnoodl

2 months ago

In the "Image to Terminal character" space this is also a known solution. Map characters to their shape and then pick the one with the lowest diff to the real chunk in the image. If you consider that you have a foreground and a background colour you can get a pretty close image in the terminal :D

My go version: https://github.com/BigJk/imeji

2 months ago

Surprised you didn't include the output result for the test image as a showcase of the library's results.

Edit: nvm, I was confused about the library's purpose. I thought it was primarily focused on character-based rendering, given the subject under discussion.

Instantnoodl

2 months ago

Sorry for the confusion. The use-case is a little different because the goal is to display the image as close to the original as possible with the limitation of only being able to use a foreground color, background color and character per cell. The character is selected based on its shape just like in the article. So if you get rid of the colors in Chafa you end up with something similar to the article. That's what I wanted to say :D

2 months ago

Cool, and thanks for the explanation. Gotten interested in retro software recently, so may actually be helpful for trying to set up pictures in some of the retro consoles. Most do tend to be limited to foreground / background. The stuff listed here [1] is pretty representative of what's being dealt with.

Note: If you happen to know how to do multi-color dithering with some of these that would actually make significant improvements on some of these old picture hardware tests.

[1] https://en.wikipedia.org/wiki/List_of_8-bit_computer_hardwar...

Instantnoodl

2 months ago

Isn't your problem more about Color quantization than about dithering? If you have big character cells like in a terminal dithering won't help you much. For each cell you want to find the best shaped cell and a foreground and background colour that are the closest to a colour from the supported palette.

But maybe I didn't understand your real problem yet

2 months ago

Agree in the case of large character cells like a terminal. For those cases, where you only have something like 40x48 in the Apple II Low Res mode, there's only so much you can do with the limited resolution.

However, for many the result is that the color choices are akin to a posterization filter in Photoshop, where the nearest color is simply chosen. Often, there's actually the freedom 'available' to define a character set and choose at least a background / foreground color, with some kind of dithering pattern.

Sometimes the character set that can be defined is limited, so it has to be chosen carefully. Yet there's improvement from a 'large blobs of color' poster result to a smooth dither tone change.

The problem with the quantization result is that it just snaps to the 'nearest'. So even for relatively large areas of slowly changing color, if you only have one 'nearby' color, everything in between just snaps to that single color choice. You might have red, with slowly increasing green / yellow, yet it will always just snap to solid red.

This example from the Vic-20 kind of shows that issue. Large areas where it posterizes severely.

https://upload.wikimedia.org/wikipedia/commons/3/32/Screen_c...

Dithering suggested is something like this (greyscale example) except with choosable foreground / background (maybe 3-4, although less frequently)

https://araesmojo-eng.github.io/images/GreyScale_Dithering.p...

This example from the Vic-20 game Tutankarman shows that kind of approach. Varying amounts of dither and color used in dithering give the impression of changing skin tones.

https://www.neilhuggett.com/vic20/tutankarman03.png

They're both the Vic-20

akie

2 months ago

Love the monochrome gallery of examples https://meatfighter.com/ascii-silhouettify/monochrome-galler...

nomel

2 months ago

Appears to be around 150 times slower. I suspect increasing the sample circle cell resolution would give similarly crisp edges.

hahahahhaah

2 months ago

Yes, I have not seen this technique used for image to ascii art. Maybe for hand crafted stuff it isn't.

roskelld

2 months ago

I do enjoy these kinds of write ups, especially when it's about something that might seem so simple on the surface, but in order to get looking great you really have to go in deep.

Lucas Pope did a really nice write up on how he developed his dithering system for Return of The Obra Dinn. Recommended if you also enjoyed this blog post.

https://forums.tigsource.com/index.php?topic=40832.msg136374...

snackbroken

2 months ago

> I don’t believe I’ve ever seen shape utilized in generated ASCII art, and I think that’s because it’s not really obvious how to consider shape when building an ASCII renderer.

Acerola worked a bit on this in 2024[1], using edge detection to layer correctly oriented |/-\ over the usual brightness-only pass. I think either technique has cases where one looks better than the other.

[1] https://www.youtube.com/watch?v=gg40RWiaHRY

zahlman

2 months ago

I can imagine there's room for "style", here, too. Just like how traditional 2d computer art varies from having thick borders and sharp delineations between colour regions, through https://en.wikipedia.org/wiki/Chiaroscuro style that seems to achieve soft edges despite high contrast, etc.

wonger_

2 months ago

Great breakdown and visuals. Most ASCII filters do not account for glyph shape.

It reminds me of how chafa uses an 8x8 bitmap for each glyph: https://github.com/hpjansson/chafa/blob/master/chafa/interna...

There's a lot of nitty gritty concerns I haven't dug into: how to make it fast, how to handle colorspaces, or like the author mentions, how to exaggerate contrast for certain scenes. But I think 99% of the time, it will be hard to beat chafa. Such a good library.

EDIT - a gallery of (Unicode-heavy) examples, in case you haven't seen chafa yet: https://hpjansson.org/chafa/gallery/

fwipsy

2 months ago

Aha! The 8x8 bitmap approach is the one I used back in college. I was using a fixed font, so I just converted each character to a 64-bit integer and then used popcnt to compare with an 8x8 tile from the image. I wonder whether this approach results in meaningfully different image results from the original post? e.g. focusing on directionality rather than bitmap match might result in more legible large shapes, but fine noise may not be reproduced as faithfully.
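A rough sketch of that popcount idea in Python. The comment doesn't say exactly how the comparison was done, so this assumes Hamming distance (popcount of the XOR) between 1-bit bitmaps; the four toy glyphs and the helper names are invented for the example.

```python
def bitmap_to_int(rows):
    """Pack eight 8-character rows ('#' = ink, anything else = blank) into a 64-bit int."""
    value = 0
    for row in rows:
        for ch in row:
            value = (value << 1) | int(ch == "#")
    return value

def hamming(a, b):
    """Number of differing pixels between two bitmaps: popcount of the XOR."""
    return bin(a ^ b).count("1")

def best_glyph(tile, glyphs):
    """Character whose bitmap differs from `tile` in the fewest pixels."""
    return min(glyphs, key=lambda ch: hamming(tile, glyphs[ch]))

# Toy 8x8 glyph bitmaps (not taken from any real font).
GLYPHS = {
    " ": bitmap_to_int(["........"] * 8),
    "|": bitmap_to_int(["...##..."] * 8),
    "-": bitmap_to_int(["........"] * 3 + ["########"] * 2 + ["........"] * 3),
    "#": bitmap_to_int(["########"] * 8),
}

# A thresholded image tile: a vertical bar with one stray pixel.
tile = bitmap_to_int(["...##..."] * 7 + ["..###..."])
print(best_glyph(tile, GLYPHS))
```

Because every pixel counts equally here, this matches fine texture literally, whereas a handful of sampling regions summarizes where the ink sits in the cell.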

smusamashah

2 months ago

But the chafa gallery isn't showing off ascii text rendering. Are there examples that use ascii text?

wonger_

2 months ago

Good point. I haven't found many ascii examples online.

Here's a copy-paste snippet where you can try chafa-ascii-fying images in your own terminal, if you have uvx:

  uvx --with chafa-py python -c '
  from chafa import * 
  from chafa.loader import Loader 
  import sys 
  img = Loader(sys.argv[1])
  config = CanvasConfig() 
  config.calc_canvas_geometry(img.width,img.height,0.5,True,False)
  symbol_map = SymbolMap()
  symbol_map.add_by_tags(SymbolTags.CHAFA_SYMBOL_TAG_ASCII)
  config.set_symbol_map(symbol_map)
  config.canvas_mode = CanvasMode.CHAFA_CANVAS_MODE_FGBG
  canvas = Canvas(config)
  canvas.draw_all_pixels(img.pixel_type,img.get_pixels(),img.width,img.height,img.rowstride)
  print(canvas.print().decode())
  ' \
  myimage.jpg

But results are not as good as the OP's work. https://wonger.dev/assets/chafa-ascii-examples.png So I'll revise my claim: chafa is great for unicodey colorful environments, but hand-tailored ascii-only work like the OP's is worth the effort.

keepamovin

2 months ago

my favorite ascii glyphs are the classic IBM Code Page 437: https://int10h.org/oldschool-pc-fonts/fontlist/

and damn that article is so cool, what a rabbithole.

aleyan

2 months ago

Great work! While I was building ascii-side-of-the-moon [0][1] I briefly considered writing my own ascii renderer to capture differences in shade and shape of the Lunar Maria[2] better. Ended up just using chafa [3] with the hope of coming back to ascii rendering after everything is working end to end.

Are you planning to release this as a library or a tool, or should we just take the relevant MIT licensed code from your website [4]?

[0] https://aleyan.com/projects/ascii-side-of-the-moon

[1] https://news.ycombinator.com/item?id=46421045

[2] https://en.wikipedia.org/wiki/Lunar_mare

[3] https://github.com/hpjansson/chafa

[4] https://github.com/alexharri/website/tree/master/src

2 months ago

The ASCII moon tool is fun to play around with!

No plans to build a library right now, but who knows. Feel free to grab what you need from the website's code!

If I were to build a library, I'd probably convert the shaders from WebGL 2 to WebGL 1 for better browser compatibility. Would also need to figure out a good API for the library.

One thing that a library would need to deal with is that the shape vector depends on the font family, so the user of the library would need to precompute the shape vectors with the input font family. The sampling circles, internal and external, would likely need to be positioned differently for different font families. It's not obvious to me how a user of the library would go about that. There'd probably need to be some tool for that (I have a script to generate the shape vectors with a hardcoded link to a font in the website repository).
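To make the precomputation step concrete, here is a hedged sketch of how a shape vector could be derived from a rasterized glyph. It is not the script from the website repository: the circle layout, the radius, and the function names are all assumptions, and the staggering and external circles described in the article are left out.

```python
def circle_coverage(bitmap, cx, cy, r):
    """Mean ink value of the pixels whose centres lie inside the circle (pixel units)."""
    total = count = 0
    for y, row in enumerate(bitmap):
        for x, ink in enumerate(row):
            if (x + 0.5 - cx) ** 2 + (y + 0.5 - cy) ** 2 <= r * r:
                total += ink
                count += 1
    return total / count if count else 0.0

def shape_vector(bitmap, circles):
    """One coverage value per sampling circle, in the order the circles are listed."""
    return [circle_coverage(bitmap, cx, cy, r) for cx, cy, r in circles]

# Hypothetical layout for an 8x16 glyph cell: 2 columns x 3 rows of circles.
# A real font would need these positions and radii tuned per font family.
CIRCLES = [(cx, cy, 2.5) for cy in (3, 8, 13) for cx in (2, 6)]

# Toy "glyph": ink only in the bottom half of the cell.
bottom_half = [[0] * 8 for _ in range(8)] + [[1] * 8 for _ in range(8)]
print(shape_vector(bottom_half, CIRCLES))
```

A library would run something like this once per character for the chosen font and ship or cache the resulting table.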

echoangle

2 months ago

Very cool effect!

> It may seem odd or arbitrary to use circles instead of just splitting the cell into two rectangles, but using circles will give us more flexibility later on.

I still don’t really understand why the inner part of the rectangle can’t just be split in a 2x3 grid. Did I miss the explanation?

DexesTTP

2 months ago

It's because circles allow for a stagger and overlap as shown later on. It's not really possible to get the same effect from squares.

echoangle

2 months ago

But it seems like you only need the stagger and overlap because you’re using circles in the first place. Would it look worse if you just divided the rectangle into 6 squares without any gaps or overlap?

zestyping

2 months ago

My thought exactly. The sampling circles only enable you to (awkwardly) solve a problem that was fabricated by using circles in the first place.

panki27

2 months ago

I wondered the same thing, but characters usually don't reach the edges, so I guess circles fit the average character better?

kennethallen

2 months ago

There are many different supersampling patterns you can use: https://en.wikipedia.org/wiki/Supersampling#Supersampling_pa...

A grid can have unwanted aliasing effects. It all depends on the kinds of images you're working with.

MrJohz

2 months ago

I think this is connected to the overlap and offset that are used later to account for complex or symmetrical letter shapes. If the author had just split the grid, those effects would have been harder to achieve.

Jyaif

2 months ago

It's important to note that the approach described focuses on giving fast results, not the best results.

Simply trying every character and considering their entire bitmap, and keeping the character that reduces the distance to the target gives better results, at the cost of more CPU.

This is a well known problem because early computers with monitors used to only be able to display characters.

At some point we were able to define custom character bitmap, but not enough custom characters to cover the entire screen, so the problem became more complex. Which new character do you create to reproduce an image optimally?

And separately we could choose the foreground/background color of individual characters, which opened up more possibilities.

2 months ago

Yeah, this is good to point out. The primary constraint I was working around was "this needs to run at a smooth 60FPS on mobile devices" which limits the type and amount of work one can do on each frame.

I'd probably arrive at a very different solution if coming at this from a "you've got infinite compute resources, maximize quality" angle.

brap

2 months ago

You said “best results”, but I imagine that the theoretical “best” may not necessarily be the most aesthetically pleasing in practice.

For example, limiting output to a small set of characters gives it a more uniform look which may be nicer. Then also there’s the “retro” effect of using certain characters over others.

Dylan16807

2 months ago

> limiting output to a small set of characters gives it a more uniform look which may be nicer

And in the extreme that could totally change things. Maybe you want to reject ASCII and instead use the Unicode block that has every 2x3 and 2x4 braille pattern.

spuz

2 months ago

Thinking more about the "best results". Could this not be done by transforming the ascii glyphs into bitmaps, and then using some kind of matrix multiplication or dot product calculation to calculate the ascii character with the highest similarity to the underlying pixel grid? This would presumably lend itself to SIMD or GPU acceleration. I'm not that familiar with this type of image processing so I'm sure someone with more experience can clarify.
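One way that idea could look, sketched in plain Python with made-up 2x2 "glyphs". A raw dot product would always favour the densest glyph, so this version assumes each flattened bitmap is scaled to unit length first (cosine similarity), which also means overall brightness is ignored. The scoring loop is a single matrix-vector product, which is the part that would map onto SIMD or a GPU.

```python
import math

def unit(v):
    """Scale a flattened bitmap to unit length (all-zero bitmaps stay all-zero)."""
    n = math.sqrt(sum(a * a for a in v))
    return [a / n for a in v] if n else list(v)

def best_match(cell, glyph_rows):
    """Index of the glyph whose unit-length bitmap has the largest dot product with `cell`."""
    c = unit(cell)
    # One row of the glyph matrix times the cell vector per glyph.
    scores = [sum(a * b for a, b in zip(c, g)) for g in glyph_rows]
    return max(range(len(scores)), key=scores.__getitem__)

# Toy 2x2 glyphs, flattened row by row: top-left dot, top row, left column, full block.
GLYPHS = [unit(g) for g in ([1, 0, 0, 0], [1, 1, 0, 0], [1, 0, 1, 0], [1, 1, 1, 1])]

print(best_match([0.9, 0.8, 0.1, 0.0], GLYPHS))
```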

mark-r

2 months ago

> This is a well known problem because early computers with monitors used to only be able to display characters.

It's not just monitors. My first exposure to ASCII art was posters that were printed on a Teletype, in the mid-1970s. The files had attributions to RTTY operators, which made me believe they were done by hand. Of course a Teletype had no concept of pixels.

2 months ago

In practice isn’t a large HashMap best for lookup, based on compile-time or static constants describing the character-space?

spuz

2 months ago

In the appendix, he talks about reducing the lookup space by quantising the sampled points to just 8 possible values. That allowed him to make a lookup table about 2MB in size, which was apparently incredibly fast.
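A small sketch of how such a quantized lookup table can work. This is not the article's implementation: it uses hypothetical 3-component shape vectors to keep the table tiny (8^3 entries), and the character set and helper names are invented.

```python
from itertools import product

LEVELS = 8  # quantization levels per component

def quantize(v):
    """Map each component in [0, 1] to an integer level 0..LEVELS-1."""
    return tuple(min(LEVELS - 1, int(x * LEVELS)) for x in v)

def key(levels):
    """Pack the per-component levels into a single table index."""
    k = 0
    for q in levels:
        k = k * LEVELS + q
    return k

def nearest(v, shapes):
    """Brute-force nearest character by squared Euclidean distance."""
    return min(shapes, key=lambda ch: sum((a - b) ** 2 for a, b in zip(v, shapes[ch])))

def build_table(shapes, dims):
    """Precompute the nearest character for the centre of every quantization bucket."""
    table = [None] * (LEVELS ** dims)
    for levels in product(range(LEVELS), repeat=dims):
        centre = [(q + 0.5) / LEVELS for q in levels]
        table[key(levels)] = nearest(centre, shapes)
    return table

# Hypothetical 3-component shape vectors (top, middle, bottom coverage).
SHAPES = {" ": (0.0, 0.0, 0.0), "'": (0.6, 0.0, 0.0), ".": (0.0, 0.0, 0.6), "#": (0.9, 0.9, 0.9)}
TABLE = build_table(SHAPES, dims=3)

def lookup(v):
    """Constant-time replacement for nearest(): quantize, pack, index."""
    return TABLE[key(quantize(v))]

print(lookup((0.05, 0.02, 0.7)))
```

The trade-off is that every vector in a bucket gets the answer computed for the bucket's centre, so results near bucket boundaries can differ from an exact search.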

2 months ago

I've been working on something similar (didn't get to this stage yet) and was planning to do something very similar to the circle-sampling method but the staggering of circles is a really clever idea I had never considered. I was planning on sampling character pixels' alignment along orthogonal and diagonal axes. You could probably combine these approaches. But yeah, such an approach seemed particularly powerful for the reason you could encode it all in a table.

Sharlin

2 months ago

And a (the?) solution is using an algorithm like k-means clustering to find the tileset of size k that can represent a given image the most faithfully. Of course that’s only for a single frame at a time.

mwillis

2 months ago

Fantastic technique and deep dive. I will say, I was hoping to see an improved implementation of the Cognition cube array as the payoff at the end. The whole thing reminded me of the blogger/designer who, years ago, showed YouTube how to render a better favicon by using subpixel color contrast, and then IIRC they implemented the improvement. Some detail here: https://web.archive.org/web/20110930003551/http://typophile....

zellyn

2 months ago

+1 to wanting to see the cognition logo with contrast. It was set up as the target, but no payoff!

Lovely article, and the dynamic examples are :chefs-kiss:

frognumber

2 months ago

This was painful to read. It becomes better and simpler with a basic signals & systems background:

- His breaking up images into grids was a poor-man's convolution. Render each letter. Render the image. Dot product.

- His "contrast" setting didn't really work. It was meant to emulate a sharpen filter. Convolve with a kernel appropriate for letter size. He operated over the wrong dimensions (intensity, rather than X-Y)

- Dithering should be done with something like Floyd-Steinberg: You spill over errors to adjacent pixels.

Most of these problems have solutions, and in some cases, optimal ones. They were reinvented, perhaps cleverly, but not as well as those standard solutions.
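For reference, the Floyd-Steinberg step mentioned in the list above looks roughly like this on a grayscale image. This is the textbook per-pixel algorithm as a sketch, not anything from the article, and how to adapt it to whole character cells (where the "palette" is a glyph set) is left open.

```python
def floyd_steinberg(image, levels=2):
    """Quantize a 2D list of floats in [0, 1] to `levels` evenly spaced values,
    pushing each pixel's rounding error onto its not-yet-visited neighbours."""
    h, w = len(image), len(image[0])
    work = [row[:] for row in image]  # copy so the input is left untouched
    out = [[0.0] * w for _ in range(h)]
    steps = levels - 1
    for y in range(h):
        for x in range(w):
            old = work[y][x]
            new = min(1.0, max(0.0, round(old * steps) / steps))
            out[y][x] = new
            err = old - new
            # Classic weights: 7/16 right, 3/16 down-left, 5/16 down, 1/16 down-right.
            if x + 1 < w:
                work[y][x + 1] += err * 7 / 16
            if y + 1 < h:
                if x > 0:
                    work[y + 1][x - 1] += err * 3 / 16
                work[y + 1][x] += err * 5 / 16
                if x + 1 < w:
                    work[y + 1][x + 1] += err * 1 / 16
    return out

# A flat mid-grey patch comes out as a mix of black and white pixels.
for row in floyd_steinberg([[0.5] * 8 for _ in range(8)]):
    print("".join("#" if v else "." for v in row))
```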

Bonus:

- Handle above as a global optimization problem. Possible with 2026-era CPUs (and even more-so, GPUs).

- Unicode :)

snowmobile

2 months ago

Perhaps you're right but I won't believe you until you whip up a live-rendering proof of concept. It's a bit rude to dismiss somebody's cool work as "painful", with some hypothetical "improvements" that probably wouldn't even work.

iknowstuff

2 months ago

Jeez nobody’s going to respect you more for writing like a jackass

unnah

2 months ago

It's probably much more exciting to implement stuff like this when you can experiment with your own ideas to figure out the solution from scratch, compared to someone who sees it as a trivial exercise in signal processing, which they can't be bothered to implement.

https://greggman.github.io/doodles/textme10.html

greggman65

2 months ago

I didn’t put nearly as much effort as this post into shape matching but I did try a few other things like

Non-ascii, I tried various subsets of Unicode. There’s the geometric shape area, CJK, dingbats, lots of others

Different fonts - there are lots of different monospace fonts. I even tried non-monospaced fonts tho still drawn in grid

ANSI color style https://16colo.rs/

My results weren’t nearly as good as the ones in this article but just suggesting more ways of exploration

Note: options are buried in the menu. Best to pick a scene other than the default

AgentMatt

2 months ago

Great article!

I think there's a small problem with intermediate values in this code snippet:

  const maxValue = Math.max(...samplingVector)

  samplingVector = samplingVector.map((value) => {
    value = x / maxValue; // Normalize
    value = Math.pow(x, exponent);
    value = x * maxValue; // Denormalize
    return value;
  })

Replace x by value.
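With that substitution applied, the step reads as follows. This is a Python port for illustration rather than the article's JavaScript; the function name and the zero-vector guard are additions, not part of the quoted snippet.

```python
def enhance_contrast(sampling_vector, exponent):
    """Normalize by the largest component, apply the exponent, then scale back."""
    max_value = max(sampling_vector)
    if max_value == 0:
        # Added guard (assumption): nothing to enhance, and avoids dividing by zero.
        return list(sampling_vector)
    result = []
    for value in sampling_vector:
        value = value / max_value   # normalize to [0, 1]
        value = value ** exponent   # values near 0 are pulled down the hardest
        value = value * max_value   # denormalize
        result.append(value)
    return result

print(enhance_contrast([0.2, 0.4, 0.8], 2))
```

The largest component is left unchanged, while smaller components shrink, which widens the gap between them.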

2 months ago

Just pushed a fix, should be live in a minute or two, thanks again!

Aaron2222

2 months ago

This loop is similarly suspect:

  let maxValue = value;
  for (const externalIndex of AFFECTING_EXTERNAL_INDICES[i]) {
    maxValue = Math.max(value, externalSamplingVector[externalIndex]);
  }
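The comment doesn't spell out the problem, but presumably it is that each iteration compares against `value` rather than the running `maxValue`, so only the last external index has any effect. A Python sketch of the presumably intended behaviour, with a hypothetical function name:

```python
def max_with_external(value, external_sampling_vector, affecting_indices):
    """Largest of `value` and the external samples at `affecting_indices`."""
    max_value = value
    for external_index in affecting_indices:
        # Compare against the running maximum, not the original value,
        # so an earlier, larger sample is not overwritten by a later one.
        max_value = max(max_value, external_sampling_vector[external_index])
    return max_value

# With the loop as quoted, this case would yield 0.2 instead of 0.9.
print(max_with_external(0.2, [0.9, 0.1], [0, 1]))
```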

2 months ago

Good catch, thanks! I’ll push a fix once I’m home

dboon

2 months ago

Fantastic article! I wrote an ASCII renderer to show a 3D Claude for my Claude Wrapped[^1], and instead of supersampling I just decided to raymarch the whole thing. SDFs give you a smoother result than even super sampling, but of course your scene has to be represented with distance functions and combinations thereof whereas your method is generally applicable.

Taking into account the shape of different ASCII characters is brilliant, though!

[1]: https://spader.zone/wrapped/

2 months ago

Looks very cool! Thanks for sharing.

The resulting ASCII looks dithered, with sequences like e.g. :-:-:-:-:. I'd guess that it's an intentional effect since a flat surface would naturally repeat the same character, right? Where does the dithering come from?

CarVac

2 months ago

The contrast enhancement seems simpler to perform with an unsharp mask in the continuous image.

It probably has a different looking result, though.

thech6newshound

2 months ago

Quite amazing breakdown, thank you!

I'm hoping people who harness ASCII for stuff like this consider using Code Page 437, or similar. Extended ASCII sets comprising Foreign Chars are for staid business machines, and sort of familiar but out of place accented chars have a bit of a distracting quality.

437 and so on taps the nostalgia for BBS Art, DOS, TUIs, scene NFOs, 8 bit micros.... Everything pre Code Page 1252, in other words. Whilst it was a pragmatic decision for MS, it's also true that marketing needs demanded all text interfaces disappear because they looked old. Text graphics, doubly so. That design space was now reserved for functional icons. A bit of creativity went from (home) computing right there and then. Stuffing it all into a separate font ensured it died.

But, that stuff is genuinely cool to a lot of people in a way Vim (for example) never has been and never will be. This is a case of Form Over Function. Foreign chars are not as friendly or fun as hearts, building blocks, smileys, musical notes, etc.

jrmg

2 months ago

This is amazing all round - in concept, writing, and coding (both the idea and the blog post about it).

I feel confident stating that - unless fed something comprehensive like this post as input, and perhaps not even then - an LLM could not do something novel and complex like this, and will not be able to for some time, if ever. I’d love to read about someone proving me wrong on that.

Lerc

2 months ago

To develop this approach you need to think through the reasoning of what you want to achieve. I don't think the reasoning in LLMs is nonexistent, but it is certainly somewhat limited. This is disguised by their vast knowledge. When they successfully achieve a result by relying on knowledge you get an impression of more reasoning than there is.

Everyone now seems familiar with hallucinations: they appear when a model's knowledge is lacking and it has been fine-tuned to give an answer anyway. A simplistic calculation says that if an accurate answer gets you 100%, then giving an answer gets you 50% and being accurate gets you 50%. Hallucinations are trying to get partial credit for bullshit. Teaching a model that a wrong answer is worse than no answer is the obvious solution; turning that lesson into training methods is harder.

That's a bit of a digression, but I think it helps explain why I think a model would find writing an article like this difficult.

Models have difficulty in understanding what is important. The degree to which they do achieve this is amazing, but it is still trained on data that heavily biases their conclusions to the mainstream thinking. In that respect I'm not even sure if it is a fundamental lack in what they could do. It seems to be that they are implicitly made to think of problems as "it's one of those, I'll do what people do when faced with one of those"

There are even hints in fiction that this is what we were going to do. There is a fairly common sci-fi trope of an AI giving a thorough and reasoned analysis of a problem only to be cut off by a human wanting the simple and obvious answer. If not done carefully RLHF becomes the embodiment of this trope in action.

This gives a result that makes the most people immediately happy, without regard for what is best long term, or indeed what is actually needed. Asimov explored the notion of robots lying so as to not hurt feelings. Much of the point of the robot books was to express the notion that what we want AI to be is more complicated than it appears at first glance.

cryptonector

2 months ago

This. With good prompting you can get Opus 4.5 to do amazing things, but you have to know what you're doing -- it has to be the case that you could have implemented everything that Claude will do for you, and that what Claude is doing more than anything is a) go faster, b) be your well-read rubber ducky.

soulofmischief

2 months ago

I'm confident that they can. This isn't a new idea. Something like this would be a walk in the park for Opus 4.5 in the right harness.

Of course it likely still needs a skilled pair of eyes and a steady hand to keep it on track or keep things performant, but it's an iterative process. I've already built my own ASCII rendering engines in the past, and have recently built one with a coding model, and there was no friction.

teiferer

2 months ago

> skilled pair of eyes and a steady hand

But that's key here.

"A hammer and a chisel can build a 6ft wooden sculpture by themselves just fine .. as long as guided by a skilled pair of eyes and steady hands"

soulofmischief

2 months ago

Ok, but if you have a wooden hammer and chisel, and a steel hammer and chisel, choosing the wooden one is an artisanal choice, not a practical one. These tools enable an amount of velocity I've never had before, both in research and development.

fsckboy

2 months ago

>ASCII characters are not pixels: a deep dive into ASCII rendering

in general, ascii rendering is when ascii character codes are converted to pixels. if you wish to render other pixels onto a screen using characters, they are not ascii characters, they are roman or latin character glyphs, no ascii involved. that is all.

LexiMax

2 months ago

Only tangentially related, but the title reminds me of a hack you could do on old DOS machines to get access to a 160x100 16-color display mode on a CGA graphics adapter.

The display mode is actually a hacked up 80x25 text mode. So in that specific narrow case, you have a display mode where text characters very much function as pixels.

- https://en.wikipedia.org/wiki/Color_Graphics_Adapter

- https://github.com/drwonky/cgax16demo

joshu

2 months ago

https://alumni.media.mit.edu/~nelson/courses/mas814/

nickdothutton

2 months ago

What a great post. There is an element of ascii rendering in a pet project of mine and I’m definitely going to try and integrate this work. From great constraints comes great creativity.

symisc_devel

2 months ago

There is already a C library that does realtime ascii rendering using decision trees:

GitHub: https://github.com/symisc/ascii_art/blob/master/README.md Docs: https://pixlab.io/art

nowayhaze

2 months ago

The OP's ASCII art edges look way better than this

nxobject

2 months ago

I'm playing with a related problem in my spare time - braille character-based color graphics; while we have enough precision for sharp edges, the fundamental issues with color are still the same: if we begin with a supersampling pass for assignment, we lack precision, so we may need to do some contrast fixups afterward. I think some contrast enhancement based on your sampling schemes might be useful :) Thank you so much for posting this!

(I've previously tried pre-transforming on the image side to do color contrast enhancement, but without success: I take the Sobel filter of an image, and use it to identify regions where I boost contrast. However, since this is a step preceding "rasterization", the results don't align well with character grids.)

MPSimmons

2 months ago

[-]

This is an awesome effort. I stared and played with the rotating graphics at the top for a while before reading the rest of the article, trying to figure out why it was so much better than a lot of the efforts I'd seen before, and I kind of figured out what you must be doing, but I'll admit, I wouldn't have ever done it as well or put in as much work as you had - really excellent techniques for determining character!

I am actually really curious how performant this is and whether something like this would be able to contribute beyond just demo displays. It's obviously beautiful and a marvel of work, but it seems like there should be a way to use it for more.

Also, I did find myself wondering about the inevitable Doom engine

Really nice job!

markshtat

2 months ago

[-]

Great writeup! I put together a Python CLI implementation: https://github.com/mayz/ascii-renderer

Supports color output, contrast enhancement, custom charsets. MIT licensed.

chrisra

2 months ago

[-]

> To increase the contrast of our sampling vector, we might raise each component of the vector to the power of some exponent.

How do you arrive at that? It's presented like it's a natural conclusion, but if I was trying to adjust contrast... I don't see the connection.

c7b

2 months ago

[-]

What about the explanation presented in the next paragraph?

> Consider how an exponent affects values between 0 and 1. Numbers close to 0 experience a strong pull towards 0 while larger numbers experience less pull. For example 0.1^2=0.01, a 90% reduction, while 0.9^2=0.81, only a reduction of 10%.

That's exactly the reason why it works, it's even nicely visualized below. If you've dealt with similar problems before you might know this in the back of your head. Eg you may have had a problem where you wanted to measure distance from 0 but wanted to remove the sign. You may have tried absolute value and squaring, and noticed that the latter has the additional effect described above.
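The pull towards 0 is easy to check numerically. This is only a minimal sketch, assuming the sampling vector's components already lie in [0, 1]; the name `enhance_contrast` is made up for illustration and is not taken from the article's code.

```python
def enhance_contrast(vector, exponent):
    """Raise each component (assumed to lie in [0, 1]) to `exponent`."""
    return [v ** exponent for v in vector]


samples = [0.1, 0.5, 0.9]
squared = enhance_contrast(samples, 2)

# Relative reduction of each component: small values shrink the most,
# so the gap between dark and bright components widens.
reductions = [1 - after / before for before, after in zip(samples, squared)]
```

With an exponent of 2 the relative reductions come out at roughly 90%, 50% and 10%, which is the effect the quoted paragraph describes.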

It's a bit like a math undergrad wondering about a proof 'I understand the argument, but how on earth do you come up with this?'. The answer is to keep doing similar problems and at some point you've developed an arsenal of tricks.

2 months ago

[-]

In general for analytic functions like e^x or x^n the behaviour of the function on any open interval is enough to determine its behaviour elsewhere. By extension, in mathematics, examining values around the fundamental additive and multiplicative units {0, 1} is fruitful in illustrating the quintessential behaviour of the function.

lysace

2 months ago

[-]

Seems like stellar work. Kudos.

I am, however, struck by the highly niche-specific (from an outsider's POV) terminology used in the title.

"ASCII rendering".

Yes, I know what ASCII is. I understand text rendering in sometimes painful detail. This was something else.

Yes, it's a niche and niches have their own terminologies that may or may not make sense in a broader context.

HN guidelines say "Otherwise please use the original title, unless it is misleading or linkbait; don't editorialize."

I'm not sure what is the best course of action here - perhaps nothing. I keep bumping into this issue all the time at HN, though. Basically the titles very often don't include the context/niche.

voidUpdate

2 months ago

[-]

> The image of Saturn was generated with ChatGPT.

Was there something wrong with using an actual image of saturn? NASA lets you use their images for stuff if you want https://www.nasa.gov/nasa-brand-center/images-and-media/, and if you're worried that might change down the line, you could just add a little attribution thing for NASA

gpt5

2 months ago

[-]

I'm not sure why it bothers you. But to guess why OP has done it - if you look at his request to ChatGPT - he wanted a square image with Saturn at 45 degree angle for this demonstration. I don't know if NASA has that image, and if it does, how long it would take to dig it up (from a quick search - I couldn't find any), so it's pretty sensible to just use ChatGPT for this demonstration and credit it for the image.

voidUpdate

2 months ago

[-]

I searched "Saturn at 45 degree angle" and found https://commons.wikimedia.org/wiki/File:Saturn_during_Equino... in less than a minute of looking at google images. NASA has loads of images like that from their Cassini program.

Maybe it's just me, but I'd prefer a real image rather than something generated by the plagiarism machine that almost certainly took in that exact image as part of its training data

sandos

2 months ago

[-]

I did something similar to use images in a mosaic, taking the image contents into consideration. This, it turns out, is super-simple as long as you do everything in JPEG space: just use however many coefficients you want to compare! So, scale the original image to have 8x8 pixels per "image pixel" in the final output, and then scale every candidate to 8x8. Now just compare the DCT coeffs directly!

A similar technique could probably be used here.
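A rough sketch of that comparison in pure Python, for anyone who wants to try it. Assumptions: tiles are 8x8 grids of grayscale floats, the "however many coefficients" are taken as the top-left k x k square of the DCT (JPEG itself orders coefficients in a zigzag, which is skipped here for brevity), and all function names are made up for illustration.

```python
import math

N = 8  # JPEG block size


def dct2_8x8(block):
    """Orthonormal 2D DCT-II of an 8x8 block (8 rows of 8 floats)."""
    def alpha(k):
        return math.sqrt(1 / N) if k == 0 else math.sqrt(2 / N)

    out = [[0.0] * N for _ in range(N)]
    for u in range(N):          # vertical frequency
        for v in range(N):      # horizontal frequency
            s = 0.0
            for y in range(N):
                for x in range(N):
                    s += (block[y][x]
                          * math.cos((2 * y + 1) * u * math.pi / (2 * N))
                          * math.cos((2 * x + 1) * v * math.pi / (2 * N)))
            out[u][v] = alpha(u) * alpha(v) * s
    return out


def low_freq(coeffs, k):
    """Top-left k x k coefficients (the lowest frequencies), flattened."""
    return [coeffs[u][v] for u in range(k) for v in range(k)]


def distance(a, b, k=3):
    """Squared distance between two tiles using only k*k DCT coefficients."""
    ca, cb = low_freq(dct2_8x8(a), k), low_freq(dct2_8x8(b), k)
    return sum((p - q) ** 2 for p, q in zip(ca, cb))


def best_match(target, candidates, k=3):
    """Index of the candidate tile closest to the target tile."""
    return min(range(len(candidates)),
               key=lambda i: distance(target, candidates[i], k))
```

Smaller `k` compares only coarse structure (overall brightness and broad gradients); larger `k` lets finer detail influence the match. A real implementation would precompute the candidates' coefficients once instead of redoing the DCT per comparison.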

TeamCommet1

2 months ago

[-]

This is a great deep dive. Most ASCII renderers feel "muddy" because they treat intensity as the only variable. Treating characters as structural embeddings (the 6D vector approach) is much closer to how our eyes actually perceive edges. It reminds me of how font hinting works at low resolutions. Truly impressive work on the contrast enhancement pass too.

nurettin

2 months ago

[-]

I love that they don't just work on the edges and declare their work complete. No, shadows also have to be perfect!

Reminds me of this underrated library which uses braille alphabet to draw lines. Behold:

https://github.com/tammoippen/plotille

It's a really nice plotting tool for the terminal. For me it increases the utility of LLMs.

Izkata

2 months ago

[-]

I dunno, going to the last example at the bottom of the page and comparing the contrast slider all the way up and all the way down, all these enhancements combined turn it into a blurry mush where it's harder to distinguish the shapes. It's the exact same problem I had with anti-aliasing fonts on older monitors (smaller resolutions) and why I always disabled it wherever I could.

aghilmort

2 months ago

[-]

really great! adjacent well-done ASCII using Braille blocks on X this week:

nolen: "unicode braille characters are 2x4 rectangles of dots that can be individually set. That's 8x the pixels you normally get in the terminal! anyway here's a proof of concept terminal SVG renderer using unicode braille", https://x.com/itseieio/status/2011101813647556902

ashfn: "@itseieio You can use 'persistence of vision' to individually address each of the 8 dots with their own color if you want, there's some messy code of an example here", https://x.com/ashfncom/status/2011135962970218736
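For reference, the 2x4 dot addressing works because the Unicode braille block starts at U+2800 and each of the eight dots maps to one bit of the code point. A small sketch (function names are made up for illustration; this is not the code from either tweet):

```python
# Bit for each dot, indexed as DOT_BITS[row][col] in a 2-wide, 4-tall cell.
# The layout follows the Unicode braille block: dots 1-3 and 7 form the
# left column, dots 4-6 and 8 the right column.
DOT_BITS = [
    [0x01, 0x08],
    [0x02, 0x10],
    [0x04, 0x20],
    [0x40, 0x80],
]
BRAILLE_BASE = 0x2800  # U+2800, the blank braille pattern


def braille_cell(pixels):
    """pixels: 4 rows of 2 truthy/falsy values -> one braille character."""
    code = BRAILLE_BASE
    for row in range(4):
        for col in range(2):
            if pixels[row][col]:
                code |= DOT_BITS[row][col]
    return chr(code)


def render(bitmap):
    """bitmap: rows of 0/1; height a multiple of 4, width a multiple of 2."""
    lines = []
    for top in range(0, len(bitmap), 4):
        line = ""
        for left in range(0, len(bitmap[0]), 2):
            cell = [bitmap[top + r][left:left + 2] for r in range(4)]
            line += braille_cell(cell)
        lines.append(line)
    return "\n".join(lines)
```

Each terminal cell then carries 8 on/off pixels, which is where the "8x the pixels" figure comes from. Color is still per cell, not per dot, unless you resort to tricks like the persistence-of-vision one mentioned above.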

nomel

2 months ago

[-]

It would be interesting to see how things changed if you included extended ascii characters [1], which were widely used for ascii UI.

[1] https://www.lookuptables.com/text/extended-ascii-table

2 months ago

[-]

I did actually try out various alphabets e.g. Cyrillic, Greek and symbols (e.g. box drawing symbols), but ended up removing them: https://github.com/alexharri/website/commit/d969ef839

Using only ASCII felt more in the "spirit" of the post and reduced scope (which is always good)

octoberfranklin

2 months ago

[-]

Application error: a client-side exception has occurred (see the browser console for more information).

Sesse__

2 months ago

[-]

I did something very similar to this (searching for similar characters across the grid, including some fuzzy matching for nearby pixels) around 1996. I wonder if I still have the code? It was exceedingly slow, think minutes for a frame on the Pentiums of the time.

eerikkivistik

2 months ago

[-]

It reminds me quite a bit of collision engines for 2D physics/games. Could probably find some additional clever optimisations for the lookup/overlap (better than kd-trees) if you dive into those. Not that it matters too much. Very cool.

shiandow

2 months ago

[-]

I'm not sure if this exponent is actually enhancing contrast or just fixing the gamma.

NelsonMinar

2 months ago

[-]

I'd love to see this extended to non-ASCII characters. Not the full Unicode set, but maybe a big bag of alphabetic writing.

BTW, aalib was using character shape back in the 90s. This is very cool but there is prior art!

nathaah3

2 months ago

[-]

that was so brilliant! i loved it! thanks for putting it out :)

TuringNYC

2 months ago

[-]

Amazing, my son and I went thru the whole post! This is as excellent on communication and technical writing as it is on the math and science.

ripe

2 months ago

[-]

Wonderful article and illustrations! I got sucked in by the successive disclosures of "but this is a problem, so we do that to solve it." Bravo!

avadodin

2 months ago

[-]

I don't think anyone has mentioned this but I miss a picture of people or possibly even a movie.

adam_patarino

2 months ago

[-]

Tell me someone has turned this into a library we can use

2 months ago

[-]

Author here. There isn't a library around this yet, but the source code for the blog is open source (MIT licensed): https://github.com/alexharri/website

The code for this post is all in PR #15 if you want to take a look.

nathell

2 months ago

[-]

Well there's aalib and libcaca, but I'm not sure about their fidelity compared to this.

guerby

2 months ago

[-]

Don't know which algorithms are used by the famous libcaca:

https://github.com/cacalabs/libcaca

minimaxir

2 months ago

[-]

I was investigating a fun webcam-to-ASCII project so now I am tempted to take an approach at porting the logic from the blog post into something reusable.

minimaxir

2 months ago

[-]

Update: I tested a port of the OP's methodology using Claude Code/Claude Opus 4.5 with some specific performance optimizations, and per the benchmarks, converting a 1024x1024 image to ASCII takes 16 microseconds. I suspect that will decrease after some more polish/iteration but that's enough for potentially real-time generation even on mobile hardware.

BobbyTables2

2 months ago

[-]

That doesn’t seem right.

Surely you mean 16 milliseconds ?

minimaxir

2 months ago

[-]

Benchmark says 15.654 µs. Rendering the text as a 1024x1024 image is 2.8737 ms.

However, the ASCII output quality is nondiverse despite using the same technique, so will need to do significantly more testing and this likely won't be released soon.

baud9600

2 months ago

[-]

This is such a great article!

I found myself thinking, “I wonder if some of this could be used to playback video on old 8-bit machines?” But they’re so underpowered…

https://youtu.be/wM3deQAgMpE?si=h2O1uTQqxFtCRCsh

mackid

2 months ago

[-]

Might check out what people have done on the 6502 in an Apple II.

maxglute

2 months ago

[-]

Mesmerizing, the i, ! shading is unreasonably effective.

cjlm

2 months ago

[-]

Very impressive blogpost. No wonder it took 6 months. Makes me think I need to step up the game with my photo ASCII art compositor, printscii.com

Johnny_Bonk

2 months ago

[-]

Amazing post, I was able to take what you did and recreate it and have some fun, matrix green etc. Thanks for the great post

estimator7292

2 months ago

[-]

Those 3D interactive animations are the smoothest 3D rendering I've ever seen in a mobile browser. I'm impressed

_blk

2 months ago

[-]

Wow. Pretty cool. Now just replace the characters with the set from the Matrix and swallow the blue pill.

mark-r

2 months ago

[-]

This is something I've wanted to do for 50 years, but never found the time or motivation. Well done!

fragmede

2 months ago

[-]

very cool. I may have to look a bit closer at the pipeline I used to create the art at ssh funky.nondeterministic.computer. The graphics could always be improved, however I will note that it needs color for best effect.

zdimension

2 months ago

[-]

Well-written post. Very interesting, especially the interactive widgets.

pcj-github

2 months ago

[-]

Nice work! ASCII rendering will never be the same, in a good way.

LowLevelBasket

2 months ago

[-]

Comments weren't kidding. Amazing post. Great job

account42

2 months ago

[-]

> Application error: a client-side exception has occurred (see the browser console for more information).

Thanks for erasing all the content once the page loads, saved me the time I would have spent reading the article.

There really needs to be a name for error handling that is worse than the initial error.

charmpic

2 months ago

[-]

I want to use this technology to make a game.

jwr

2 months ago

[-]

Hmm. This renderer is impressive. Will it be available for toy projects? (such as an online page with JavaScript for converting family pictures)

jurf

2 months ago

[-]

This is at the same time super cool and really disappointing, as I've been carrying around this idea in my head for maybe ten years as a cool side project and never got around to implementing it.

However, there might still be room for competition, heh. I always wanted to do this on the _entirety_ of Unicode to try getting the most possible resolution out of the image.

steve1977

2 months ago

[-]

Thanks! This article put a genuine smile on my face, I can still discover some interesting stuff on the Internet beyond AI slop.

BarryGuff

2 months ago

[-]

Great article!

blauditore

2 months ago

[-]

Nice! Now add colors and we can finally play Doom on the command line.

More seriously, using colors (not trivial probably, as it adds another dimension), and some select Unicode characters, this could produce really fancy renderings in consoles!

krallja

2 months ago

[-]

"finally"? We were playing Quake II in AAlib in 2006. https://www.jfedor.org/aaquake2/

jrmg

2 months ago

[-]

At least six dimensions, right? For each character, color of background, color of foreground, and each color has at least three components. And choosing how the components are represented isn’t trivial either - RGB probably isn’t a good choice. YCoCg?
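To make the YCoCg suggestion concrete, here is a sketch of the standard RGB to YCoCg transform and of the six extra dimensions per character cell. It assumes RGB components as floats in [0, 1]; `cell_color_vector` is a hypothetical helper, not something from the article.

```python
def rgb_to_ycocg(r, g, b):
    """Standard YCoCg transform: luma plus two chroma differences."""
    y = 0.25 * r + 0.5 * g + 0.25 * b
    co = 0.5 * r - 0.5 * b
    cg = -0.25 * r + 0.5 * g - 0.25 * b
    return y, co, cg


def ycocg_to_rgb(y, co, cg):
    """Exact inverse of rgb_to_ycocg."""
    tmp = y - cg
    return tmp + co, y + cg, tmp - co


def cell_color_vector(fg_rgb, bg_rgb):
    """Six color dimensions for one cell: foreground then background."""
    return rgb_to_ycocg(*fg_rgb) + rgb_to_ycocg(*bg_rgb)
```

One appeal of YCoCg over plain RGB is that brightness sits in a single component, so luma and chroma can be weighted differently when computing distances. Whether that actually matches better than a perceptual space like Lab would need testing.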

chrisra

2 months ago

[-]

Next up: proportional fonts and font weights?

2 months ago

[-]

I had been thinking of messing around with a DOM-based ‘console’ in Tauri that could handle a lot more font manipulation for a pseudo-TUI application similar to this. It's definitely possible! It would be even simpler to do in TS.

monitron

2 months ago

[-]

> The image of Saturn was generated with ChatGPT.

Wait...wh...why?!? Of all the things, actual pictures of the planet Saturn are readily available in the public domain. Why poison the internet with fake images of it?

https://news.ycombinator.com/newsguidelines.html

dang

2 months ago

[-]

"Please don't pick the most provocative thing in an article or post to complain about in the thread. Find something interesting to respond to instead."

"Eschew flamebait. Avoid generic tangents."

https://www.theverge.com/2023/3/13/23637401/samsung-fake-moo...

pjc50

2 months ago

[-]

Are we sure the planets are real?

taneq

2 months ago

[-]

How can planets be real if our eyes aren’t real?

userbinator

2 months ago

[-]

More like, why have it regurgitate something likely to have been in its training data?

echelon

2 months ago

[-]

> > The image of Saturn was generated with ChatGPT.

> Wait...wh...why?!?

It has just begun. Wait until nobody bothers using Wikipedia, websites, or even one day forums.

This is going to eat everything.

And when it's immediate to say something like, "I need a high contrast image of Saturn of dimensions X by Y, focus on Saturn, oblique angle" -- that's going to be magic.

We'll look at the internet and Google like we look at going to the library and grabbing an encyclopedia off the shelves.

The use of calculators didn't kill ingenuity, nor did the switch to the internet. Despite teachers protesting both.

Humans will always use the lowest friction thing, and we will never stop reaching for the stars.

taneq

2 months ago

[-]

I’ve been having The Talk with my kids recently. They’ll say “I looked up this question and the answer was X.” And I’ll ask “was that answer on a credible website, or was it an AI summary?” And then explain, again, that LLMs are great at producing plausible sounding explanations for things, but that you have to ground-truth anything that they tell you if it’s important that it’s correct.

leptons

2 months ago

[-]

Some countries are banning social media for teenagers, but they really should be banning "AI" for all teenagers. Most adults can't even be trusted with asking an "AI" about anything, so children are going to have a very warped world view the more they interact with "AI". The tech really is not ready for prime time.

echelon

2 months ago

[-]

I see plenty of people getting real work done with it.

Why on earth would we ban it?

leptons

2 months ago

[-]

I see plenty of people taking "hallucinations" as the truth, and teenagers above all do not have the mental capacity to tell truth from nonsense, so they are learning things that are completely false from "AI". Teenagers are not "people getting real work done with AI". I'm not sure how you could so completely misunderstand my comment.

awesome_dude

2 months ago

[-]

I, for one, have been hoping that AI slop would cause people to be a LOT more cynical about the information they get (from the internet in particular, but from any source in general)

But it's not happened yet

echelon

2 months ago

[-]

What statistical measures of "people" are you doing to measure this? How can you be sure nothing has changed?

Anecdotally, I'm seeing a lot of "it looks like AI" comments on photos and videos now. That's the new "is it Photoshop?"

I'd hold off on judgment until we get population studies on this.