Example: [https://fontgenerator.design/symbol/almost-equal-to](https://fontgenerator.design/symbol/almost-equal-to)
Includes Unicode, HTML, CSS, JS, UTF-8/16 bytes, URL encoding, and usage examples.
The same structure is used across thousands of symbols (math, arrows, currency, tech/UI, punctuation).
Built because existing references are fragmented. Feedback welcome.
FileFormat.Info[1] has a page per codepoint. It has been around awhile, so the UI isn't as whizzy, but it has all the data and works w/o JavaScript
UnicodeSearch[2] is an updated search UI that uses JavaScript and the excellent Tabulator grid widget.
There are actually a ton of similar sites with a page-per-codepoint. It is all fun to make one, until the bots come along and hammer every page.
[1] https://www.fileformat.info/info/unicode/char/2248/index.htm
Or better, if 90% of all symbol names are abbreviated, your design simply doesn't work. This is especially apparent in the "arrows" section.
The abbreviations (arrows in particular) made scanning worse than it should be. I’ve pushed a quick update: full names on hover/focus, plus a Compact / Readable toggle for the grid.
Thanks for the nudge.
Oh well! Still good.
When I click the "Click to copy" my UI reflex tells me to look for a "Copied!" or similar acknowledgement. But I don't see one, so there's uncertainty if it was copied safely to my invisible clipboard or not.
Please keep making this, it's good! What inspired you for the design? I like this style, and notice it around, but can't pinpoint.
There is also a "Copied x" toast (is this the correct term? idk) at the bottom of the viewport when you click a character box, maybe it was also added later on.
I found it odd, that tapping on a square “highlights” it, by making it “pop,” but nothing else really happens.
It took me a bit to figure out that I need to actually select the arrow in the upper right corner, to get the page.
Using the `unicode' command from plan9userspace, for example
unicode 2ff0-2fff
the last three symbols are shown inside emacs as squares with the four hex values inside.Typing in the search field `2fff' finds `no match found'.
The symbols search now supports code-point lookup, so you can search by:
U+2FFF
0x2FFF
plain hex (2fff, 4–6 digits)
This makes it possible to jump straight to a symbol page even when the glyph doesn’t render locally and you only have the code point (like the Emacs/BSD case you described).
One limitation to note: some symbols aren’t covered by common or default system fonts, so they may still appear as tofu boxes depending on the font stack. That’s a font coverage issue rather than Unicode itself.
Appreciate you calling this out — this was exactly the kind of workflow gap I wanted to catch.
Maybe get rid of all the noise and just display the symbols in a nice grid without all the fluff or layers.
Please don't display text directly on the grid background image. It makes it impossible to read the text easily. Currently, this is the case when you open the page for a specific symbol in the 'Usage & Context' section.
Fixed.
Please no: just write the character. <, & and (in quoted attributes) " or ' are the only characters that need to be encoded; a few others have arguable benefit to being encoded (most notably NO-BREAK SPACE), but most Unicode characters should just be put in literally. The days when you couldn’t be confident of the file encoding are past: your HTML is being served as UTF-8 (or in the rare case it isn’t, you should fix that instead of avoiding non-ASCII in the source).
Same deal with CSS (" and \ are the only ones you need to escape) and JavaScript (" or ' or `, as appropriate).
URLs? Occasionally you may encounter a legacy system where you need to percent-encode it yourself (similarly around punycoding internationalised domain names), but you can almost always (and thus, in my opinion, should) just write it and leave anything that wants it to be ASCII to perform the percent-encoding itself.
Excel I can’t comment on, but I presume you can just write "≈" and UNICHAR should almost never be used.
For modern HTML/CSS/JS, you should just write the character and serve UTF-8. The entities / codes are there purely as reference for legacy cases, debugging, or when you only have a code point and no rendered glyph — not as a recommendation for normal authoring.
I don't need to be told on each one to "Click to Copy".
But nice concept.
No. Please just give me an option to reject all tracking cookies instead of just kicking me in the face with a done deal.
Whoever wrote this 'EU/UK users: this serves as our cookie notice' is ignorant of the actual law. Have a look at: