Hacker News new | ask | show | jobs
by orangepanda 25 days ago
What everyone actually cares about — new emojis!

* Cracking face

* Left/Right thumb sign

* Monarch butterfly

* Pickle

* Lighthouse

* Meteor

* Eraser

* Net with handle

6 comments

Other great additions:

- Left and Right parenthesis with middle ring [1]

- A wiggly exclamation mark expressing mirth or laughter [1] (edit: and something I completely missed: the inverted version can express sarcasm)

- Cuneiform numerals, including lots of arranged dots that might be useful in other contexts [2]

- New variations of "measured angle" and "sector" [3]

- A transparent cube and a white cube [4]

Also a couple new combining marks

And for anyone who wants to see what the reference images for the new emojis look like:

Lighthouse: https://www.unicode.org/charts/PDF/Unicode-18.0/U180-1F680.p...

Other new Emojis: https://www.unicode.org/charts/PDF/Unicode-18.0/U180-1FA70.p...

1: https://www.unicode.org/charts/PDF/Unicode-18.0/U180-2E00.pd...

2: https://www.unicode.org/charts/PDF/Unicode-18.0/U180-12550.p...

3: https://www.unicode.org/charts/PDF/Unicode-18.0/U180-1CEC0.p...

4: https://www.unicode.org/charts/PDF/Unicode-18.0/U180-1F780.p...

I would be more interested if they are ever going to cancel HAN unification. Looking at their "Factors for Exclusion" list it could be summarized by "we made some mistakes in past but are sticking to it" :D
Han Unification was effectively "fixed" by Ideographic Variation Sequences, so no.
You mean theoretically. Effectively, nothing is fixed yet.
IVD works, theoretically and practically (recent versions of OpenType have an explicit support for them). It's not their fault that Japanese vendors have been not very quick to adopt them.
If a Japanese and Taiwanese person type things with their keyboards and end up with the same bytes for different logical characters then no things do not work practically for any practical definition of "practically".
Your argument is absurd because people don't see code---they see glyphs, and using the same code for slightly different glyphs is a non-issue when they are not interchanged. (And when they are interchanged, both would see glyphs "correct" to them anyway.) Japaneses are sensitive to Han unification only because they recognize more glyph variations (Z-variants) than what Unicode originally could, and IVS is exactly a tool for ensuring exact glyphs assuming cooperative vendors. Not to mention that Han unification was already quite weakened by source separation principles in the first place.
Chinese AI labs are reducing Japanese images and text out of AI models - they leave much smaller amount for text models that has to be literate in Japanese, and explicitly nuke it out of dataset for image models so that it only supports Simplified and English languages, so to avoid GIGO.

I mean, making or help making sovereign AI models is nowhere near responsibilities of Unicode, but Han Unification and sort of a default-enforced IVD support is literally adding small but non-zero amount of fuel to cultural division and xenophobia perpetuate in East Asia. I doubt blaming users would work here.

While I agree that Han Unification is not optimal (and fixing them is a welcoming development), it is already too late to reverse it. Even counter-proposals like TRON didn't work at all so far. IVD is the best compromise we can have in this situation.

> cultural division and xenophobia perpetuate in East Asia

By the way, I recently have seen multiple claims from Japanese Twitter users that Korea would have been better keeping Chinese characters (Hanja) in use. If this is a cultural division and xenophobia we are talking about, I will gladly take it---why on earth do they have any saying in Korea's choice of scripts? The "sinosphere" is an illusion, the fact that CJKV countries have or had shared the same set of characters is just a fun fact and not a cultural mandate or anything else like that.

Cancel how? There's documents encoded like that which would break it it were changed now.

Unicode takes backward compatibility like this very seriously.

Sadly it looks like it will be a dead monarch butterfly

https://www.emilydamstra.com/please-enough-dead-butterflies/

I need a table emoji because then I could combine it with a horse emoji. This would be "Pferd Tisch" (Horse Table) in German which sounds similar to "Fertig" which translates to "done". Yes I want it only for that dumb joke.
Still no seahorse
If the seahorse emoji is introduced, we will have to train new foundation models. The costs connected to the introduction of the seahorse emoji will be in the billions.
You're absolutely right—the seahorse emoji was added in Unicode version 19.0.0 after OpenAI purchased the Unicode Consortium and converted it to a for-profit corporation.
The seahorse being, of course, among the first commercial Unicode characters that require a subscription to use.
Does anyone know why a monarch butterfly was added, when there is already a butterfly emoji?