by Dylan16807
263 days ago
You're focusing on the wrong part of the problem when you say "essentially meaningless". Yes, choices must be made about how you order your "alphabet". But the meat of the request is that sorting goes character by character. That's a clear criterion, even with Unicode involved. And I would say the reasonable way to define "character" is the grapheme cluster, and yes, you want the result to be stable under normalization and encoding. How capital letters, diacritics, and different representations affect the order of your alphabet, and which ones are considered equivalent, is something without a clear answer. The same goes for whether letters or numbers come first, and where punctuation goes. But you don't need consensus on any of that to fix the problem in the post.
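To make that concrete, here is a rough Python sketch of the split I mean. The names `clusters`, `alphabet_rank`, and `sort_key` are made up for illustration. `clusters` only approximates grapheme clusters as a base character plus any combining marks that follow it (real segmentation per UAX #29 also covers emoji sequences, Hangul, and more), and `alphabet_rank` is just one arbitrary alphabet choice among many. The point is that the character-by-character part stays the same no matter which alphabet you plug in.

```python
import unicodedata


def clusters(text: str) -> list[str]:
    """Split text into approximate grapheme clusters.

    Assumption: a cluster is a base character plus any combining marks
    that follow it. This is a simplification of UAX #29, not a full
    implementation.
    """
    out: list[str] = []
    # Normalizing first makes the result independent of whether the input
    # used precomposed or decomposed forms.
    for ch in unicodedata.normalize("NFC", text):
        if out and unicodedata.combining(ch):
            out[-1] += ch  # attach the combining mark to the previous base
        else:
            out.append(ch)
    return out


def alphabet_rank(cluster: str) -> tuple[str, str]:
    """One hypothetical alphabet order: compare the case-folded base letter
    first, then fall back to the exact cluster as a tie-breaker."""
    base = unicodedata.normalize("NFD", cluster)[0]
    return (base.casefold(), cluster)


def sort_key(text: str) -> list[tuple[str, str]]:
    """Character-by-character key: one rank per cluster, compared in order."""
    return [alphabet_rank(c) for c in clusters(text)]


words = ["zebra", "Éclair", "apple", "eclair"]
print(sorted(words, key=sort_key))
```

With this particular alphabet the list comes out as `['apple', 'eclair', 'Éclair', 'zebra']`, and `"caf\u00e9"` and `"cafe\u0301"` produce identical keys. Swapping in a different `alphabet_rank` changes where É lands, but not the fact that comparison proceeds one cluster at a time.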