|
|
|
|
|
by cmroanirgo
1384 days ago
|
|
It definitely gets a bit murky when dealing with mbcs, when you want characters spanning multiple bytes rather than individual bytes. I understand the topic is strXxx() funcs which are ascii only, but it does need to be said that size!=len for wide and multi char sets. |
|
Honestly "string" is a very harmful word that we've all grown used to. As an abstraction it sits somewhere between raw bytes and properly encoded text with proper unicode functions such as those provided by ICU. Python 3 finally forced people to start thinking about this stuff and nobody liked it.