| > We would like to have a string that is very cheap to construct and points to a region of memory that is currently valid, but may become invalid later without the string having control over it. > This is where transient strings come in. They point to data that is currently valid, but may become invalid later, e.g., when we swap out the page on which the payload is stored to disk after we’ve released the lock on the page. > Creating them has virtually no overhead: They simply point to an externally managed memory location. No memory allocation or data copying is required during construction! When you access a transient string, the string itself won’t know whether the data it points to is still valid, so you as a programmer need to ensure that every transient string you use is actually still valid. So if you need to access it later, you need to copy it to memory that you control. Hm. What if I don't bother with that and I just read from the transient string? It's probably still good. > In C, strings are just a sequence of bytes with the vague promise that a \0 byte will terminate the string at some point. > This is a very simple model conceptually, but very cumbersome in practice: > What if your string is not terminated? If you’re not careful, you can read beyond the intended end of the string, a huge security problem! This sounds like a problem that transient strings were designed to exemplify. How do they improve on the C model? ----- I was interested that the short strings use a full 32-bit length field. That's a lot of potential length for a string of at most 12 characters. If we shaved that down to the four bits necessary to represent a number from 0-12, we'd save 28 bits, which is 3.5 characters. Adding three characters to the content would bring the potential length of a short string up to 15, requiring 0 additional length bits. And we'd have four bits left over. I assume we aren't worried about this because strings of length 13-15 are already rare and it adds a huge amount of complexity to parsing the string, but it was fun to think about. |