	by flyingswift 1829 days ago
	What is the safe way to achieve the same result?

3 comments

Kranar 1829 days ago

The concept behind this is reference stability. If you need a collection with stable references, you must introduce a level of indirection: instead of a vector<T>, use a vector<unique_ptr<T>>, and then take references as follows:

    auto& r = *some_vector[0];
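
A minimal sketch of why the indirection helps, assuming a hypothetical `Widget` element type and a helper function name chosen only for illustration: the reference points at the heap object owned by the `unique_ptr`, so reallocating the vector moves the pointers but not the objects.

```cpp
#include <memory>
#include <string>
#include <vector>

// Hypothetical element type, for illustration only.
struct Widget {
    std::string name;
};

// Returns true if a reference taken before growth still refers to the same
// live object after the vector has reallocated.
bool reference_survives_growth() {
    std::vector<std::unique_ptr<Widget>> widgets;
    widgets.push_back(std::make_unique<Widget>(Widget{"first"}));

    Widget& r = *widgets[0];       // refers to the heap object, not the vector slot
    const Widget* before = &r;

    // Force several reallocations; only the unique_ptrs are moved.
    for (int i = 0; i < 1000; ++i) {
        widgets.push_back(std::make_unique<Widget>(Widget{"extra"}));
    }

    return before == widgets[0].get() && r.name == "first";
}
```

The reference stays valid only while the owning `unique_ptr` is alive; erasing or resetting that element still destroys the object.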

owl57 1829 days ago

Or std::deque. It's conceptually similar, but additional allocations are hidden under the hood and batched.
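
A small sketch of the guarantee being relied on here, with a function name that is purely illustrative: inserting at either end of a `std::deque` invalidates iterators but leaves references to existing elements valid. Inserting or erasing in the middle does not give that guarantee.

```cpp
#include <deque>

// Returns true if a reference to an element is still valid after many
// push_back calls on the deque.
bool deque_reference_survives_push_back() {
    std::deque<int> values{10, 20, 30};

    int& r = values[1];
    const int* before = &r;

    // Insertion at the end allocates new blocks but never moves old elements.
    for (int i = 0; i < 10000; ++i) {
        values.push_back(i);
    }

    return before == &values[1] && r == 20;
}
```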

MauranKilom 1829 days ago

std::deque is as good as useless. The defaults for "batch size" in different compilers are at extreme opposites of the tradeoff spectrum. So unless you really don't care about performance, memory or portability, it's not a data structure you can rely on.

From memory:

In MSVC, deque will allocate memory for every single element if your elements are > 8 bytes. This will never be changed, due to ABI compatibility.

Clang and gcc have batching sizes of 1K and 4K (i.e. you throw out a whole page of memory even if your deque contains only 1 element).

owl57 1829 days ago

I had a vague feeling that std::deque is a "heavy" thing which you shouldn't have a million of, but iterating through a couple big ones is pretty fast. 1–4K batches wouldn't hurt my feelings. Looked up GCC, it's actually slightly less heavy at 512 bytes per node[1]. But the MSVC part — that caught me completely off guard.

[1] https://github.com/gcc-mirror/gcc/blob/47749c43acb460ac8f410...

jcelerier 1829 days ago

In general it's better to use the same standard library everywhere. Discrepancies like that occur for almost every type, so if you care about having the same performance on every platform, use either libc++ or Boost.

gpderetta 1829 days ago

IIRC Boost.Container has a standard-conforming deque with a configurable batch size.

flyingswift 1829 days ago

Thanks! I am just learning C++ for a new gig, and coming from JavaScript land, it is a lot to take in :)

layoutIfNeeded 1829 days ago

Yikes. I feel sorry for your client in advance. One does not learn C++ “for a gig”, especially when coming from JS…

Karsteski 1829 days ago

For all you know, this person is simply talking about a new job. Is it necessary to be so condescending..? Sigh

saagarjha 1829 days ago

I would generally suggest avoiding reference stability here (extra heap allocations) and going with the offset-based approach mentioned in the other responses.

Kranar 1829 days ago

I would generally suggest going for correctness over performance and the solution I provided is correct in the general case. Using an offset is only correct in the special case where objects will not be inserted or removed at an index less than the offset, otherwise you will end up with bugs as the offset becomes invalid upon such operations.
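
To make the failure mode concrete, here is a short sketch (the function name and values are made up for illustration) in which a saved offset silently starts naming a different element after an insertion in front of it:

```cpp
#include <cstddef>
#include <vector>

// Returns the value found at a saved index after an element has been
// inserted in front of it.
int value_at_saved_index_after_front_insert() {
    std::vector<int> values{10, 20, 30, 40};

    std::size_t saved = 3;              // intended to track the element 40

    values.insert(values.begin(), 5);   // shifts every element up by one slot

    return values[saved];               // now names 30, not 40
}
```

The access is still in bounds, so nothing crashes; the index simply refers to the wrong element.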

Furthermore, depending on the size of T, the performance penalty of the extra heap allocations is amortized over the cost of resizing the vector. That is, vector reallocation is significantly faster for a unique_ptr<T> than it is for T when T is large. Almost all memory allocators are also tuned to allocate objects close together in space when they are allocated close together in time, so you don't lose the cache locality or need to worry about memory fragmentation.

yongjik 1829 days ago

In addition to other answers, sometimes you do know the final/max length of the vector when you construct it. In that case reserve() can reserve the necessary space, and as long as you stay under the limit all the addresses will remain valid.

(Though it's still pretty brittle, so you may want to add a ton of comments to warn yourself in the future...)
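
A rough sketch of the reserve() approach, assuming an upper bound of 100 elements and a function name picked only for this example: once capacity is reserved, insertions that keep size() within capacity() do not reallocate, so existing pointers stay valid.

```cpp
#include <cstddef>
#include <vector>

// Returns true if a pointer taken early is still valid after the vector has
// been filled up to, but not beyond, its reserved capacity.
bool pointer_survives_within_reserved_capacity() {
    constexpr std::size_t max_elements = 100;  // assumed known upper bound

    std::vector<int> values;
    values.reserve(max_elements);              // one allocation up front

    values.push_back(42);
    const int* p = &values[0];

    // WARNING: pushing a 101st element could reallocate and invalidate p.
    while (values.size() < max_elements) {
        values.push_back(0);
    }

    return p == values.data() && *p == 42;
}
```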

TylerGlaiel 1829 days ago

store the index 3 as an int instead of &vec[3]
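
A minimal sketch of the index approach; it uses std::size_t rather than int, which is a choice made here and not part of the comment, and the function name is illustrative. The index is resolved against the vector at each use, so it survives reallocation when elements are only appended.

```cpp
#include <cstddef>
#include <vector>

// Returns the element read through a saved index after the vector has grown
// and reallocated.
int read_through_index_after_growth() {
    std::vector<int> vec{0, 1, 2, 3};

    std::size_t idx = 3;   // instead of: int* p = &vec[3];

    // Appending may reallocate, which would leave a raw pointer dangling.
    for (int i = 4; i < 10000; ++i) {
        vec.push_back(i);
    }

    return vec[idx];       // looked up in the current storage
}
```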