| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by cmovq 127 days ago
	That makes sense. LLVM could probably do better here by using the memory operand version: https://godbolt.org/z/jeqbaPsMz

2 comments

jxors 127 days ago

The memory operand version tends to be as slow or slower than the manual implementation, so LLVM is right to avoid it.

link

cmovq 124 days ago

Right, it has much worse throughput:

Memory: https://uica.uops.info/tmp/f022a3c0a70e4ae5ab3588ebe65fd2a5_...

link

ack_complete 127 days ago

Don't think the memory operand version would work here. If I understand the x86 architectural manual description, the 32-bit operand form interprets the bit offset as signed. A 64-bit operand could work around that but then run into issues with over-read due to fetching 64 bits of data.

link