by avianes
1433 days ago
Yes, the issue raised definitely does not prevent a high-performance implementation. But it's still interesting to ask whether this is an unnecessary cost. The concern I have here is that answering this question requires very good microarchitectural knowledge of vector units, and the author doesn't seem to have it, yet he reaches a confident conclusion. How does one reach a conclusion with so much confidence on a technical subject that one does not know?
Whilst the presentation of this as a ‘problem’ is debatable, as may be some of the reasoning, it doesn’t seem to me that it’s an unreasonable question to ask.