Hacker News new | ask | show | jobs
by termain 4468 days ago
"You have the inverse story at the low end of the family: the lowest Mill has only five pipes, and no floating point at all. Not barn-burning performance, but much lower power even than existing non-OOO offerings."

Does the Mill even need an FP unit? Or rather, couldn't a VLIW architecture be able to emulate floating point in such a way that it's nearly as fast and/or more flexible as far as precision and/or might be more optimizable for certain values?

Minimally, if you break down the FP opp into it's constituent integer operations, you put all of those in flight at the same time or schedule them to hide latencies of other operations, I would think.