Hacker News new | ask | show | jobs
by thesz 1339 days ago
These compute units are usually sliced - they can perform either four FP32 multiples or one FP64 multiply on the same die part. This trick was done as long ago as PA-RISC was developed, from what I remember it was HP who introduced sliced ALU, capable of doing one large or several smaller operations on the same hardware.

I can be wrong about who did that first, but most FPUs now are done like that.

1 comments

On GPUs, they're not sliced like this anymore since quite a long time to save die area.
The slicing was introduced to save die area. Not to slice is to have slightly smaller computation delay traded for greater die area.