| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by ashirviskas 940 days ago
	>Flops really are quite cheap by now, e.g. vision inference chip ~$2/teraflop/s !! I'm really interested, can you share where you got these numbers?

1 comments

algo_trader 940 days ago

Axelera [1] or Halio [2] give you 100-200tflop for ~$200.

8-bit ops, inference only, low memory embedded, excluding the host, implied utilization from FPS specs is ~20%

But the trend is there.

There are also newer ADAS/AV units from China which claim 1000tflops and cant really cost more than $1000/$2000 per car.

These are all tiled designed (see also dojo/tesla) heavily over-weighed on flops vs memory

[1] https://www.axelera.ai/

[2] https://hailo.ai/

link

Y_Y 940 days ago

You can't get flops on a Hailo-8, they're fixed-point only. As much as these specialised inference chips are cool, we're a long way from just being able to drop them in where a GPU was. Not to mention the memory is hugely constrained. The Hailo chips I've worked with were all limited to 20MiB for the weights which is a squeeze even at 4-bit.

link