|
|
|
|
|
by sliken
455 days ago
|
|
273GB/sec with good FP4 performance should be fine for developers playing with inference. This isn't the kind of thing that you'd use for inference workloads supporting millions of users. I'd like to see a inference benchmark vs the strix halo, which has better memory bandwidth and costs 2/3rds as much. |
|