Hacker News
by eeks 2937 days ago
I will certainly get dozens of downvotes for obvious fanboyism, but I'm prepared to take the heat.

For the doubters and the disbelievers who have been wondering what the relevance of IBM is in this day and age: this. This is what IBM is all about.

And it's not just about PFLOPS: each node has 1/2 terabyte of memory, globally addressable across the entire cluster using RDMA over Mellanox 200Gb/s EDR.

It's also P9: 44 cores per node. But most importantly, each node drives a couple of V100s through NVLink, which allows the GPUs to share the system's main memory.

2 comments

FYI - It is generally frowned upon to make a comment on HN along the lines of “I will get downvoted, but...”. It is completely unnecessary and you could just make your point without it.

Also, you can defend yourself later when people respond. Prematurely saying “I am ready to take the heat” is not how we communicate here.

NVLink allows GPUs to share the system's memory?
Summit node CPUs can access GPU memory coherently, and unified memory allows for a single pointer across all processors. On most systems that involves page faults migrating pages, but Summit has something called ATS (Address Translation Services) that allows the GPU to directly access all system memory.

https://vimeo.com/262870773/recommended
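
To make the "single pointer across all processors" point concrete, here is a minimal sketch using CUDA managed memory. It is not Summit-specific code: `add_one` and `run_unified_memory_demo` are names made up for illustration, and it assumes any CUDA-capable GPU. On a typical system the runtime migrates pages on fault; the ATS behaviour described above would presumably let the GPU reach the same data without that migration, but the code does not depend on it.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Kernel: each thread increments one element through the shared pointer.
__global__ void add_one(float *data, size_t n) {
    size_t i = blockIdx.x * (size_t)blockDim.x + threadIdx.x;
    if (i < n) data[i] += 1.0f;
}

// Hypothetical helper: returns the number of wrong elements (0 on success),
// or -1 if a CUDA call fails.
long run_unified_memory_demo(size_t n) {
    float *data = nullptr;
    // One allocation, one pointer, usable from both CPU and GPU code.
    if (cudaMallocManaged(&data, n * sizeof(float)) != cudaSuccess) return -1;

    for (size_t i = 0; i < n; ++i) data[i] = (float)i;  // CPU writes

    unsigned threads = 256;
    unsigned blocks = (unsigned)((n + threads - 1) / threads);
    add_one<<<blocks, threads>>>(data, n);              // GPU updates the same pointer
    if (cudaDeviceSynchronize() != cudaSuccess) {
        cudaFree(data);
        return -1;
    }

    long mismatches = 0;
    for (size_t i = 0; i < n; ++i)                      // CPU reads the results back
        if (data[i] != (float)i + 1.0f) ++mismatches;

    cudaFree(data);
    return mismatches;
}

int main() {
    long bad = run_unified_memory_demo(1 << 20);
    printf("mismatches: %ld\n", bad);
    return bad == 0 ? 0 : 1;
}
```

Note there is no explicit `cudaMemcpy` anywhere: the same `data` pointer is dereferenced on the host and inside the kernel.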

IBM integrated NVLink into their Power CPUs, well I'll be damned.
Think of it as a much higher bandwidth PCIe. You still have to think about data locality, the same way you do with CPU cache levels. It just makes the problem of not having data on the GPU less painful than before.
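
If you want to see what the host-to-GPU link on your own machine actually delivers, a rough sketch like the following times one pinned-memory copy with CUDA events. `measure_h2d_bandwidth` is a hypothetical helper name and the 256 MiB buffer size is an arbitrary choice; the number it prints depends entirely on the hardware (PCIe generation, NVLink, and so on), so none is quoted here.

```cuda
#include <cstdio>
#include <cstring>
#include <cuda_runtime.h>

// Hypothetical helper: returns host-to-device copy bandwidth in GB/s,
// or -1.0 if allocation fails.
double measure_h2d_bandwidth(size_t bytes) {
    void *host = nullptr, *dev = nullptr;
    if (cudaMallocHost(&host, bytes) != cudaSuccess) return -1.0;  // pinned host buffer
    if (cudaMalloc(&dev, bytes) != cudaSuccess) {
        cudaFreeHost(host);
        return -1.0;
    }
    memset(host, 0, bytes);

    cudaEvent_t start, stop;
    cudaEventCreate(&start);
    cudaEventCreate(&stop);

    // Warm-up copy so one-time setup cost is not included in the timing.
    cudaMemcpy(dev, host, bytes, cudaMemcpyHostToDevice);

    cudaEventRecord(start);
    cudaMemcpy(dev, host, bytes, cudaMemcpyHostToDevice);
    cudaEventRecord(stop);
    cudaEventSynchronize(stop);

    float ms = 0.0f;
    cudaEventElapsedTime(&ms, start, stop);  // elapsed time in milliseconds

    cudaEventDestroy(start);
    cudaEventDestroy(stop);
    cudaFree(dev);
    cudaFreeHost(host);

    if (ms <= 0.0f) return -1.0;
    return (bytes / 1e9) / (ms / 1e3);
}

int main() {
    double gbps = measure_h2d_bandwidth((size_t)256 << 20);
    printf("host-to-device: %.2f GB/s\n", gbps);
    return gbps > 0.0 ? 0 : 1;
}
```

Whatever the link speed turns out to be, it is still well below on-GPU memory bandwidth, which is why locality keeps mattering even with a faster interconnect.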