| HN Mirror

	by abcdef123xyz123 2655 days ago
	We have been moving away from IB for our platform (algorithmic trading) since Ethernet now has almost comparable latency and is a lot easier to understand and manage.

3 comments

angry_octet 2655 days ago

Latency really isn't comparable, but it probably isn't an issue for algorithmic trading now. There are still many other closely coupled codes for which it dominates cluster performance.

IB is actually easier to reason about and debug than DCE, but obviously a different community of practice.

shaklee3 2655 days ago

Understand, manage, and buy networking equipment for. Infiniband is a thing of the past, especially with the Mellanox VPI adapters that support both Ethernet and Infiniband with a single bit flipped on the adapter.

jabl 2655 days ago

IDK, IME IB is pretty much plug and play in an HPC setting. Plain Ethernet is too, sure, but if you want to do HPC-type workloads you'll have to do a lot of configuration and testing to set up DCB, RoCEv2, EVPN+VXLAN+BGP or such.

But I think this is the way the market is going in the longer term.

shaklee3 2655 days ago

It is, but if you're a large enough company to be buying millions of dollars in adapters and switches, reading a guide from Mellanox to turn on DCB should be fairly seamless. RoCEv2 is API-compatible with IB for the most part, so there is really no configuration on that layer. The other pieces -- not really sure what you're getting at. Most of those are for going across data centers, which IB won't do anyways. At least Ethernet would give you the option to run RDMA from the east coast to the west coast.

jabl 2655 days ago

> The other pieces -- not really sure what you're getting at.

What I'm getting at is setting up clusters larger than what you can fit behind a single switch. So you'll want e.g. a Clos fabric with multipathing (the typical IB setup, FWIW). As TRILL and SPB seem pretty dead, it seems the momentum is to do the multipathing at the L3 level, using the aforementioned EVPN+VXLAN+BGP, or something similar.
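
To put rough numbers on "larger than what you can fit behind a single switch", here is a back-of-the-envelope sketch. It assumes a non-blocking two-tier leaf/spine fabric built from identical switches, with each leaf splitting its ports evenly between hosts and spine uplinks. The function name and the 36-port radix are illustrative assumptions, not something taken from the thread.

```python
def two_tier_clos_capacity(radix: int) -> dict:
    """Size a non-blocking two-tier leaf/spine fabric built from identical switches.

    Assumption: every leaf uses half its ports for hosts and half for uplinks,
    with exactly one uplink to each spine.
    """
    if radix < 2 or radix % 2:
        raise ValueError("radix must be an even number >= 2")
    hosts_per_leaf = radix // 2  # downlinks to hosts
    spines = radix // 2          # one uplink per spine
    leaves = radix               # each spine port feeds one leaf
    return {
        "leaves": leaves,
        "spines": spines,
        "hosts": leaves * hosts_per_leaf,
        # Each spine offers one equal-cost path between any two leaves.
        "paths_between_leaves": spines,
    }


print(two_tier_clos_capacity(36))
```

Under these assumptions, 36-port switches give 648 hosts instead of 36, and any two leaves are connected by 18 equal-cost paths, which is why the fabric needs some form of multipathing to be used fully.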

wmf 2655 days ago

You really don't need EVPN+VXLAN though. (And if you do need it I recommend finding a way to not need it.)

jabl 2655 days ago

You mean you have separate subnets for each leaf switch, and then BGP or such for multipath routing between the leaf and spines? Sure, but what about subnet-level services like DHCP & PXE? Sounds cumbersome if you have to replicate that across all your leaf switches?

Or maybe you could do one "provisioning and admin" VLAN that spans the entire cluster and which uses spanning tree, and then the high-performance RDMA stuff uses the per-leaf VLANs and L3 multipath routing? Is that simpler and better performing than EVPN + VXLAN?

What is the routing latency on such BGP setups BTW? I find it hard to imagine you can get even close to eth (not to mention IB!) L2 latencies? Or can the fast paths be done in hw (or FPGAs)?
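
For concreteness, the "separate subnet per leaf, L3 multipath between leaf and spines" layout can be sketched as a toy model. The 10.0.0.0/16 supernet, the /24 per leaf, the function names, and the CRC32-based uplink choice are all illustrative assumptions. Real switches hash flows in hardware with vendor-specific functions, and this sketch says nothing about latency or about how DHCP and PXE would be handled.

```python
import ipaddress
import itertools
import zlib


def per_leaf_subnets(supernet: str, num_leaves: int, prefixlen: int) -> list:
    """Carve one subnet per leaf switch out of a larger address block."""
    net = ipaddress.ip_network(supernet)
    subnets = list(itertools.islice(net.subnets(new_prefix=prefixlen), num_leaves))
    if len(subnets) < num_leaves:
        raise ValueError("supernet is too small for that many leaves")
    return subnets


def pick_spine(src_ip: str, dst_ip: str, src_port: int, dst_port: int,
               proto: str, num_spines: int) -> int:
    """Choose an uplink by hashing the flow 5-tuple (ECMP-style).

    All packets of one flow map to the same spine, which avoids reordering,
    while different flows spread across the available spines.
    """
    key = f"{src_ip},{dst_ip},{src_port},{dst_port},{proto}".encode()
    return zlib.crc32(key) % num_spines


for leaf, subnet in enumerate(per_leaf_subnets("10.0.0.0/16", 4, 24)):
    print(f"leaf{leaf}: {subnet}")

print("spine index:", pick_spine("10.0.0.5", "10.0.3.9", 49152, 4791, "udp", 4))
```

Traffic between hosts on the same leaf stays at L2 in this model. Only leaf-to-leaf traffic is routed, with the hash deciding which spine carries each flow.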

godelmachine 2655 days ago

May I ask what’s IB?

ckozlowski 2655 days ago

InfiniBand

https://en.wikipedia.org/wiki/InfiniBand

sacheendra 2655 days ago

infiniband - The high performance interconnect by Mellanox

patrickg_zill 2655 days ago

Infiniband is actually a standard, and Mellanox became the go-to supplier. However, in the early stages there were many more suppliers.

baruch 2655 days ago

Intel bought one of the IB players and for all I know killed it.

jabl 2655 days ago

Intel bought QLogic's InfiniBand business and made some proprietary enhancements to the tech, which they now sell under the Omni-Path brand.

SEJeff 2655 days ago

By Voltaire, the Infiniband company acquired by Mellanox :)

Their ethernet accelerator VMA stands for "Voltaire Messaging Accelerator".

wmf 2655 days ago

IIRC Mellanox was always doing Infiniband; they made the hardware and Voltaire wrote the drivers (before people understood open source and created OFED).

SEJeff 2655 days ago

Mellanox made the HCAs (aka network cards) and Voltaire made the switches. Mellanox had their own drivers and their own MOFED (their incompatible fork of the upstream OpenFabrics OFED, with Mellanox-specific enhancements).

ams6110 2655 days ago

We just pulled a Voltaire switch from our data center. They definitely made hardware. Or at least put their name on it.
