The comparison is not great for objective comparison—the metrics of Interface have been changed so that 11px =/= 11px from one to the other… this was the best approximation that I could get in-browser. Usually you'd want to set the two typefaces up so that the x-height is the same, and in this case Roboto is probably slightly larger still, and hence a bit darker in overall color.
Except that isn't how font metrics work—the size of the type on the body is not necessarily equivalent between two different fonts. There is no stanard that says that 12pt Times shares any vertical metrics with 12pt Helvetica, or 12pt anything else.
The most informed comparision can be made when the x-height is optically equivalent. The other option, cap height, isn't as useful because it is then harder to judge the lowercase (a majority of the letterforms).
I think what jack's saying is that "font-size" to some extent is arbitrary. It's not scientific or worthwhile to compare fonts across a specific size if they have different x-heights. The only meaningful comparison is to evaluate legibility for fonts that are "relatively" the same size. For example, if Interface 12pt is equivalent in x-height to Roboto at 12.5pt, and Roboto is more legible, and legibility is the goal, then obviously the user should pick Roboto at 12.5pt (all things considered).
Also subjective, but at that size on my machine (13" MBA, non-Retina, El Capitan, Chrome), I find Roboto is a lot easier to read. Interface looks 'greyer' and the contrast isn't so good.
While not the intended use case, I find Interface to be surprisingly readable at extremely tiny font sizes (7, 6, and even 5!).
Sure, you shouldn't be designing an interface with those sizes, but it may be something to consider for some zooming interfaces (primarily maps with lots of tags: street names, POIs, etc.)