Do you think we will recognize any walls? Or is there a point where the output looks different depending on the paradigms/modalities we throw at it, but we won't be able to judge the quantitative differences as good, bad, or scalable?