See: https://youtu.be/j0z4FweCy4M (from 54:40 onwards)
I'd be really surprised if they use transformers due to how computationally expensive they are for anything involving vision.
EDIT: Found it, at the 1h mark: https://www.youtube.com/watch?v=j0z4FweCy4M&t=1h
Fascinating. I guess transformers are efficient.