How does preflight deal with selectors used to identify elements changing? This is the critical piece to solve before systems like this are useful. As identified in the paper discussed here:
https://blog.acolyer.org/2016/05/30/why-do-recordreplay-test...