love everything here but I'm skeptical of real time control via wifi. for me there's always been a noticeable delay in video streaming and receiving control signal so I'm curious how this works?
if you've ever streamed video on a local network over web sockets you'll notice latency. no matter how fast sending text based control messages may be, latency in the video stream will cause users to send controls with the same latency which makes control impossible as far as I know.
if that's true then that might be helpful as your visual feedback comes directly from your eyes and not latent video plus latency for control signals is probably less. thanks for pointing this out