What's the most amazing is that Elon mentioned on twitter that it's only using data gathered in the past month since they announced the hardware/first video.
That's a little bit of a misdirection. They certainly learned a lot about how to build their models and what data is important, and this results in more effective training.