For a production GCP API, I think faster than real-time would be necessary.
For example, WaveNet took a year to go from research to production in Google Assistant: https://deepmind.com/blog/wavenet-launches-google-assistant/