Hacker News
by m101 462 days ago
I feel like this is true, but it would be great if you could provide examples so we could get a better idea of why you think (or know) this.
1 comment

I work for DeepMind on Project Astra. Without delving too deep into the confidential details of the capabilities I have been looking at, the recurring theme since the Flamingo model has been that you only gain about one model generation by fine-tuning versus prompt-tuning.