Hacker News new | ask | show | jobs
by changoplatanero 1537 days ago
Aren’t most of the models at the top not suitable for text generation? That’s what makes gpt different from Bert
1 comments

What are the models at the top used for? Excuse my ignorance.
Mostly mask fill, but Transformers can be fine tuned to downstream tasks relatively easily (T5 was built for translation but is used for autocomplete in many cases)
would you mind sharing some references (or even just googleable terms) for this process of fine tuning?