Hacker News new | ask | show | jobs
by colordrops 1528 days ago
What are the models at the top used for? Excuse my ignorance.
1 comments

Mostly mask fill, but Transformers can be fine tuned to downstream tasks relatively easily (T5 was built for translation but is used for autocomplete in many cases)
would you mind sharing some references (or even just googleable terms) for this process of fine tuning?