Y
Hacker News
new
|
ask
|
show
|
jobs
by
changoplatanero
1537 days ago
Aren’t most of the models at the top not suitable for text generation? That’s what makes gpt different from Bert
1 comments
colordrops
1537 days ago
What are the models at the top used for? Excuse my ignorance.
link
priansh
1537 days ago
Mostly mask fill, but Transformers can be fine tuned to downstream tasks relatively easily (T5 was built for translation but is used for autocomplete in many cases)
link
gfodor
1537 days ago
would you mind sharing some references (or even just googleable terms) for this process of fine tuning?
link