|
|
|
|
|
by gillesjacobs
716 days ago
|
|
This is entirely unsurprising and in-line with the finding that even small specialized models do better in information extraction and text classification. So no wonder finetuned large LMs do good too. Personally, my PhD did fine grained ACE-like event and sentiment extraction and "small" specialized finetuned transformers outperformed prompting LLMs like BERT and Roberta-large. Would love to see an inclusion of small model scores with some sota pipelines. This is great work anyway even if it replicates known results! |
|
https://www.threads.net/@ethan_mollick/post/C46AfItO8RS?hl=e...