Hacker News
by the_decider 806 days ago
This work is based on a fine-tuned Google PaLM model from 2022. I'm not sure this is a fair comparison to the latest series of groundbreaking LLMs.
2 comments

Given that public perception of LLMs broke through about 1.5 years ago and that the resources for their further development have grown explosively since, publishing this now, in March '24, is indeed questionable. A review based on the latest, most powerful LLMs is urgently needed.
FLAN-T5-XL, a 3B model, to be precise.