by terafo 785 days ago
T5 is an architecture; t5x is a framework for training models that was created with that architecture in mind, but it can also be used to train other architectures, including decoder-only ones (there is one in the examples).

t5x was used to train PaLM 1.