|
|
|
|
|
by YetAnotherNick
1023 days ago
|
|
This is the least detailed foundational model release I have seen. Llama paper offers lot more details like ablations, loss curves etc. Falcon has data preparation details etc. Google's model release papers like T5 are some of the best and includes many ablations. |
|