|
|
|
|
|
by hooande
1842 days ago
|
|
These results are terrible. Almost all of the generated text posted here is non-sensical, and playing with it online is just confusing. There really needs to be a better method of evaluation, an MNIST for transformer text generation. like a list of pre-defined prompts that every GPT-X has to use to generate text, which can be scored against a list of "correct" answers in a variety of ways. I have no way of knowing if the output of this flavor of transformer is good or not, whatever that would mean. Very difficult to see how it compares to similar models. Setting up a model with this number of parameters and their reported training times is impressive. But I have no idea if this particular number of parameters makes a difference, or what that difference is supposed to look like |
|