|
|
|
|
|
by Der_Einzige
2184 days ago
|
|
Yet another paper with results that basically look like this:
https://d3b8hk1o42ev08.cloudfront.net/wp-content/uploads/201... Still impressive, don't get me wrong, but I am starting to believe that NLP will be dominated increasingly by the big players since they are the only ones who can train a 1 TRILLION parameter model (they show that in the paper). I can't even do inference with a 36 layer, 2048 neuron per layer network with my GTX 2080ti. Sad.... |
|
Not even for a single instance? Your GPU has 11GB of RAM. Why isn't 14k per neuron enough? Is the input really large, or does each neuron have very high precision?