|
|
|
|
|
by edude03
808 days ago
|
|
Maybe the only downside to how fast LLMs are moving is papers come out faster than anyone (not at Google) can train and test the improvements. I got into deep learning around when ReLU and dropout was hot and on my consumer 1080 I was able to change one or two lines of code and test the improvements in a few hours, whereas now, I guess I'll need to wait a few weeks for mistral et al to try it out |
|
I'm focusing in quantization approaches and testing on my obsolete last gen GPUs.