by omneity
976 days ago
I feel like that would make it harder for a vendor to keep up with the industry. Say you put in all the effort in the world to build a custom LLM toolchain to train a Llama on custom hardware. Then suddenly someone comes up with LoRA. You haven't even finished porting it to your toolkit when someone comes up with GPTQ. You can't keep up with a custom toolchain, imo. It's like a forked Linux kernel: eventually you're gonna have to upstream if you're serious about it, which is what AMD is actively doing with PyTorch for ROCm (masquerading as CUDA for compatibility).
[0] https://github.com/ggerganov/llama.cpp