|
|
|
|
|
by LoganDark
1110 days ago
|
|
> IMO GGML is great (And I totally use it) but it's still not as fast as running the models on GPU for now. I think it was originally designed to be easily embeddable—and most importantly, native code (i.e. not Python)—rather than competitive with GPUs. I think it's just starting to get into GPU support now, but carefully. |
|