by Muskyinhere
557 days ago
No one is running LLMs on consumer NVIDIA GPUs or Apple MacBooks. A dev who wants to run local models probably runs something that just fits on a proper GPU. For everything else, everyone uses an API key from whatever provider because it's fundamentally faster. Whether an affordable Intel GPU would be meaningfully faster for inference is not clear at all. A 4090 is at least double the speed of Apple's GPU.