|
|
|
|
|
by PeterStuer
316 days ago
|
|
Given that for a non quantized 700B monolithic model with let's say a 1M token context, you would need around 20TB of memory, I doubt your spark or M4 will get very far. I'm not saying those machines can't be usefull or fun, but it's not in the range of the 'fantasy' thing you're responding to. |
|