|
|
|
|
|
by zozbot234
14 days ago
|
|
Have you tried it? It would be slow for sure, but the main limitation AIUI would actually be storing the context in RAM - models like Kimi and GLM have high demands there which limit your ability to get meaningful aggregate throughput via large batches. |
|
How is that supposed to give good results?