|
|
|
|
|
by ttt3ts
1006 days ago
|
|
You can run 70B LLAMA on dual 4090s/3090s with quantization. Going with dual 3090s you can get a system that can run LLAMA 2 70B with 12K context for < $2K. I built two such a systems after burning that much in a week on ChatGPT. |
|
What are you doing!?