|
|
|
|
|
by ghc
312 days ago
|
|
Here's a sample of running the 120b model on Ollama with my MBP: ``` total duration: 1m14.16469975s load duration: 56.678959ms prompt eval count: 3921 token(s) prompt eval duration: 10.791402416s prompt eval rate: 363.34 tokens/s eval count: 2479 token(s) eval duration: 1m3.284597459s eval rate: 39.17 tokens/s ``` |
|