Hacker News
andyferris 139 days ago
The notes explicitly call out that you may want to dial the effort setting back to medium to reduce latency and token usage (high is the default, and apparently there is a max setting too).
gverrilla 138 days ago
There are 3 options to choose from on /model: low, medium, and high effort.