Hacker News
by santiagobasulto 1118 days ago
But you'd need to keep both models in parallel, right? Using M1 to keep computing embeddings and using M2 for completions.
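
To make the setup concrete, here is a rough sketch of what I mean, not anyone's actual implementation. `embed_m1` and `complete_m2` are hypothetical toy stand-ins for the real model calls, and the two documents are made up. The point is only that the stored vectors live in M1's embedding space, so queries still have to be embedded with M1 even when M2 writes the answer.

```python
import math

def embed_m1(text: str) -> list[float]:
    """Hypothetical stand-in for M1's embedding call: normalized letter counts."""
    counts = [0.0] * 26
    for ch in text.lower():
        if "a" <= ch <= "z":
            counts[ord(ch) - ord("a")] += 1.0
    norm = math.sqrt(sum(c * c for c in counts)) or 1.0
    return [c / norm for c in counts]

def complete_m2(prompt: str) -> str:
    """Hypothetical stand-in for M2's completion call."""
    return f"[M2 completion for: {prompt}]"

# The index was built with M1, so every stored vector is in M1's space.
DOCS = ["cats purr and sleep", "rust has a borrow checker"]
INDEX = [(doc, embed_m1(doc)) for doc in DOCS]

def answer(question: str) -> str:
    # Queries must be embedded with M1 to be comparable with the index.
    query_vec = embed_m1(question)
    best_doc, _ = max(
        INDEX,
        key=lambda item: sum(a * b for a, b in zip(query_vec, item[1])),
    )
    # M2 only sees text, so it can be swapped without touching the index.
    return complete_m2(f"Context: {best_doc}\nQuestion: {question}")

print(answer("borrow checker in rust"))
```

Dropping M1 entirely would mean re-embedding every stored document with whatever replaces it.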