by jumploops
8 days ago
I've been quite impressed with DeepSeek v4 Flash running via antirez's ds4 [0]. It feels like a GPT-4-class model in terms of "stored knowledge", but it is better at long-horizon tool calling than any of the GPT-4-class models.

Running on a 128GB MBP M4 Max, I'm getting ~24 t/s on generation and ~200 t/s on prefill. I expected it to feel slow, and it certainly does when, e.g., generating code, but it's surprisingly useful as a "machine orchestrator" for simple tasks. For non-agentic use cases, it's a decent enough model to converse with, and it has the benefit of being entirely self-contained and private.

[0] https://github.com/antirez/ds4
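
For a rough sense of what those throughput figures mean in wall-clock terms, here is a back-of-the-envelope sketch. It assumes prefill and generation run at constant rates and ignores model load time, caching, and any tool-call latency. The function name and the example token counts are made up for illustration; only the ~200 t/s and ~24 t/s figures come from the numbers above.

```python
def estimate_seconds(prompt_tokens, output_tokens,
                     prefill_tps=200.0, generation_tps=24.0):
    """Estimate wall-clock seconds for one request.

    Assumes constant throughput: the prompt is processed at prefill_tps
    and the reply is produced at generation_tps (both tokens per second).
    """
    prefill_time = prompt_tokens / prefill_tps
    generation_time = output_tokens / generation_tps
    return prefill_time + generation_time


if __name__ == "__main__":
    # Hypothetical workloads, chosen only to contrast the two regimes.
    workloads = [
        ("short tool call", 2_000, 50),
        ("long code answer", 8_000, 500),
    ]
    for label, prompt_tokens, output_tokens in workloads:
        seconds = estimate_seconds(prompt_tokens, output_tokens)
        print(f"{label}: ~{seconds:.0f} s")
```

Under these assumptions, a short tool call is dominated by prefill, while a long code answer spends a large share of its time in generation, which fits the impression that code generation feels slow while simple orchestration stays usable.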