Hacker News new | ask | show | jobs
by wolfgangK 492 days ago
DeepSeek is not a model.Which model did you use (v3 ? R1 ? a distillation ?) at which quantization ?