| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by strangescript 424 days ago
	curious why you went with Phi as the default models, that seems a bit unusual compared to current trends

2 comments

codingmoh 424 days ago

I went with Phi as the default model because, after some testing, I was honestly surprised by how high the quality was relative to its size and speed. The responses felt better in some reasoning tasks-but were running on way less hardware.

What really convinced me, though, was the focus on the kinds of tasks I actually care about: multi-step reasoning, math, structured data extraction, and code understanding.There’s a great Microsoft paper on this: "Textbooks Are All You Need" and solid follow-ups with Phi‑2 and Phi‑3.

link

jasonjmcghee 424 days ago

agreed - thought the qwen2.5-coder was kind of standard non-reasoning small line of coding models right now

link

codingmoh 424 days ago

I saw pretty good reasoning quality with phi-4-mini. But alright - I’ll still run some tests with qwen2.5-coder and plan to add support for it next. Would be great to compare them side by side in practical shell tasks. Thanks so much for the pointer!

link