Hacker News new | ask | show | jobs
by greensh 525 days ago
Microsoft developed and trained Phi-4. How can there be bugs in their official implementation? Does this mean they trained und evaluated it on their own completly different code and then ported it to the huggingface library for compatibility?
1 comments

The chat template adding an assistant prompt by default for example is also shown in the technical report - so they did this during training. The issue is inference workloads should not have this, otherwise inference workloads might inadvertently append extra assistant prompts or forget about it - so hence I removed it.

The rest I'm not sure - for eg the EOS token should be im_end and not endoftext - it could be a small mistake

Thanks. I guess this means for Benchmarks they didn't use it. I find it fascinating and admire your dedication to fixing and improving those models.