Hacker News new | ask | show | jobs
by danielhanchen 525 days ago
The chat template adding an assistant prompt by default for example is also shown in the technical report - so they did this during training. The issue is inference workloads should not have this, otherwise inference workloads might inadvertently append extra assistant prompts or forget about it - so hence I removed it.

The rest I'm not sure - for eg the EOS token should be im_end and not endoftext - it could be a small mistake

1 comments

Thanks. I guess this means for Benchmarks they didn't use it. I find it fascinating and admire your dedication to fixing and improving those models.