by orbital-decay 98 days ago
If it was, that line is not an indicator. Distillation is done on useful prompts, not on exchanges like "Who are you?" / "I'm this model from that company".

Name training is always shallow; Claude itself would claim to be GPT-3, GPT-4, or Reddit (heh) when confused. It's just dataset contamination, because the web is full of slop. Never trust self-reported names.