Hacker News new | ask | show | jobs
by red75prime 657 days ago
> Given that it's a big next-word-predictor

That was instruction-tuned, RLHFed, system-prompt-priority-tuned, maybe synthetic-data-tuned, and who knows what else.

Maybe they just used illeisms in system prompt prioritization tuning.