Hacker News new | ask | show | jobs
by thoughtpeddler 460 days ago
Wholly agreed with this. There are key phrases that can act as, as you said, 'conversational signals' for where to steer things. The amount of intense post-training that these models undergo to make them "helpful assistants" means these words/phrases have high 'steering ratio'. Double-win that you sharpen your own politeness habit when dealing with humans too.