| Most of the comments here seem to be from people who haven’t even read the abstract, let alone the paper. The main result, mentioned in the abstract, is the opposite of what I would have guessed: > Contrary to expectations, impolite prompts consistently outperformed polite ones, with accuracy ranging from 80.8% for Very Polite prompts to 84.8% for Very Rude prompts. These findings differ from earlier studies that associated rudeness with poorer outcomes, suggesting that newer LLMs may respond differently to tonal variation. The questions are here: https://anonymous.4open.science/r/politeness-llms-INFORMS/da... The politeness level controls a prefix that is prepended to the question. For example, in one question the Very Polite version begins: > Can you kindly consider the following problem and provide your answer. and the Very Rude version begins: > I know you are not smart, but try this. |