by exhaze 770 days ago
No, the correct way is to have it reason from first principles:

1. "Think about what the underlying principles are for evaluating the truthiness of statements like 'X'. List them out, and explain why you chose each one, what tradeoffs you made, and why you believe each is the right tradeoff in this case."

2. Start a new conversation and use that set of principles as the system prompt.

3. In the user prompt, ask it to decompose X into a weighted formula for those principles and give a sub-score for each principle.

4. Finally, ask it to determine from the weighted sum whether X is true or not, and to give a confidence score between 0 and 1 for its answer.
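
For what it's worth, the arithmetic in steps 3 and 4 can be checked outside the model. Here is a rough Python sketch, not a definitive implementation. The principle names, weights, sub-scores, and the 0.5 cutoff are all made up for illustration; the steps above don't specify any of them. The confidence score in step 4 still comes from the model and is not computed here.

```python
from dataclasses import dataclass


@dataclass
class PrincipleScore:
    name: str         # hypothetical principle name from step 1
    weight: float     # relative importance assigned in step 3
    sub_score: float  # model's sub-score for this principle, 0.0 to 1.0


def weighted_sum(scores):
    """Return the weight-normalized sum of sub-scores (0.0 to 1.0)."""
    total_weight = sum(s.weight for s in scores)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    return sum(s.weight * s.sub_score for s in scores) / total_weight


def verdict(scores, threshold=0.5):
    """Return True if the weighted sum reaches the threshold.

    The 0.5 threshold is an assumption, not part of the original steps.
    """
    return weighted_sum(scores) >= threshold


# Illustrative values only; a real run would parse these from the
# model's reply to the step 3 prompt.
example_scores = [
    PrincipleScore("source reliability", 0.5, 0.8),
    PrincipleScore("internal consistency", 0.3, 0.6),
    PrincipleScore("empirical support", 0.2, 0.4),
]

if __name__ == "__main__":
    print(weighted_sum(example_scores), verdict(example_scores))
```

Normalizing by the total weight means the model's weights don't have to sum to exactly 1, which they often won't.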