Hacker News new | ask | show | jobs
by MattSayar 356 days ago
Could you have a higher-order reasoning LLM generate a better confidence rating? That's how eval frameworks generally work today