|
|
|
|
|
by euanai
113 days ago
|
|
Hi HN! I'm the author. Novum is a Claude Code extension that runs an autonomous ML research loop with mechanical guardrails designed to reduce result fabrication. The key idea is that instead of relying on prompts like "don't hallucinate", the system enforces constraints mechanically (e.g., preventing edits to protected result files and enforcing phase gates in the research pipeline). In a recent test run, a single /research command ran autonomously for about 30 hours:
10 hypotheses tested, 4 iteration cycles, and one champion solution selected. Happy to answer questions or hear feedback on the guard design and research workflow. |
|
- How strict are the phase gates? Like, is it a hard checklist or can the system be more lenient depending on the task? - When picking the champion solution out of 10 hypotheses, what's actually being measured?