| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by dataviz1000 68 days ago
	What do you mean by harness here?

3 comments

Ifkaluva 68 days ago

When you go to the command line and type “Claude”, there is an LLM, and everything else is the harness

link

dataviz1000 68 days ago

I'm having an hard time getting my mind to see this.

> Users should re-tune their prompts and harnesses accordingly.

I read this in the press release and my mind thought it meant test harness. Then there was a blog post about long running harnesses with a section about testing which lead me to a little more confusion.

Yes, the word 'harness' is consistently used in the context as a wrapper around the LLM model not as 'test harness'.

link

dboreham 68 days ago

This field is chock full of people using terms incorrectly, defining new words for things that already had well known names, overloading terms already in use. E.g. shard vs partition. TUI which already meant "telephony user interface ". "Client" to mean "server" in blockchain.

link

suttontom 66 days ago

Some people also call evaluations "tests". There are unexpected things that come along with new models, like the model in a workflow you'd set up suddenly starts calling a tool and never stops or decides to no longer call a particular tool, so running your existing evaluations to catch regressions like this and potentially updating the prompts is considered "testing" your prompts and harnesses.

link

kreig 67 days ago

I understood this concept with this simple equation: Agent = LLM + harness

link

ElFitz 67 days ago

It’s the tool that calls the model, give it access to the local file system, calls the actual tools and commands for the model, etc, and provide the initial system prompt.

Basically a clever wrapper around the Anthropic / OpenAI / whatever provider api or local inference calls.

link

codybontecou 68 days ago

pi vs. claude code vs. codex These are all agent harnesses which run a model (in pi's case, any model) with a system prompt and their own default set of tools.

link