| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by czl 145 days ago
	FYI: Newer LLM hosting APIs offer control over amount of "thinking" (as well as length of reply) -- some by token count others by an enum (high low, medium, etc.).