| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by riku_iki 979 days ago
	In this case I used 'usually' because don't remember all details and didn't want to generalize by saying 'always', but also training/benchmarking protocol can be flawed, for example LLM still can solve shallow reasoning problem by memorizing pattern.