HN Mirror

	by simonw 598 days ago
	I am not particularly interested in those benchmarks that deliberately expose weaknesses in models: I know that models have weaknesses already! What I care about is the things that they're proven to be good at - can I do those kinds of things (RAG, summarization, code generation, language translation) directly on my laptop?