| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by sanjiwatsuki 908 days ago

If you're looking for the best small model, I'd recommend using Berkeley's Starling-7B model [0].

It'll run on a lot of commodity GPUs and performs well in head-to-head comparisons against bigger models, edging out the most up-to-date GPT-3.5-Turbo [1].

[0] https://starling.cs.berkeley.edu/ [1] https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboar...