| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by noman-land 1144 days ago
	Apparently Vicuna 13B is quite good according to Google's own leaked docs. https://twitter.com/jelleprins/status/1654197282311491592

3 comments

space_fountain 1144 days ago

That's according to this (https://lmsys.org/blog/2023-03-30-vicuna/) promotional blog post and just cited by the google memo right? Which isn't really even a doc, just a memo that was circulating inside google.

I also find it strange they don't contrast gpt4 and gpt3.5

link

int_19h 1144 days ago

This assessment is based largely on GPT-4 evaluation of the output. In actual use, Vicuna-13B isn't even as good as GPT-3.5, although I do have high hopes for 30B if and when they decide to make that available (or someone else trains it, since the dataset is out).

And don't forget that all the LLaMA-based models only have 2K context size. It's good enough for random chat, but you quickly bump into it for any sort of complicated task solving or writing code. Increasing this to 4K - like GPT-3.5 has - would require significantly more RAM for the same model size.

link

amelius 1144 days ago

Is there a way to always stay up to date with the latest and best performing models? Perhaps it's me but I find it difficult to navigate HuggingFace and find models sorted by benchmark.

link

noman-land 1144 days ago

Honestly, I just read hackernews :).

link

amelius 1144 days ago

HN posts are not always in chronological order.

link

noman-land 1144 days ago

I didn't say it was the best way, just the way I'm doing it right now :).

link

nickthegreek 1144 days ago

I check r/LocalLlama

link