by MyFirstSass
929 days ago
True, I've been using the OpenOrca finetune and just downloaded the new UNA Cybertron model, both tuned on the Mistral base. They're not far from GPT-3 logic-wise, I'd say, if you consider the breadth of data: very little fits in 7 GB, so they're missing other languages, niche topics, prose styles, etc. I honestly wouldn't be surprised if a 13B model were indistinguishable from GPT-3.5 on some levels. And if that is the case, then coupled with the latest developments in decoding (UltraFastBERT, speculative decoding, Jacobi decoding, lookahead decoding, etc.), I honestly wouldn't be surprised to see local LLMs at the current GPT-4 level within a few years.