Y
Hacker News
new
|
ask
|
show
|
jobs
by
masterofsome
924 days ago
You have to look at the pre-trained models and not the "tuned" models. Its number one in that category (which is what I think they are referring to, given the benchmarks are against Mistral and llama)
2 comments
ilc
924 days ago
Doesn't the article say this is a LoRA off ORCA? Isn't that a finetune?
link
GaggiX
924 days ago
DeciLM-7B-instruct is the finetuned model, not DeciLM-7B.
link
audessuscest
924 days ago
thanks for the clarification
link