Hacker News new | ask | show | jobs
by covi 1164 days ago
Kudos to Databricks! Anyone has insights into benchmark & real-world quality?

From https://huggingface.co/databricks/dolly-v2-12b#benchmark-met..., it seems like dolly-v2-12b's benchmark results are actually slightly worse than dolly-v1-6b.

A commercially viable instruction-tuned LLM is still a huge deal.

1 comments

Right, but this is still impressive given how quickly Databricks created and open-sourced the dataset. I would expect more improvements in the future.