Hacker News new | ask | show | jobs
by pauldix 1159 days ago
Shame that this is flagged, I think this is a really exciting development and was hoping to see the discussion around it. Open sourcing the fine tuning training set is a great building block. Will be exciting to see if others continue to build on this. More open source datasets, models, and evaluation frameworks will accelerate the development and adoption of LLMs. It adds more hackers to the mix building the core, rather than just the stuff at the edges (i.e. apps).
2 comments

The post was rightfully flagged while trying to make it's way to the frontpage of HN with blatant astroturfing.

I was hoping to see good discussion around too. And it would have happened had Data bricks employees or PR people didn't create a hundred accounts to comment on this and the previous DOLLY post.

Yeah, if it's getting astroturfed and pushed up artificially that's definitely cause to flag it. It's just a shame that happened because this almost certainly would have landed on the front page on its own.
There are already a number of instruction-tuning datasets.

This announcement does not provide any benchmarks, so it is impossible to tell how useful the model is.