Hacker News new | ask | show | jobs
by jarek83 1158 days ago
Can someone help me to understand why categories for these two differ?

row #51 "Think of some family rules to promote a healthy family relationship" - brainstorsming [1]

row #68 "What is the future for human?" - general_qa [2]

In nature they both are brainstorming to me - does the question mark is what assigned the #68 as _qa?

[1] https://lite.datasette.io/?json=https://github.com/databrick...

[2] https://lite.datasette.io/?json=https://github.com/databrick...

1 comments

The labelling doesn't seem to be entirely consistent to me, but I think the idea is that 51 is inviting you to brainstorm, while 68 is asking a question that just happens to be open ended.
Hey! Worked on this here at Databricks: the blog post goes into the dataset collection design a bit (https://www.databricks.com/blog/2023/04/12/dolly-first-open-...). In summary, you're right - brainstorming and GeneralQA will have overlap because the taxonomy naturally has some overlap