Hacker News new | ask | show | jobs
by mpixel 1009 days ago
Do people want that though? I don't want phd level responses for my queries. I want it to be better than what I could come up in a minute or by searching half an hour. Rather than some highly advanced highly detailed response I could probably not understand if the topic is not something I'm sufficient in to begin with.

Think common use cases. A lot of users are students, do I want it to write an essay like a linguist? Or solve my homework using the better but more advanced techniques and style?

2 comments

It's a lot easier to add a pass to simplify an explanation by rephrasing or eliding information than it is to smarten up an overly simplified answer.

You do want the underlying model to be capable of the advanced answers, since if it is, it can be used to supply simple answers. You can't make that work the other way around in the same way.

I think they want it. What do you think the most perfect or ideal question answer-er or teacher is? It would probably be an expert in the field, but also with the ability to deliver that content in a level appropriate way for the recipient/student. Unreasonable for the most part, we get away with good enough in the real world. This is a skill we all try to learn though. Like when you need to give a technical presentation to non technical audience

If you have input data that includes a high rated reddit eli5 question, the content of that answer might be hard to verify, the style and way its delivered would be ideal to keep around in the training data. on the other side, technical in-depth answers have content that is worth keeping around, the style of its delivery would be very specific.

Keeping the entire internet around in your training data would still give you access to all these types of delivery still. hope that makes sense.

Maybe the answer is multi-step: first use curated primary sources, e.g. scientific papers. Then reinforce using well written summaries, perhaps by actual models or well graded student papers. Finally, somehow apply negative weights using wrong answers only. Bonus points if you can automate the whole process