| HN Mirror

Those are the essential steps, yes.

(Spoiler: I worked on this feature @ Slite) But in practice, the effectiveness of your pipeline will depend greatly upon exactly how you implement each piece.

Here are some of the things we've had to consider and develop, in order for the Q&A to be production worthy:

- at which level is it best to encode documents (sentence, paragraph, whole doc) and how can those tiered embeddings be weighed effectively?

- how to use the natural document hierarchy in the workspace?

- where to add NLI, so we can actually compare in-model "does this statement imply that statement" rather than just comparing vectors

- how to prioritize passages using additional fine-grained ranking against the question (cross-encoding, etc)?

- how to compute a "confidence" score so that we actually take the generative (GPT) part out of the equation and bail out early, if the documents do no contain anything which strongly implies an answer to your specific query

These are just a few of the pieces. But what we learned quickly is that solving the problem of building a great Q&A means first solving many problems that are deeply intertwined with search algorithms, natural language understanding, and other text problems in general.