by smarri
736 days ago
Hey team, nice work. Can you help me understand this better? How does the human agent evaluation process work? Is it real time, so that the right (maybe a better word is best) answers go to users as they are needed, or is it done asynchronously, batch style, so that the humans are training the models to be better? Once the best answers are selected, are they fed back into an LLM / AI agent model? Thanks