|
|
|
|
|
by shagie
1174 days ago
|
|
There's a "thumbs up" and "thumbs down" next to each generated response. While this model may not be getting updated in real time, I would be surprised if that feedback isn't used when looking at updating the model with good feedback being used for retraining the model. |
|
Based on what I have been reading, training the core model is a work of statistical analysis on how often words follow each other. This produces a graph tree of words.
Now how do you re-evaluate the entire graph based on a single additional feedback without actually retraining the entire model with the said feedback (because it is costly, and result wouldn't be immediate, unless you're thinking of injecting the input as an initial condition to the readily trained model)