|
|
|
|
|
by ruby314
674 days ago
|
|
"An advantage of being aware of early papers is that it accumulates citations so you can often find good works in reverse citations." - absolutely. Thanks again for sharing interesting references. Cool that GRATH uses contrast pairs in an iterative process with DPO and TruthX is steering (using the term broadly) with a creating architecture to determine the inference time edits. One thing about Lynx and HaluBench - as we understand it, Halubench is the test set for Lynx's training data. They do have a couple of held out data sources besides the four they train with, but as far as we could tell from their paper they use the same hallucination-inducing function. Be curious to hear your thoughts on that. |
|