Hacker News
by qosmo 319 days ago
What kind of tweak has enough of an impact is still an open question. According to the paper, it does generalize a bit between different models, but different architectures, at least, require retraining for coverage.