Y
Hacker News
new
|
ask
|
show
|
jobs
by
bird0861
103 days ago
Just want to add to this that with custom calibration data it's incredibly effective and surgical, you can get VERY LOW KL divergence this way. Many MoEs are supported too, it's actively maintained.