Hacker News new | ask | show | jobs
by minimaxir 901 days ago
That is already accounted for with categorical cross-entropy loss.