Hacker News new | ask | show | jobs
by FisDugthop 2352 days ago
What I find interesting is the implicit assumption that the underlying function being learned is differentiable or continuous to begin with. That's not always the case; for example, we often work with "categorically labeled" discrete binning problems.
2 comments

In those cases, people tend to use something like a categorical cross-entropy loss where you assign a continuous likelihood score to each discrete possibility, thereby making things differentiable again.

https://www.tensorflow.org/api_docs/python/tf/nn/sparse_soft...

Data normalization is the 101 of any respectable machine learning course.
Yeah but it’s still enough to stall and hinder newbies (me) that deal primarily with categorical data until you can start to intuitively map the continuous back into the discrete.

Nature of the beast it seems but still kind of a pain.