Hacker News new | ask | show | jobs
by jjoonathan 2310 days ago
For those of us out of the loop, could you summarize the idea of contrastive learning as a whole?
2 comments

A fully illustrated article [1]

And Lilian Weng blog on self-supervision [2]

.. CPC is .. translating a generative modeling problem to a classification problem... uses cross-entropy loss to measure how well the model can classify the “future” representation amongst a set of unrelated “negative” samples...

[1] https://ankeshanand.com/blog/2020/01/26/contrative-self-supe...

[2] https://lilianweng.github.io/lil-log/2019/11/10/self-supervi...

Thanks!
(one variant of) The task is: Given a crop of an image (or a short audio snippet etc.), can you find the matching crop that also comes from the same image from a set containing a lot of negative samples (crops of others, unrelated images)?

To succeed, the encoder needs to be able to extract the underlying, useful information (called slow features) contained in the patch and discard the noise as this will make the retrieval process much easier.

This yields an encoder that gives pretty good representations of your inputs and you can then finetune some additional layers on top of it for your final task.