Hacker News new | ask | show | jobs
by OJFord 1076 days ago
How do they work then that semantic similarity would be any different? That's just a matter of grouping semantically similar 'representations' in training, surely?
1 comments

Yes, what I'm saying is that gzip does not perform as well when it's not overlapping tokens exact.

Gzip does not support a "semantic" mode, hence it won't and does not (according to the papers metric) perform as well.

Deep learning can capture these semantic similarities.