Hacker News new | ask | show | jobs
by adrian1973 3189 days ago
> even if content is of no interest.

Can't they just keep (at most) the metadata?

1 comments

As a data scientist, I think losing actual words would be a loss. Words would be only used by word embeddings like word2vec, but actual words let you switch to better word embedding later.