by afiodorov 595 days ago
Good spot, will deduplicate in the next iteration.

However, titles are often repeated because of region/language variations.

1 comment

Since you're denormalizing to a single table, I think the correct way to handle this would be to aggregate all the titles into the title column.

That said, "Untitled Pixar Animation Project" is basically garbage data, but that's a harder problem to solve...
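
To make the aggregation idea concrete, here is a rough sketch of what I mean. The row layout (id, title, region), the sample values, the separator, and the function name `aggregate_titles` are all made up for illustration; I don't know what the actual schema looks like.

```python
from collections import defaultdict

# Hypothetical input: one row per regional title, as (movie_id, title, region).
# IDs and titles are placeholders, not real data.
rows = [
    ("m1", "The Example", "US"),
    ("m1", "El Ejemplo", "ES"),
    ("m1", "The Example", "GB"),  # same title repeated for another region
    ("m2", "Another Film", "US"),
]

def aggregate_titles(rows, sep=" | "):
    """Collapse one-row-per-regional-title into one entry per movie."""
    titles_by_id = defaultdict(list)
    for movie_id, title, _region in rows:
        # Keep first-seen order and skip exact repeats of the same title.
        if title not in titles_by_id[movie_id]:
            titles_by_id[movie_id].append(title)
    # Join every distinct title into a single string for the title column.
    return {movie_id: sep.join(titles) for movie_id, titles in titles_by_id.items()}

denormalized = aggregate_titles(rows)
```

This only drops exact duplicates. It would not catch placeholder titles like the "Untitled ..." one, since that needs some judgment about what counts as a real title.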