Hacker News new | ask | show | jobs
by phs318u 69 days ago
As any DBA worth their salt knows.

Index the many valued column, not the column with few discrete values.

1 comments

Are you sure that's a good strategy if every unique index value requires you to buy a physical container?
You can put a few shapes into one container and it's still much faster than searching color-first.
How about arithmetic coding? That will give you the highest amount of entropy reduction for any possible number of containers. Which probably means that you’ll sort similar pieces far apart but group by colors that are easy to separate, like red+yellow, brown+green
Radix sort. Decide how many containers you're fine with, and group accordingly.