Hacker News new | ask | show | jobs
by geoah 1514 days ago
Absolute stab in the dark:

You’ll need to store a good chunk of the web in order to allow for retraining/reindexing when algorithms get added or updated. That’s expensive as disk space is not cheap, and bandwidth is even less cheap.

You then beed to be constantly processing all that content through multiple algorithms, and storing their resulting indexes in relatively fast storage so it can be retrieved. That’s a lot of processing and even more storage.

Even if this all works, your algorithms need to be performant in order to be usable. That means time and expertise.

Finally you need to figure out who actually cares enough to pay for this thing. Who pays for my crappy algorithm that is just wasting cpu and disk that no one is using?