|
|
|
|
|
by jandrewrogers
1513 days ago
|
|
People severely underestimate the velocity and volume of data implied by this if you actually did it, never mind having to run analytics processes at the same scale alongside it. We are talking about bespoke state-of-the-art data infrastructure platforms. You can't support anything like this with open source software, not that it stops people from trying. Glorified data brokers are usually not bastions of world-class software engineering, and if they were, they wouldn't be in such a low margin business. Most of these companies are just recycling the same low-quality and stale data sets. The question I always ask, when evaluating companies making these types of claims, is "what hard computer science problems did you solve to make this possible". If you've actually done what is claimed like in the above, it will be an interesting list. In practice, this question usually elicits confusion. There are legions of dubious companies making claims like this, which you can safely ignore. Their data quality is so poor that they would have difficulty violating most peoples' privacy even if they wanted to. The couple orgs with the technical expertise to actually pull it off competently don't talk about it. |
|
(Built and sold a Data Broker, it's not a low margin biz btw, we had >85% gross margins because of how cheap the source raw data is, and we were doing double-digit millions $$ in revenue).