by stevage
206 days ago
>These general data models start to become useful and interesting at around a trillion edges

That is a wild claim. Perhaps for some very specific definition of "useful and interesting"? This dataset is already interesting (hard to say whether it's useful) at a much tinier scale.

Almost every non-trivial graph data model about the world is a graph of human relationships in the population, if not directly then by proxy. Population-scale human relationship graphs commonly pencil out at roughly 1T edges, a function of the population size. People are also typically the highest-cardinality entity. Even if the purpose isn't a human relationship graph, these models all tend to have one tacitly embedded, with the scale that implies.
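As a rough sanity check on the "roughly 1T edges" figure, here is a back-of-the-envelope sketch. The population of about 8 billion is a round number, and the average of 250 relationships per person is purely an illustrative assumption (it is not a measured value, and the true figure depends heavily on what counts as a relationship). The function name `estimate_edges` is hypothetical.

```python
def estimate_edges(population: int, avg_relationships: int, directed: bool = False) -> int:
    """Estimate the edge count of a relationship graph.

    Each person contributes `avg_relationships` edge endpoints. In an
    undirected graph every edge has two endpoints, so the total is halved.
    """
    endpoints = population * avg_relationships
    return endpoints if directed else endpoints // 2


WORLD_POPULATION = 8_000_000_000  # round figure
AVG_RELATIONSHIPS = 250           # assumption for illustration only

edges = estimate_edges(WORLD_POPULATION, AVG_RELATIONSHIPS)
print(f"{edges:,}")  # prints 1,000,000,000,000
```

The point is only that the edge count scales linearly with population for a fixed average degree, so any plausible degree in the low hundreds lands within an order of magnitude of a trillion.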
If you restrict the set of human entities, you end up with either big holes in the graph or a graph that is not generally interesting (like one limited to company employees).
The OP was talking about generalizing this to a graph of people, places, events, and organizations, which always has this property.
It is similar to the phenomenon that a vast number of seemingly unrelated statistics are almost perfectly correlated with GDP.