|
|
|
|
|
by teraflop
2197 days ago
|
|
As I pointed out a while back (https://news.ycombinator.com/item?id=22974882), the "SHARD" system described in that paper didn't actually have anything to do with "sharding" as the term is currently used. It was designed to replicate data, but it didn't do any kind of partitioning; each replica stored a copy of the entire dataset. For that reason (in addition to the low number of citations), I think it's very likely that the name is a total coincidence. Pretty much any word you can think of has been used by somebody as an acronym for some project. |
|
Other the other hand, I found many papers citing the SHARD paper - more than the official count. That's a difficulty with citation counts of old papers: a lot of the papers citing it are also old papers, and we're not consistent at tracking the citations of old papers. Personally, I don't have a conclusion. The SHARD paper is decently cited, and its usage is close to the modern one. On the other hand, I can't find any smoking gun pre-1997 usage of "shard" in the modern meaning.