|
|
|
|
|
by tbv
3188 days ago
|
|
I'm one of the creators of the Beaker browser[1] and the reason we use Dat is that as a p2p protocol, it offers a lot of neat properties, including making datasets more resilient. As long as one peer on the network is hosting a dataset, it will be reachable, even if the original author has stopped hosting it. I won't speak authoritatively on behalf of the Dat team, but I believe one of their goals is to make it difficult for public scientific datasets to be lost, and data living on a centralized server is particularly vulnerable to that. 1. https://github.com/beakerbrowser/beaker |
|
I spent a while trying to download recent updates to the Reddit comment corpus [1], which is hosted on BitTorrent. The downloads never seem to finish.
It seems to me that decentralization means that, when a dataset stops being new and exciting, it will disappear. How will Dat counter this?
[1] https://www.reddit.com/r/datasets/comments/65o7py/updated_re...