Hacker News new | ask | show | jobs
Ask HN: Best way to distribute multi-GB dataset?
13 points by ggoodale 4570 days ago
I'm in the process of shutting down Word2[1], an MMO scrabble-like word game we built for Node Knockout back in 2010. The game's world is amazing - ~265MM tiles, ~123MM played words, over 1MM unique players. Once all personally identifiable information has been removed, I'd like to provide it to interested parties to play with. The clean dataset will be ~15GB compressed - what's the preferred way to share multi-GB files these days? Bittorrent, S3 (probably Requester Pays unless there's another option with manageable costs), or something else? Suggestions welcome.

[1] http://massivelyfun.com/saying-goodbye/

4 comments

Rad! Haven't run across Archive Team before. Looking into them now.
Bittorrent Sync
Sourceforge
Interesting - hadn't considered that possibility. Thanks! I'll look into it.
Bittorrent
Seems reasonable, though given the size of the file I'm probably going to have to host it somewhere for seeding purposes rather than just parking it on my home fileserver (yay Comcast monopoly in my area).
Looks like that only works for 5GB files or less, which is a shame.