by nl
4047 days ago
There's a slide deck[1] that explains it rather well.

Basically, Tachyon acts as a distributed, reliable, in-memory file system. To generalise enormously: programs have trouble sharing data in RAM. Tachyon lets you share data between (say) your Spark jobs and your Hadoop Map/Reduce jobs at RAM speed, even across machines (it understands data locality, so it will attempt to keep data close to where it is being used).

[1] http://www.cs.berkeley.edu/~haoyuan/talks/Tachyon_2014-10-16...
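
To make the "shared RAM plus data locality" idea concrete, here is a toy Python sketch. It is not Tachyon's API: `ToySharedMemoryFS`, the node names, and the path are all made up for illustration, and the "cache a copy on the reading node after a remote read" policy is an assumption of the sketch rather than a claim about how Tachyon actually places data.

```python
from dataclasses import dataclass, field
from typing import Dict, Tuple


@dataclass
class ToySharedMemoryFS:
    """Toy stand-in for a shared in-memory file system (hypothetical, not Tachyon's API)."""

    # path -> {node name -> bytes held in that node's RAM}
    replicas: Dict[str, Dict[str, bytes]] = field(default_factory=dict)

    def write(self, path: str, data: bytes, node: str) -> None:
        # Keep the data in the RAM of the node that produced it.
        self.replicas.setdefault(path, {})[node] = data

    def read(self, path: str, node: str) -> Tuple[bytes, str]:
        """Return the data and the name of the node it was served from."""
        copies = self.replicas[path]
        if node in copies:
            # Local hit: the reader already has a copy in its own RAM.
            return copies[node], node
        # Remote read: fetch from any node holding a copy, then keep a
        # local copy so later reads on this node are local (assumed policy).
        source = next(iter(copies))
        data = copies[source]
        copies[node] = data
        return data, source


fs = ToySharedMemoryFS()

# One job (think: a Spark job) writes its output on worker-1.
fs.write("/shared/events", b"a,b,c", node="worker-1")

# Another job (think: a Map/Reduce task) on the same machine reads it locally.
data_local, served_local = fs.read("/shared/events", node="worker-1")

# A task on worker-2 is served remotely the first time, locally after that.
data_remote, served_remote = fs.read("/shared/events", node="worker-2")
data_again, served_again = fs.read("/shared/events", node="worker-2")
```

The point is only that both jobs address the same data by path instead of each holding a private copy, and that reads prefer a copy on the local machine when one exists.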