by apavlo
265 days ago
The backstory is complicated. The plan was to establish a consortium between CMU, Tsinghua, Meta, CWI, VoltronData, Nvidia, and SpiralDB to unify behind a single file format. But that fell through after CMU's lawyers freaked out over the NDA Meta required for access to a preview of Velox Nimble. IANAL, but Meta's NDA seemed reasonable to me. So the plan collapsed after about a year, and then everyone released their own format:

→ Meta's Nimble: https://github.com/facebookincubator/nimble
→ CWI's FastLanes: https://github.com/cwida/FastLanes
→ SpiralDB's Vortex: https://vortex.dev
→ CMU + Tsinghua F3: https://github.com/future-file-format/f3

On the research side, we (CMU + Tsinghua) weren't interested in developing new encoders and instead wanted to focus on the WASM embedding part. The original idea came as a suggestion from Hannes@DuckDB to Wes McKinney (a co-author with us). For the encoders, we just used Vortex's implementations, since they were in Rust and with some tweaks we could get most of them to compile to WASM. Vortex is orthogonal to the F3 project and has the engineering energy necessary to support it. F3 is an academic prototype right now.

I note that the Germans also released their own file format this year that also uses WASM. But they WASM-ify the entire file and not individual column groups:

→ Germans: https://github.com/AnyBlox