Hacker News new | ask | show | jobs
by ed 305 days ago
TL;DR the “high” “low” level modules don’t matter much, and the rest of the architecture is similar to Universal Transformers https://arxiv.org/abs/1807.03819