by another_twist 363 days ago
So this is a very similar approach to Conformer: a convolutional head for downsampling and a transformer for the time dependencies. Hmm, it's surprising that this idea works across application domains.
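
For anyone who hasn't seen the pattern, here is a rough pure-Python sketch of what "conv head for downsampling, then attention over time" means. It is a toy under loud assumptions, not Conformer's code and not the code of the model being discussed: the names `conv_head` and `self_attention`, the two hand-picked kernels, the stride of 4, and the use of identity query/key/value projections are all made up for illustration.

```python
import math


def conv_head(signal, kernels, stride):
    """Strided 1D convolution without padding.

    Each kernel yields one output channel, so every frame is a small vector.
    A stride greater than 1 is what shortens the sequence.
    """
    k = len(kernels[0])
    frames = []
    for start in range(0, len(signal) - k + 1, stride):
        window = signal[start:start + k]
        frames.append(
            [sum(w * v for w, v in zip(kernel, window)) for kernel in kernels]
        )
    return frames


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(frames):
    """Single-head scaled dot-product self-attention.

    Assumption: queries, keys and values are the frames themselves
    (identity projections); a real model would learn these projections.
    """
    d = len(frames[0])
    mixed = []
    for query in frames:
        scores = [
            sum(a * b for a, b in zip(query, key)) / math.sqrt(d)
            for key in frames
        ]
        weights = softmax(scores)
        # Each output frame is a weighted average of all frames in time.
        mixed.append(
            [sum(w * f[c] for w, f in zip(weights, frames)) for c in range(d)]
        )
    return mixed


# Hypothetical input: 32 samples of a sine wave.
signal = [math.sin(0.3 * t) for t in range(32)]
# Hypothetical kernels: a moving average and a crude difference filter.
kernels = [[0.25, 0.25, 0.25, 0.25], [1.0, 0.0, 0.0, -1.0]]

frames = conv_head(signal, kernels, stride=4)  # shorter sequence of 2-channel frames
mixed = self_attention(frames)                 # every frame attends to every other frame
```

The point of the split is that attention cost grows quadratically with sequence length, so shrinking the sequence with a strided convolution first makes the attention step cheaper, and that reasoning does not depend on the input being audio.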