by kmacdough
408 days ago
What are the barriers to mixed-architecture models? Models that could seamlessly pass from autoregressive to diffusion, etc. Humans can integrate multiple sensory processing centers and multiple modes of thought all at once. It's baked into our training process (life).

The main concern is taking a single probabilistic stream (e.g., a book) and comparing autoregressive modelling of it with diffusion modelling of it.
Regarding mixing diffusion and autoregressive: I was at ICLR last week and this work is probably relevant: https://openreview.net/forum?id=tyEyYT267x