Hacker News new | ask | show | jobs
by liuliu 370 days ago
Yeah. It is great. So apparently separating spatial / temporal attention works if you are careful and train with large enough dataset too!