|
|
|
|
|
by bravura
461 days ago
|
|
What a smear. A lot of applied work in vision and audio glues together different existing modules, instead of training the whole thing end-to-end. In an ideal world, things are ideal. But the world isn't a grad student's wet dream. But there are not as many influential papers in the world as there are people who respond: "Why didn't they train the whole thing end-to-end?" |
|