|
|
|
|
|
by closetCS
2402 days ago
|
|
I think the main question I have about these smaller, "lottery ticket" networks is that they are being trained over and over on the same problem as the bigger network, and are being evaluated on the same dataset as the big network, which leads me to believe that the model will fail to generalize to different but still related problems. Like if the model was trained on Imagenet an the winning ticket was found that had ridiculous accuracy for a relatively small network, I would think it would be heavily overfitted |
|