|
|
|
|
|
by adipandas
683 days ago
|
|
Beautiful idea. My take on it: I find it difficult to generalize the notion of layer removal when the bit depth of that layer goes to zero. It's wouldn't be straight forward although the authors provide equation 5. It feels like lot of information is missing in this work to even reproduce it. And authors do only 1 case study. I believe some implementation is required to understand the authors completely. Example, optimizer modification for layer when it is removed in training. |
|