fc1 has (3136, 512) params, and it seems like the current implementation always applies both the reset and the zeroing to the dead neurons of the outgoing layer (512). The implementation is supposed to reset the dead neurons of the incoming layer and set to 0 the dead neurons of the outgoing layers: https://github.com/timoklein/redo/blob/main/src/redo.py#L120
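
To make the expected behavior concrete, here is a minimal NumPy sketch of how I read the rule: for each dead neuron, re-initialize its incoming weights and zero its outgoing weights. This is not the code from the repo. The helper name `recycle_dead_neurons`, the following layer `fc2` with 6 outputs, the uniform init bounded by 1/sqrt(fan_in), and the zeroed bias are all assumptions made for illustration. Weights use the PyTorch `(out_features, in_features)` layout, so fc1 is stored as (512, 3136).

```python
import numpy as np


def recycle_dead_neurons(w_in, b_in, w_out, dead_mask, rng):
    """Reset incoming weights and zero outgoing weights of dead neurons.

    w_in:      (n_hidden, fan_in) weights of the layer producing the neurons
    b_in:      (n_hidden,)        bias of that layer
    w_out:     (n_next, n_hidden) weights of the next layer consuming them
    dead_mask: (n_hidden,) bool, True where the neuron is considered dead
    """
    w_in, b_in, w_out = w_in.copy(), b_in.copy(), w_out.copy()
    n_dead = int(dead_mask.sum())
    fan_in = w_in.shape[1]

    # Incoming side: one ROW of w_in per dead neuron gets fresh values.
    # Assumed init: uniform in [-1/sqrt(fan_in), 1/sqrt(fan_in)].
    bound = 1.0 / np.sqrt(fan_in)
    w_in[dead_mask, :] = rng.uniform(-bound, bound, size=(n_dead, fan_in))
    b_in[dead_mask] = 0.0  # assumption: bias of a recycled neuron is zeroed

    # Outgoing side: one COLUMN of w_out per dead neuron is set to 0.
    w_out[:, dead_mask] = 0.0
    return w_in, b_in, w_out


rng = np.random.default_rng(0)
fc1_w = rng.normal(size=(512, 3136))  # fc1: 3136 inputs -> 512 neurons
fc1_b = rng.normal(size=512)
fc2_w = rng.normal(size=(6, 512))     # hypothetical next layer: 512 -> 6

dead = np.zeros(512, dtype=bool)
dead[[3, 100]] = True                 # pretend neurons 3 and 100 are dead

new_w1, new_b1, new_w2 = recycle_dead_neurons(fc1_w, fc1_b, fc2_w, dead, rng)
```

With this layout the reset touches rows of fc1's weight and the zeroing touches columns of the next layer's weight, so the two operations never act on the same tensor for a given dead neuron.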