variance across layers
linear
tanh
ReLU
GELU
layers
10