Dynamics & Generalization in Deep Networks -Minimizing the Norm