Publication
Export 4 results:
Filters: Author is Mengjia Xu [Clear All Filters]
The Janus effects of SGD vs GD: high noise and low rank. (2023). Updated with appendix showing empirically that the main results extend to deep nonlinear networks (2.95 MB) Small updates...typos... (616.82 KB)
Norm-Based Generalization Bounds for Compositionally Sparse Neural Networks. (2023). Norm-based bounds for convnets.pdf (1.2 MB)
Norm-based Generalization Bounds for Sparse Neural Networks. NeurIPS 2023 (2023). at <https://proceedings.neurips.cc/paper_files/paper/2023/file/8493e190ff1bbe3837eca821190b61ff-Paper-Conference.pdf> NeurIPS-2023-norm-based-generalization-bounds-for-sparse-neural-networks-Paper-Conference.pdf (577.69 KB)
Implicit dynamic regularization in deep networks. (2020). v1.2 (2.29 MB) v.59 Update on rank (2.43 MB)