Publication
Export 4 results:
Filters: Author is Mengjia Xu [Clear All Filters]
The Janus effects of SGD vs GD: high noise and low rank. (2023).
Updated with appendix showing empirically that the main results extend to deep nonlinear networks (2.95 MB)
Small updates...typos... (616.82 KB)


Norm-Based Generalization Bounds for Compositionally Sparse Neural Networks. (2023).
Norm-based bounds for convnets.pdf (1.2 MB)

Norm-based Generalization Bounds for Sparse Neural Networks. NeurIPS 2023 (2023). at <https://proceedings.neurips.cc/paper_files/paper/2023/file/8493e190ff1bbe3837eca821190b61ff-Paper-Conference.pdf>
NeurIPS-2023-norm-based-generalization-bounds-for-sparse-neural-networks-Paper-Conference.pdf (577.69 KB)

Implicit dynamic regularization in deep networks. (2020).
v1.2 (2.29 MB)
v.59 Update on rank (2.43 MB)

