Publication
Computational and Cognitive Neuroscience of Vision 85-104 (Springer, 2017).
Empirical Inference 59 - 69 (Springer Berlin Heidelberg, 2013). doi:10.1007/978-3-642-41136-610.1007/978-3-642-41136-6_7
Author's Version (147.25 KB)
Deep Convolutional Networks are Hierarchical Kernel Machines. (2015).
CBMM Memo 035_rev5.pdf (975.65 KB)
A Deep Representation for Invariance And Music Classification. (2014).
CBMM-Memo-002.pdf (1.63 MB)
For interpolating kernel machines, the minimum norm ERM solution is the most stable. (2020).
CBMM_Memo_108.pdf (1015.14 KB)
Better bound (without inequalities!) (1.03 MB)
Holographic Embeddings of Knowledge Graphs. (2015).
holographic-embeddings.pdf (677.87 KB)
On Invariance and Selectivity in Representation Learning. (2015).
CBMM Memo No. 029 (812.07 KB)
I-theory on depth vs width: hierarchical function composition. (2015).
cbmm_memo_041.pdf (1.18 MB)
The Janus effects of SGD vs GD: high noise and low rank. (2023).
Updated with appendix showing empirically that the main results extend to deep nonlinear networks (2.95 MB)
Small updates...typos... (616.82 KB)
Learning An Invariant Speech Representation. (2014).
CBMM-Memo-022-1406.3884v1.pdf (1.81 MB)
Notes on Hierarchical Splines, DCLNs and i-theory. (2015).
CBMM Memo 037 (1.83 MB)
Symmetry Regularization. (2017).
CBMM-Memo-063.pdf (6.1 MB)
Theory I: Why and When Can Deep Networks Avoid the Curse of Dimensionality?. (2016).
CBMM-Memo-058v1.pdf (2.42 MB)
CBMM-Memo-058v5.pdf (2.45 MB)
CBMM-Memo-058-v6.pdf (2.74 MB)
Proposition 4 has been deleted (2.75 MB)
Theory III: Dynamics and Generalization in Deep Networks. (2018).
Original, intermediate versions are available under request (2.67 MB)
CBMM Memo 90 v12.pdf (4.74 MB)
Theory_III_ver44.pdf Update Hessian (4.12 MB)
Theory_III_ver48 (Updated discussion of convergence to max margin) (2.56 MB)
fixing errors and sharpening some proofs (2.45 MB)
Theory of Deep Learning III: explaining the non-overfitting puzzle. (2017).
CBMM-Memo-073.pdf (2.65 MB)
CBMM Memo 073 v2 (revised 1/15/2018) (2.81 MB)
CBMM Memo 073 v3 (revised 1/30/2018) (2.72 MB)
CBMM Memo 073 v4 (revised 12/30/2018) (575.72 KB)
Assumption violations in causal discovery and the robustness of score matching. 37th Conference on Neural Information Processing Systems (NeurIPS 2023) (2024). at <https://proceedings.neurips.cc/paper_files/paper/2023/file/93ed74938a54a73b5e4c52bbaf42ca8e-Paper-Conference.pdf>
A Deep Representation for Invariance and Music Classification. ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (IEEE, 2014). doi:10.1109/ICASSP.2014.6854954
Discriminative Template Learning in Group-Convolutional Networks for Invariant Speech Representations. INTERSPEECH-2015 (International Speech Communication Association (ISCA), 2015). at <http://www.isca-speech.org/archive/interspeech_2015/i15_3229.html>
Dynamics & Generalization in Deep Networks -Minimizing the Norm. NAS Sackler Colloquium on Science of Deep Learning (2019).
Estimating Koopman operators with sketching to provably learn large scale dynamical systems. 37th Conference on Neural Information Processing Systems (NeurIPS 2023) (2023). at <https://proceedings.neurips.cc/paper_files/paper/2023/file/f3d1e34a15c0af0954ae36a7f811c754-Paper-Conference.pdf>
Heteroscedastic Gaussian Processes and Random Features: Scalable Motion Primitives with Guarantees. 7th Conference on Robot Learning (CoRL 2023 (2023). at <https://proceedings.mlr.press/v229/caldarelli23a/caldarelli23a.pdf>
Holographic Embeddings of Knowledge Graphs. Thirtieth AAAI Conference on Artificial Intelligence (AAAI-16) (2016).
1510.04935v2.pdf (360.65 KB)
Learning manifolds with k-means and k-flats. Advances in Neural Information Processing Systems 25 (NIPS 2012) (2012). at <https://papers.nips.cc/paper/2012/hash/b20bb95ab626d93fd976af958fbc61ba-Abstract.html>
]