Publication
Emergence of Pragmatic Reasoning From Least-Effort Optimization. 13th International Conference on the Evolution of Language (EvoLang) (2020).
A Deep Representation for Invariance and Music Classification. ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (IEEE, 2014). doi:10.1109/ICASSP.2014.6854954
Generative modeling of audible shapes for object perception. The IEEE International Conference on Computer Vision (ICCV) (2017). at <http://openaccess.thecvf.com/content_iccv_2017/html/Zhang_Generative_Modeling_of_ICCV_2017_paper.html>
Single-Shot Object Detection with Enriched Semantics. (2018).
CBMM-Memo-084.pdf (1.92 MB)
Single-Shot Object Detection with Enriched Semantics. Conference on Computer Vision and Pattern Recognition (CVPR) (2018). at <http://cvpr2018.thecvf.com/>
Theory of Deep Learning IIb: Optimization Properties of SGD. (2017).
CBMM-Memo-072.pdf (3.66 MB)
Machine Learning Based Automated Fault Detection in Seismic Traces. EAGE Conference and Exhibition 2014 (2014). at <http://cbcl.mit.edu/publications/eage14.pdf>
Musings on Deep Learning: Properties of SGD. (2017).
CBMM Memo 067 v2 (revised 7/19/2017) (5.88 MB)
CBMM Memo 067 v3 (revised 9/15/2017) (5.89 MB)
CBMM Memo 067 v4 (revised 12/26/2017) (5.57 MB)
Finding any Waldo with zero-shot invariant and efficient visual search. Nature Communications 9, (2018).
DeepVoting: An Explainable Framework for Semantic Part Detection under Partial Occlusion. Conference on Computer Vision and Pattern Recognition (CVPR) (2018). at <http://cvpr2018.thecvf.com/>
Phone Classification by a Hierarchy of Invariant Representation Layers. INTERSPEECH 2014 - 15th Annual Conf. of the International Speech Communication Association (International Speech Communication Association (ISCA), 2014). at <http://www.isca-speech.org/archive/interspeech_2014/i14_2346.html>
Shape and Material from Sound. Advances in Neural Information Processing Systems 30 1278–1288 (2017). at <http://papers.nips.cc/paper/6727-shape-and-material-from-sound.pdf>
Decoding of human identity by computer vision and neuronal vision. Scientific Reports 13, (2023).
s41598-022-26946-w.pdf (1.88 MB)
What am I searching for? (2018).
CBMM-Memo-096.pdf (1.74 MB)
Look twice: A generalist computational model predicts return fixations across tasks and species. PLOS Computational Biology 18, e1010654 (2022).
journal.pcbi_.1010654.pdf (4.51 MB)
Eccentricity Dependent Neural Network with Recurrent Attention for Scale, Translation and Clutter Invariance. Vision Science Society (2019).
Discriminative Template Learning in Group-Convolutional Networks for Invariant Speech Representations. INTERSPEECH-2015 (International Speech Communication Association (ISCA), 2015). at <http://www.isca-speech.org/archive/interspeech_2015/i15_3229.html>
DeepVoting: A Robust and Explainable Deep Network for Semantic Part Detection under Partial Occlusion. (2018).
CBMM-Memo-083.pdf (2.32 MB)
A Deep Representation for Invariance And Music Classification. (2014).
CBMM-Memo-002.pdf (1.63 MB)
Analysis of Macaque Monkeys’ Social and Physical Interaction Processing with Eye tracking Data. The Rockefeller University 2019 Summer Science Research Program (SSRP) (2019).
Hypothesis-driven Online Video Stream Learning with Augmented Memory. arXiv (2021). doi:10.48550/arXiv.2104.02206
2104.02206.pdf (2.25 MB)