Publication
Emergence of Pragmatic Reasoning From Least-Effort Optimization. 13th International Conference on the Evolution of Language (EvoLang) (2020).
The evolution of color naming reflects pressure for efficiency: Evidence from the recent past. Journal of Language Evolution (2022). doi:10.1093/jole/lzac001
Let's talk (efficiently) about us: Person systems achieve near-optimal compression. Proceedings of the Annual Meeting of the Cognitive Science Society 43, (2021).
Discriminative Template Learning in Group-Convolutional Networks for Invariant Speech Representations. INTERSPEECH-2015 (International Speech Communication Association (ISCA), 2015). at <http://www.isca-speech.org/archive/interspeech_2015/i15_3229.html>
A Deep Representation for Invariance and Music Classification. (2014).
CBMM-Memo-002.pdf (1.63 MB)
Generative modeling of audible shapes for object perception. The IEEE International Conference on Computer Vision (ICCV) (2017). at <http://openaccess.thecvf.com/content_iccv_2017/html/Zhang_Generative_Modeling_of_ICCV_2017_paper.html>
Decoding of human identity by computer vision and neuronal vision. Scientific Reports 13, (2023).
s41598-022-26946-w.pdf (1.88 MB)
Single-Shot Object Detection with Enriched Semantics. (2018).
CBMM-Memo-084.pdf (1.92 MB)
Single-Shot Object Detection with Enriched Semantics. Conference on Computer Vision and Pattern Recognition (CVPR) (2018). at <http://cvpr2018.thecvf.com/>
A Deep Representation for Invariance and Music Classification. ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (IEEE, 2014). doi:10.1109/ICASSP.2014.6854954
DeepVoting: An Explainable Framework for Semantic Part Detection under Partial Occlusion. Conference on Computer Vision and Pattern Recognition (CVPR) (2018). at <http://cvpr2018.thecvf.com/>
Finding any Waldo with zero-shot invariant and efficient visual search. Nature Communications 9, (2018).
Eccentricity Dependent Neural Network with Recurrent Attention for Scale, Translation and Clutter Invariance. Vision Science Society (2019).
Machine Learning Based Automated Fault Detection in Seismic Traces. EAGE Conference and Exhibition 2014 (2014). at <http://cbcl.mit.edu/publications/eage14.pdf>
Hypothesis-driven Online Video Stream Learning with Augmented Memory. arXiv (2021). doi:10.48550/arXiv.2104.02206
2104.02206.pdf (2.25 MB)
Phone Classification by a Hierarchy of Invariant Representation Layers. INTERSPEECH 2014 - 15th Annual Conf. of the International Speech Communication Association (International Speech Communication Association (ISCA), 2014). at <http://www.isca-speech.org/archive/interspeech_2014/i14_2346.html>
What am I searching for? (2018).
CBMM-Memo-096.pdf (1.74 MB)
Analysis of Macaque Monkeys’ Social and Physical Interaction Processing with Eye tracking Data. The Rockefeller University 2019 Summer Science Research Program (SSRP) (2019).
Theory of Deep Learning IIb: Optimization Properties of SGD. (2017).
CBMM-Memo-072.pdf (3.66 MB)
Musings on Deep Learning: Properties of SGD. (2017).
CBMM Memo 067 v2 (revised 7/19/2017) (5.88 MB)
CBMM Memo 067 v3 (revised 9/15/2017) (5.89 MB)
CBMM Memo 067 v4 (revised 12/26/2017) (5.57 MB)
Look twice: A generalist computational model predicts return fixations across tasks and species. PLOS Computational Biology 18, e1010654 (2022).
journal.pcbi_.1010654.pdf (4.51 MB)
DeepVoting: A Robust and Explainable Deep Network for Semantic Part Detection under Partial Occlusion. (2018).
CBMM-Memo-083.pdf (2.32 MB)
Shape and Material from Sound. Advances in Neural Information Processing Systems 30 1278–1288 (2017). at <http://papers.nips.cc/paper/6727-shape-and-material-from-sound.pdf>