Publication
The evolution of color naming reflects pressure for efficiency: Evidence from the recent pastAbstract. Journal of Language Evolution (2022). doi:10.1093/jole/lzac001
What am I searching for?. (2018).
CBMM-Memo-096.pdf (1.74 MB)
Decoding of human identity by computer vision and neuronal visionAbstract. Scientific Reports 13, (2023).
Hypothesis-driven Online Video Stream Learning with Augmented Memory. arXiv (2021). doi:10.48550/arXiv.2104.02206
2104.02206.pdf (2.25 MB)
Look twice: A generalist computational model predicts return fixations across tasks and species. PLOS Computational Biology 18, e1010654 (2022).
journal.pcbi_.1010654.pdf (4.51 MB)
DeepVoting: A Robust and Explainable Deep Network for Semantic Part Detection under Partial Occlusion. (2018).
CBMM-Memo-083.pdf (2.32 MB)
Discriminative Template Learning in Group-Convolutional Networks for Invariant Speech Representations. INTERSPEECH-2015 (International Speech Communication Association (ISCA), 2015). at <http://www.isca-speech.org/archive/interspeech_2015/i15_3229.html>
A Deep Representation for Invariance And Music Classification. (2014).
CBMM-Memo-002.pdf (1.63 MB)
Eccentricity Dependent Neural Network with Recurrent Attention for Scale, Translation and Clutter Invariance . Vision Science Society (2019).
Theory of Deep Learning IIb: Optimization Properties of SGD. (2017).
CBMM-Memo-072.pdf (3.66 MB)
Generative modeling of audible shapes for object perception. The IEEE International Conference on Computer Vision (ICCV) (2017). at <http://openaccess.thecvf.com/content_iccv_2017/html/Zhang_Generative_Modeling_of_ICCV_2017_paper.html>
A Deep Representation for Invariance and Music Classification. ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (IEEE, 2014). doi:10.1109/ICASSP.2014.6854954
Single-Shot Object Detection with Enriched Semantics. (2018).
CBMM-Memo-084.pdf (1.92 MB)
Single-Shot Object Detection with Enriched Semantics. Conference on Computer Vision and Pattern Recognition (CVPR) (2018). at <http://cvpr2018.thecvf.com/>
Musings on Deep Learning: Properties of SGD. (2017).
CBMM Memo 067 v2 (revised 7/19/2017) (5.88 MB)
CBMM Memo 067 v3 (revised 9/15/2017) (5.89 MB)
CBMM Memo 067 v4 (revised 12/26/2017) (5.57 MB)
Decoding of human identity by computer vision and neuronal vision. Scientific Reports 13, (2023).
s41598-022-26946-w.pdf (1.88 MB)
Analysis of Macaque Monkeys’ Social and Physical Interaction Processing with Eye tracking Data. The Rockefeller University 2019 Summer Science Research Program (SSRP) (2019).
Shape and Material from Sound. Advances in Neural Information Processing Systems 30 1278–1288 (2017). at <http://papers.nips.cc/paper/6727-shape-and-material-from-sound.pdf>
Machine Learning Based Automated Fault Detection in Seismic Traces. EAGE Conference and Exhibition 2014 (2014). at <http://cbcl.mit.edu/publications/eage14.pdf>
DeepVoting: An Explainable Framework for Semantic Part Detection under Partial Occlusion. Conference on Computer Vision and Pattern Recognition (CVPR) (2018). at <http://cvpr2018.thecvf.com/>
Finding any Waldo with zero-shot invariant and efficient visual search. Nature Communications 9, (2018).
Phone Classification by a Hierarchy of Invariant Representation Layers. INTERSPEECH 2014 - 15th Annual Conf. of the International Speech Communication Association (International Speech Communication Association (ISCA), 2014). at <http://www.isca-speech.org/archive/interspeech_2014/i14_2346.html>
]