Publication
Zoom better to see clearer: Human and object parsing with hierarchical auto-zoom net. ECCV (2016).
auto-zoom_net.pdf (5.77 MB)
Zero-shot linear combinations of grounded social interactions with Linear Social MDPs. Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI) (2023).
Young Children’s Use of Surface and Object Information in Drawings of Everyday Scenes. Child Development (2016). doi:10.1111/cdev.12658
Young children’s automatic and alternating use of scene and object information in spatial symbols. Budapest CEU Conference on Cognitive Development (2015).
XDream: Finding preferred stimuli for visual neurons using generative networks and gradient-free optimization. PLOS Computational Biology 16, e1007973 (2020).
gk7791.pdf (2.39 MB)
Write, Execute, Assess: Program Synthesis with a REPL. Neural Information Processing Systems (NeurIPS 2019) (2019).
9116-write-execute-assess-program-synthesis-with-a-repl.pdf (3.9 MB)
Mechanisms of Sensory Working Memory: Attention and Performance XXV. (Elsevier Inc. , 2015). at <https://www.sciencedirect.com/book/9780128013717/mechanisms-of-sensory-working-memory>
Word-level Invariant Representations From Acoustic Waveforms. INTERSPEECH 2014 - 15th Annual Conf. of the International Speech Communication Association (International Speech Communication Association (ISCA), 2014). at <http://www.isca-speech.org/archive/interspeech_2014/i14_2385.html>
Why does deep and cheap learning work so well?. Journal of Statistical Physics 168, 1223–1247 (2017).
1608.08225.pdf (2.14 MB)
Why Are Face and Object Processing Segregated in the Human Brain? Testing Computational Hypotheses with Deep Convolutional Neural Networks . Conference on Cognitive Computational Neuroscience (2020).
Why and when can deep-but not shallow-networks avoid the curse of dimensionality: A review. International Journal of Automation and Computing 1-17 (2017). doi:10.1007/s11633-017-1054-2
art%3A10.1007%2Fs11633-017-1054-2.pdf (1.68 MB)
Whole-agent selectivity within the macaque face-processing system. Proceedings of the National Academy of Sciences (PNAS) 112, (2015).
Authors' last version of article. (3.1 MB)
Where do hypotheses come from?. (2016).
CBMM-Memo-056-v2.pdf (733.35 KB)
When Pigs Fly: Contextual Reasoning in Synthetic and Natural Scenes. International Conference on Computer Vision (ICCV) (2021). doi:10.1109/iccv48922.2021.00032
Bomatter_When_Pigs_Fly_Contextual_Reasoning_in_Synthetic_and_Natural_Scenes_ICCV_2021_paper.pdf (3.24 MB)
When Does Model-Based Control Pay Off?. PLoS Comput Biol 12, e1005090 (2016).
KoolEtAl_PLOS_CB.PDF (5.85 MB)
When Computer Vision Gazes at Cognition. (2014).
CBMM-Memo-025.pdf (3.78 MB)
When and Why Are Deep Networks Better Than Shallow Ones?. AAAI-17: Thirty-First AAAI Conference on Artificial Intelligence (2017).
When and how convolutional neural networks generalize to out-of-distribution category–viewpoint combinations. Nature Machine Intelligence 4, 146 - 153 (2022).
What Matters In Branch Specialization? Using a Toy Task to Make Predictions. Shared Visual Representations in Human and Machine Intelligence (SVRHM) Workshop at NeurIPS (2021). at <https://openreview.net/forum?id=0kPS1i6wict>
What is changing when: decoding visual information in movies from human intracranial recordings. NeuroImage 180, Part A, 147-159 (2018).
Human neurophysiological responses during movies (2.78 MB)
What is changing when: Decoding visual information in movies from human intracranial recordings. Neuroimage (2017). doi:https://doi.org/10.1016/j.neuroimage.2017.08.027
What if Eye..? Computationally Recreating Vision Evolution. arXiv (2025). at <https://arxiv.org/abs/2501.15001>
2501.15001v1.pdf (5.2 MB)
]