People

CBMM is committed to developing and supporting new interdisciplinary collaborations that will further our understanding of intelligence.

Videos
Support Us

Andrei Barbu

Andrei Barbu

Andrei

Barbu

Research Scientist

Massachusetts Institute of Technology (MIT)

Associated Research Module:

Associated Research Thrust:

Advisor/s:

Boris Katz, Shimon Ullman

Past Advisees

Allison Fu - Graduate Student

Nazar Ilamanov - UROP

Houssam Kherraz - UROP

David Mayo - Graduate Student

Battushig Myanganbayar - UROP

Projects

Grounded language acquisition

Grounded question answering

Investigating neural signals underlying language processing in the human brain

Multi-sentence event recognition

Objects and hands in context

CBMM Publications

R. Tejwani, Kuo, Y. - L., Shu, T., Stankovits, B., Gutfreund, D., Tenenbaum, J. B., Katz, B., and Barbu, A., “Zero-shot linear combinations of grounded social interactions with Linear Social MDPs”, in Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI), 2023.

C. Wang, Subramaniam, V., Yaari, A. Uri, Kreiman, G., Katz, B., Cases, I., and Barbu, A., “BrainBERT: Self-supervised representation learning for Intracranial Electrodes”, International Conference on Learning Representations. Kigali, Rwanda, Africa, 2023.

V. Subramaniam, Conwell, C., Wang, C., Kreiman, G., Katz, B., Cases, I., and Barbu, A., “Using Multimodal DNNs to Study Vision-Language Integration in the Brain”, in ICLR 2023, 2023.

L. Schiatti, Gori, M., Schrimpf, M., Cappagli, G., Morelli, F., Signorini, S., Katz, B., and Barbu, A., “Modeling Visual Impairments with Artificial Neural Networks: a Review”, in International Conference on Computer Vision 2023, Paris, 2023.

A. Yaari, DeWitt, J., Hu, H., Stankovits, B., Felshin, S., Berzak, Y., Aparicio, H., Katz, B., Cases, I., and Barbu, A., “The Aligned Multimodal Movie Treebank: An audio, video, dependency-parse treebank”, in Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022.

Y. - L. Kuo, Huang, X., Barbu, A., McGill, S. G., Katz, B., Leonard, J. J., and Rosman, G., “Trajectory Prediction with Linguistic Representations”. 2022.

E. Cheng, Kuo, Y. - L., Cases, I., Katz, B., and Barbu, A., “Spontaneous sign emergence in humans and machines through an embodied communication game”, in JCoLE Workshop, 2022.

R. Tejwani, Kuo, Y. - L., Shu, T., Stankovits, B., Gutfreund, D., Tenenbaum, J. B., Katz, B., and Barbu, A., “Incorporating Rich Social Interactions Into MDPs”. 2022.

E. Cheng, Kuo, Y. - L., Correa, J., Katz, B., Cases, I., and Barbu, A., “Quantifying the Emergence of Symbolic Communication”, CogSci, 2022.

B. Wang, Mayo, D., Deza, A., Barbu, A., and Conwell, C., “On the use of Cortical Magnification and Saccades as Biological Proxies for Data Augmentation”, in Shared Visual Representations in Human and Machine Intelligence (SVRHM) Workshop at NeurIPS, 2021.

R. Tejwani, Kuo, Y. - L., Shu, T., Katz, B., and Barbu, A., “Social Interactions as Recursive MDPs”. 2021.

C. Ross, Katz, B., and Barbu, A., “Measuring Social Biases in Grounded Vision and Language Embeddings”, NAACL (Annual Conference of the North American Chapter of the Association for Computational Linguistics). 2021.

C. Conwell, Mayo, D., Buice, M. A., Katz, B., Alvarez, G. A., and Barbu, A., “Neural Regression, Representational Similarity, Model Zoology Neural Taskonomy at Scale in Rodent Visual Cortex”. 2021.

C. Ross, Barbu, A., and Katz, B., “Measuring Social Biases in Grounded Vision and Language Embeddings”. 2021.

C. Conwell, Mayo, D., Buice, M., Katz, B., Alvarez, G., and Barbu, A., “Large-scale benchmarking of deep neural network models in mouse visual cortex reveals patterns similar to those observed in macaque visual cortex”, Cosyne. 2021.

Y. - L. Kuo, Barbu, A., and Katz, B., “Compositional RL Agents That Follow Language Commands in Temporal Logic”. 2021.

I. Palmer, Rouditchenko, A., Barbu, A., Katz, B., and Glass, J., “Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset”. 2021.

A. Netanyahu, Shu, T., Katz, B., Barbu, A., and Tenenbaum, J. B., “PHASE: PHysically-grounded Abstract Social Events for Machine Social Perception”, in AAAI-21, 2021.

Y. - L. Kuo, Katz, B., and Barbu, A., “Compositional Networks Enable Systematic Generalization for Grounded Language Understanding”. 2021.

A. Netanyahu, Shu, T., Katz, B., Barbu, A., and Tenenbaum, J. B., “PHASE: PHysically-grounded Abstract Social Events for Machine Social Perception”. 2021.

Y. - L. Kuo, Katz, B., and Barbu, A., “Compositional RL Agents That Follow Language Commands in Temporal Logic”, Frontiers in Robotics and AI, vol. 8, 2021.

I. Palmer, Rouditchenko, A., Barbu, A., Katz, B., and Glass, J., “Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset”, in Interspeech 2021, ISCA, 2021.

A. Uri Yaari, Sherman, M., Priebe, O. Clarke, Loh, P. - R., Katz, B., Barbu, A., and Berger, B., “Multi-resolution modeling of a discrete stochastic process identifies causes of cancer”, in International Conference on Learning Representations, 2021.

Y. - L. Kuo, Katz, B., and Barbu, A., “Encoding formulas as deep networks: Reinforcement learning for zero-shot execution of LTL formulas”. 2020.

Y. - L. Kuo, Katz, B., and Barbu, A., “Encoding formulas as deep networks: Reinforcement learning for zero-shot execution of LTL formulas”, in 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, NV, USA, 2020.

Y. - L. Kuo, Katz, B., and Barbu, A., “Deep compositional robotic planners that follow natural language commands”. 2020.

Y. - L. Kuo, Katz, B., and Barbu, A., “Deep compositional robotic planners that follow natural language commands ”, in International Conference on Robotics and Automation (ICRA), Palais des Congrès de Paris, Paris, France, 2020.

C. Wang, Ross, C., Kuo, Y. - L., Katz, B., and Barbu, A., “Learning a natural-language to LTL executable semantic parser for grounded robotics”. 2020.

A. Netanyahu, Shu, T., Katz, B., Barbu, A., and Tenenbaum, J. B., “PHASE: PHysically-grounded Abstract Social Eventsfor Machine Social Perception”, in Shared Visual Representations in Human and Machine Intelligence (SVRHM) workshop at NeurIPS 2020, 2020.

C. Wang, Ross, C., Kuo, Y. - L., Katz, B., and Barbu, A., “Learning a Natural-language to LTL Executable Semantic Parser for Grounded Robotics”, Proceedings of Conference on Robot Learning (CoRL-2020), 2020.

A. Barbu, Banda, D., and Katz, B., “Deep video-to-video transformations for accessibility with an application to photosensitivity”, Pattern Recognition Letters, 2019.

A. Barbu, Mayo, D., Alverio, J., Luo, W., Wang, C., Gutfreund, D., Tenenbaum, J. B., and Katz, B., “ObjectNet: A large-scale bias-controlled dataset for pushing the limits of object recognition models”, Neural Information Processing Systems (NeurIPS 2019). Vancouver, Canada, 2019.

C. Ross, Berzak, Y., Katz, B., and Barbu, A., “Learning Language from Vision.”, in Workshop on Visually Grounded Interaction and Language (ViGIL) at the Thirty-third Annual Conference on Neural Information Processing Systems (NeurIPS), Vancouver Convention Center, Vancouver, Canada, 2019.

Y. - L. Kuo, Katz, B., and Barbu, A., “Deep Compositional Robotic Planners that Follow Natural Language Commands.”, Workshop on Visually Grounded Interaction and Language (ViGIL) at the Thirty-third Annual Conference on Neural Information Processing Systems (NeurIPS). Vancouver Convention Centre, Vancouver, Canada, 2019.

Y. - L. Kuo, Barbu, A., and Katz, B., “Deep sequential models for sampling-based planning”, in The IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2018), Madrid, Spain, 2018.

C. Ross, Barbu, A., Berzak, Y., Myanganbayar, B., and Katz, B., “Grounding language acquisition by training semantic parsersusing captioned videos”, in Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP 2018), , Brussels, Belgium, 2018.

B. Myanganbayar, Mata, C., Dekel, G., Katz, B., Ben-Yosef, G., and Barbu, A., “Partially Occluded Hands: A challenging new dataset for single-image hand pose estimation”, in The 14th Asian Conference on Computer Vision (ACCV 2018), 2018.

B. Myanganbayar, Mata, C., Dekel, G., Katz, B., Ben-Yosef, G., and Barbu, A., “Partially Occluded Hands: A challenging new dataset for single-image hand pose estimation”. 2018.

R. Paul, Barbu, A., Felshin, S., Katz, B., and Roy, N., “Temporal Grounding Graphs for Language Understanding with Accrued Visual-Linguistic Context”, in Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence (IJCAI 2017), Melbourne, Australia, 2017.

Y. Berzak, Barbu, A., Harari, D., Katz, B., and Ullman, S., “Language and Vision Ambiguities (LAVA) Corpus”. 2016.

Y. Berzak, Huang, Y., Barbu, A., Korhonen, A., and Katz, B., “Anchoring and Agreement in Syntactic Annotations”. 2016.

Y. Berzak, Barbu, A., Harari, D., Katz, B., and Ullman, S., “Do You See What I Mean? Visual Resolution of Linguistic Ambiguities”. 2016.

B. Katz and Barbu, A., “A look back at the June 2016 BMM Workshop in Sestri Levante, Italy”. 2016.

H. Yu, Siddharth, N., Barbu, A., and Siskind, J. Mark, “A Compositional Framework for Grounding Language Inference, Generation, and Acquisition in Video”. 2015.

Y. Berzak, Barbu, A., Harari, D., Katz, B., and Ullman, S., “Do You See What I Mean? Visual Resolution of Linguistic Ambiguities”, in Conference on Empirical Methods in Natural Language Processing, Lisbon, Portugal. , 2015.

A. Barbu, Barrett, D., Chen, W., Siddharth, N., Xiong, C., Corso, J. J., Fellbaum, C. D., Hanson, C., Hanson, S. José, Helie, S., Malaia, E., Pearlmutter, B. A., Siskind, J. Mark, Talavage, T. Michael, and Wilbur, R. B., “Seeing is worse than believing: Reading people’s minds better than computer-vision methods recognize actions”, in Computer Vision – ECCV 2014, Lecture Notes in Computer Science, vol. 8693, Zurich, Switzerland: Springer International Publishing, 2014, pp. 612–627.

N. Siddharth, Barbu, A., and Siskind, J. Mark, “Seeing what you're told, sentence guided activity recognition in video”, Appeared at CVPR. IEEE, 2014.

N. Siddharth, Barbu, A., and Siskind, J. Mark, “Seeing What You’re Told: Sentence-Guided Activity Recognition In Video.”. 2014.

A. Barbu, Barrett, D., Chen, W., Siddharth, N., Xiong, C., Corso, J. J., Fellbaum, C. D., Hanson, C., Hanson, S. José, Helie, S., Malaia, E., Pearlmutter, B. A., Siskind, J. Mark, Talavage, T. Michael, and Wilbur, R. B., “The Compositional Nature of Event Representations in the Human Brain”. 2014.

N. Siddharth, Barbu, A., and Siskind, J. Mark, “Seeing What You’re Told: Sentence-Guided Activity Recognition In Video”, in CVPR, Columbus, Ohio, 2014.

A. Barbu, Barrett, D., Chen, W., Siddharth, N., Xiong, C., Corso, J. J., Fellbaum, C. D., Hanson, C., Hanson, S. José, Helie, S., Malaia, E., Pearlmutter, B. A., Siskind, J. Mark, Talavage, T. Michael, and Wilbur, R. B., “Seeing is Worse than Believing: Reading People’s Minds Better than Computer-Vision Methods Recognize Actions”. 2014.