Export 18 results:
Filters: Author is Andrei Barbu [Clear All Filters]
Deep sequential models for sampling-based planning. International Conference on Intelligent Robots (2018).
Grounding language acquisition by training semantic parsers using captioned videos. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (2018). at <http://aclweb.org/anthology/D18-1285>
Partially Occluded Hands: A challenging new dataset for single-image hand pose estimation. Asian Conference for Computer Vision (ACCV) (2018).
Temporal Grounding Graphs for Language Understanding with Accrued Visual-Linguistic Context. Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence (IJCAI 2017) (2017). at <c>
A Compositional Framework for Grounding Language Inference, Generation, and Acquisition in Video. (2015). doi:doi:10.1613/jair.4556.
Do You See What I Mean? Visual Resolution of Linguistic Ambiguities. Conference on Empirical Methods in Natural Language Processing, Lisbon, Portugal. (2015).
Seeing is Worse than Believing: Reading People’s Minds Better than Computer-Vision Methods Recognize Actions. (2014).
Computer Vision – ECCV 2014, Lecture Notes in Computer Science 8693, 612–627 (Springer International Publishing, 2014).
Seeing What You’re Told: Sentence-Guided Activity Recognition In Video. CVPR (IEEE, 2014)..
Seeing what you're told, sentence guided activity recognition in video. Appeared at CVPR (2014)..