Robustified ANNs Reveal Wormholes Between Human Category Percepts

TitleRobustified ANNs Reveal Wormholes Between Human Category Percepts
Publication TypeJournal Article
Year of Publication2023
AuthorsGaziv, G, Lee, MJ, DiCarlo, JJ
JournalarXiv
Date Published10/2023
Abstract

The visual object category reports of artificial neural networks (ANNs) are notoriously sensitive to tiny, adversarial image perturbations. Because human category reports (aka human percepts) are thought to be insensitive to those same small-norm perturbations – and locally stable in general – this argues that ANNs are incomplete scientific models of human visual perception. Consistent with this, we show that when small-norm image perturbations are generated by standard ANN models, human object category percepts are indeed highly stable. However, in this very same “human-presumed-stable” regime, we find that robustified ANNs reliably dis- cover low-norm image perturbations that strongly disrupt human percepts. These previously undetectable human perceptual disruptions are massive in amplitude, approaching the same level of sensitivity seen in robustified ANNs. Further, we show that robustified ANNs support precise perceptual state interventions: they guide the construction of low-norm image perturbations that strongly alter human category percepts toward specific prescribed percepts. These observations sug- gest that for arbitrary starting points in image space, there exists a set of nearby “wormholes”, each leading the subject from their current category perceptual state into a semantically very different state. Moreover, contemporary ANN models of biological visual processing are now accurate enough to consistently guide us to those portals.

URLhttps://arxiv.org/abs/2308.06887

Associated Module: 

CBMM Relationship: 

  • CBMM Funded