Performance-optimized hierarchical models predict neural responses in higher visual cortex

Daniel L.K. Yamins, Ha Hong, Charles F. Cadieu, Ethan A. Solomon, Darren Seibert, James J. DiCarlo

Research output: Contribution to journalArticlepeer-review

1149 Scopus citations

Abstract

The ventral visual stream underlies key human visual object recognition abilities. However, neural encoding in the higher areas of the ventral stream remains poorly understood. Here, we describe a modeling approach that yields a quantitatively accurate model of inferior temporal (IT) cortex, the highest ventral cortical area. Using high-throughput computational techniques, we discovered that, within a class of biologically plausible hierarchical neural network models, there is a strong correlation between a model's categorization performance and its ability to predict individual IT neural unit response data. To pursue this idea, we then identified a high-performing neural network that matches human performance on a range of recognition tasks. Critically, even though we did not constrain this model to match neural data, its top output layer turns out to be highly predictive of IT spiking responses to complex naturalistic images at both the single site and population levels. Moreover, the model's intermediate layers are highly predictive of neural responses in the V4 cortex, a midlevel visual area that provides the dominant cortical input to IT. These results show that performance optimization - applied in a biologically appropriate model class - can be used to build quantitative predictive models of neural processing.

Original languageEnglish
Pages (from-to)8619-8624
Number of pages6
JournalProceedings of the National Academy of Sciences of the United States of America
Volume111
Issue number23
DOIs
StatePublished - 10 Jun 2014
Externally publishedYes

Keywords

  • Array electrophysiology
  • Computational neuroscience
  • Computer vision

Fingerprint

Dive into the research topics of 'Performance-optimized hierarchical models predict neural responses in higher visual cortex'. Together they form a unique fingerprint.

Cite this