A parametric texture model based on deep convolutional features closely matches texture appearance for humans

2017 | journal article

Jump to: Cite & Linked | Documents & Media | Details | Version history

Cite this publication

​A parametric texture model based on deep convolutional features closely matches texture appearance for humans​
Wallis, T. S. A.; Funke, C. M.; Ecker, A. S. ; Gatys, L. A.; Wichmann, F. A. & Bethge, M.​ (2017) 
Journal of Vision17(12) art. 5​.​ DOI: https://doi.org/10.1167/17.12.5 

Documents & Media

License

GRO License GRO License

Details

Authors
Wallis, Thomas S. A.; Funke, Christina M.; Ecker, Alexander S. ; Gatys, Leon A.; Wichmann, Felix A.; Bethge, Matthias
Abstract
Our visual environment is full of texture-"stuff" like cloth, bark, or gravel as distinct from "things" like dresses, trees, or paths-and humans are adept at perceiving subtle variations in material properties. To investigate image features important for texture perception, we psychophysically compare a recent parametric model of texture appearance (convolutional neural network [CNN] model) that uses the features encoded by a deep CNN (VGG-19) with two other models: the venerable Portilla and Simoncelli model and an extension of the CNN model in which the power spectrum is additionally matched. Observers discriminated model-generated textures from original natural textures in a spatial three-alternative oddity paradigm under two viewing conditions: when test patches were briefly presented to the near-periphery ("parafoveal") and when observers were able to make eye movements to all three patches ("inspection"). Under parafoveal viewing, observers were unable to discriminate 10 of 12 original images from CNN model images, and remarkably, the simpler Portilla and Simoncelli model performed slightly better than the CNN model (11 textures). Under foveal inspection, matching CNN features captured appearance substantially better than the Portilla and Simoncelli model (nine compared to four textures), and including the power spectrum improved appearance matching for two of the three remaining textures. None of the models we test here could produce indiscriminable images for one of the 12 textures under the inspection condition. While deep CNN (VGG-19) features can often be used to synthesize textures that humans cannot discriminate from natural textures, there is currently no uniformly best model for all textures and viewing conditions.
Issue Date
2017
Journal
Journal of Vision 
ISSN
1534-7362
eISSN
1534-7362
Language
English

Reference

Citations


Social Media