中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Statistics of Visual Responses to Image Object Stimuli from Primate AIT Neurons to DNN Neurons

文献类型:期刊论文

作者Dong, Qiulei1,2,3; Wang, Hong2; Hu, Zhanyi1,2,3
刊名NEURAL COMPUTATION
出版日期2018-02-01
卷号30期号:2页码:447-476
DOI10.1162/neco_a_01039
文献子类Article
英文摘要Under the goal-driven paradigm, Yamins etal. (2014; Yamins & DiCarlo, 2016) have shown that by optimizing only the final eight-way categorization performance of a four-layer hierarchical network, not only can its top output layer quantitatively predict IT neuron responses but its penultimate layer can also automatically predict V4 neuron responses. Currently, deep neural networks (DNNs) in the field of computer vision have reached image object categorization performance comparable to that of human beings on ImageNet, a data set that contains 1.3 million training images of 1000 categories. We explore whether the DNN neurons (units in DNNs) possess image object representational statistics similar to monkey IT neurons, particularly when the network becomes deeper and the number of image categories becomes larger, using VGG19, a typical and widely used deep network of 19 layers in the computer vision field. Following Lehky, Kiani, Esteky, and Tanaka (2011, 2014), where the response statistics of 674 IT neurons to 806 image stimuli are analyzed using three measures (kurtosis, Pareto tail index, and intrinsic dimensionality), we investigate the three issues in this letter using the same three measures: (1) the similarities and differences of the neural response statistics between VGG19 and primate IT cortex, (2) the variation trends of the response statistics of VGG19 neurons at different layers from low to high, and (3) the variation trends of the response statistics of VGG19 neurons when the numbers of stimuli and neurons increase. We find that the response statistics on both single-neuron selectivity and population sparseness of VGG19 neurons are fundamentally different from those of IT neurons in most cases; by increasing the number of neurons in different layers and the number of stimuli, the response statistics of neurons at different layers from low to high do not substantially change; and the estimated intrinsic dimensionality values at the low convolutional layers of VGG19 are considerably larger than the value of approximately 100 reported for IT neurons in Lehky etal. (2014), whereas those at the high fully connected layers are close to or lower than 100. To the best of our knowledge, this work is the first attempt to analyze the response statistics of DNN neurons with respect to primate IT neurons in image object representation.
WOS关键词INFEROTEMPORAL CORTEX ; TEMPORAL CORTEX ; SPARSENESS ; DIMENSIONALITY ; RECOGNITION ; SELECTIVITY ; MODELS ; SPACE
WOS研究方向Computer Science ; Neurosciences & Neurology
语种英语
WOS记录号WOS:000423043500006
资助机构Strategic Priority Research Program of the Chinese Academy of Sciences(XDB02070002) ; National Natural Science Foundation of China(61421004 ; 61375042 ; 61573359 ; 61672489)
源URL[http://ir.ia.ac.cn/handle/173211/21938]  
专题自动化研究所_模式识别国家重点实验室_机器人视觉团队
作者单位1.Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
2.Univ Chinese Acad Sci, Beijing 100049, Peoples R China
3.CAS Ctr Excellence Brain Sci & Intelligence Techn, Beijing 100190, Peoples R China
推荐引用方式
GB/T 7714
Dong, Qiulei,Wang, Hong,Hu, Zhanyi. Statistics of Visual Responses to Image Object Stimuli from Primate AIT Neurons to DNN Neurons[J]. NEURAL COMPUTATION,2018,30(2):447-476.
APA Dong, Qiulei,Wang, Hong,&Hu, Zhanyi.(2018).Statistics of Visual Responses to Image Object Stimuli from Primate AIT Neurons to DNN Neurons.NEURAL COMPUTATION,30(2),447-476.
MLA Dong, Qiulei,et al."Statistics of Visual Responses to Image Object Stimuli from Primate AIT Neurons to DNN Neurons".NEURAL COMPUTATION 30.2(2018):447-476.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。