Skip to content

Latest commit

 

History

History
33 lines (29 loc) · 2.7 KB

README.md

File metadata and controls

33 lines (29 loc) · 2.7 KB

CNN-IF

Code of article "A Convolutional Neural Network Interpretable Framework for Human Ventral Visual Pathway Representation"

Introduction

We propose a convolutional neural network interpretable framework (CNN-IF) aimed at providing a transparent interpretable encoding model for the ventral visual pathway.

The method is built on the framework of the feature-weighted receptive field (fwrf), an encoding model designed to balance expressiveness, interpretability, and scalability, which can encode the voxel-wise response of voxels in the human brain.

The weighted matrix is extracted from the trained encoding model to convert the layer-by-layer feature maps into voxel maps, so as to conduct the layer-wise Network Dissection along the ventral visual pathway.

Download

  • Clone the code of Network Dissection Lite from GitHub
    git clone https://github.com/CSAILVision/NetDissect-Lite
    cd NetDissect-Lite
  • Acquire the NSD data for the encoding model from:
    http://naturalscenesdataset.org

Environment

Dependency can be get in environment.yml. All models were trained, validated, and analyzed on four NVIDIA GeForce RTX 3090 GPUs.

Appenxix

  • AlexNet hierarchy appendix_fig1 AlexNet hierarchy. (A) Visualization of the hierarchical structure of the ventral visual pathway. (B) The correlation between predictive voxel responses and measured voxel responses for each ROI from all data-driven-pretrained layers. (C) The correlation between predictive voxel responses and measured voxel responses for each ROI from all data-driven-unpretrained layers. The results were averaged across four subjects.
  • GNet hierarchy appendix_fig2 GNet hierarchy. (A) Visualization of the hierarchical structure of the ventral visual pathway. (B) The correlation between predictive voxel responses and measured voxel responses for each ROI from all data-driven-pretrained layers. (C) The correlation between predictive voxel responses and measured voxel responses for each ROI from all data-driven-unpretrained layers. The results were averaged across four subjects.
  • GNet network dissection. appendix_fig3 GNet network dissection. (A) The number of voxels that are detected on GNet (goal-driven-pretrained). (B) The number of voxels that are detected on GNet (data-driven-pretrained). (C) The number of voxels that are detected on GNet (data-driven-unpretrained). Floc ROIs are aligned with the last convolutional layer of GNet for network dissection.