Modelling Attention Control Using A Convolutional Neural .

2y ago

32 Views

2 Downloads

2.46 MB

20 Pages

Last View : 1d ago

Last Download : 3m ago

Upload by : Javier Atchley

Report this link

Download PDF

Transcription

Visual CognitionISSN: 1350-6285 (Print) 1464-0716 (Online) Journal homepage: https://www.tandfonline.com/loi/pvis20Modelling attention control using a convolutionalneural network designed after the ventral visualpathwayChen-Ping Yu, Huidong Liu, Dimitrios Samaras & Gregory J. ZelinskyTo cite this article: Chen-Ping Yu, Huidong Liu, Dimitrios Samaras & Gregory J. Zelinsky (2019):Modelling attention control using a convolutional neural network designed after the ventral visualpathway, Visual Cognition, DOI: 10.1080/13506285.2019.1661927To link to this article: shed online: 05 Sep 2019.Submit your article to this journalArticle views: 26View related articlesView Crossmark dataFull Terms & Conditions of access and use can be found ation?journalCode pvis20

VISUAL 1927Modelling attention control using a convolutional neural network designed afterthe ventral visual pathwayChen-Ping Yua,c, Huidong Liua, Dimitrios Samarasa and Gregory J. Zelinskya,baDepartment of Computer Science, Stony Brook University, Stony Brook, NY, USA; bDepartment of Psychology, Stony Brook University, StonyBrook, NY, USA; cDepartment of Psychology, Harvard University, Cambridge, MA, USAABSTRACTARTICLE HISTORYWe recently proposed that attention control uses object-category representations consisting ofcategory-consistent features (CCFs), those features occurring frequently and consistently across acategory’s exemplars [Yu, C.-P., Maxﬁeld, J. T., & Zelinsky, G. J. (2016). Searching for categoryconsistent features: A computational approach to understanding visual category representation.Psychological Science, 27(6), 870–884.] Here we extracted from a Convolutional Neural Network(CNN) designed after the primate ventral stream (VsNet) CCFs for 68 object categories spanninga three-level category hierarchy, and evaluated VsNet against the gaze behaviour of peoplesearching for the same categorical targets. We also compared its success in predicting attentioncontrol to two other CNNs that diﬀered in their degree and type of brain inspiration. VsNet notonly replicated previous reports of stronger attention guidance to subordinate-level targets, butwith its powerful CNN-CCFs it predicted attention control to individual target categories.Moreover, VsNet outperformed the other CNN models tested, despite these models having moretrainable convolutional ﬁlters. We conclude that CCFs extracted from a brain-inspired CNN canpredict goal-directed attention control.Received 4 March 2019Accepted 12 August 2019The brain’s ability to ﬂexibly exert top-down controlover motor behaviour is key to achieving visuomotorgoals and performing everyday tasks (Ballard &Hayhoe, 2009), but a neurocomputational understanding of goal-directed attention control is still in itsinfancy. Here we introduce VsNet, a neurocomputational model inspired by the primate ventral streamof visually-responsive brain areas, that predicts attention control by learning the representative visual features of an object category.VsNet advances existing models of attentioncontrol in several respects. First, it is image computable, meaning that it accepts the same visuallycomplex and unlabelled imagery that ﬂoods continuously into the primate visual system (see also Adeli,Vitu, & Zelinsky, 2017; Zelinsky, Adeli, Peng, &Samaras, 2013). This is essential for a model aimedat understanding attention control in the real world,as objects do not come with labels telling us whatand where they are. Note that although there areseveral excellent image-computable models ofﬁxation prediction (Bylinskii, Judd, Oliva, Torralba, &CONTACT Gregory J. Zelinskygregory.zelinsky@stonybrook.edu 2019 Informa UK Limited, trading as Taylor & Francis GroupKEYWORDSBrain-inspired CNN; attentioncontrol; categorical search;category-consistent featuresDurand, 2016), these are all in the context of a freeviewing task and therefore outside of our focus ongoal-speciﬁc attention control.1 Second, VsNet isamong the ﬁrst uses of a convolutional neuralnetwork (CNN) to predict goal-directed attention.CNNs are one class of artiﬁcial deep neural networksthat have been setting new performance benchmarksover diverse domains, not the least of which is theautomated (without human input) recognition of visually-complex categories of objects (He, Zhang, Ren, &Sun, 2016; Krizhevsky, Sutskever, & Hinton, 2012; Russakovsky et al., 2015; Simonyan & Zisserman, 2015).However, CNN models that predict goal-directedattention control are still uncommon (Adeli & Zelinsky,2018; Zhang et al., 2018). A third and core source ofVsNet’s capacity to predict attention control is itsextraction of the visual features from image exemplarsthat are most representative of an object category. Inshort, VsNet harnesses the power of deep learning toextract the category-consistent features (Yu, Maxﬁeld, &Zelinsky, 2016) used by the ventral visual areas tocontrol the goal-directed application of attention.Psychology B-240, Stony Brook University, Stony Brook, NY 11794–2500, USA

2C.-P. YU ET AL.VsNet is novel in that it is a brain-inspired CNN. Thereis the start of an interesting new discussion about howbiological neural networks might inform the design ofartiﬁcial deep neural networks (Grill-Spector, Weiner,Gomez, Stigliani, & Natu, 2018; Kietzmann, McClure,& Kriegeskorte, 2019), and VsNet is the newestaddition to this discussion. Our approach is neurocomputational in that, given the many ways that CNNs canbe built, we look to the rich neuroscience literature fordesign inspiration and parameter speciﬁcation. Mostbroadly, VsNet is a multi-layered deep network,making its architecture analogous to the layers ofbrain structures existing along the ventral pathway.The brain’s retinotopic application of ﬁlters throughoutmost of these ventral areas also embody a parallelizedconvolution similar to unit activation across a CNN’slayers (Cadieu et al., 2014; Hong, Yamins, Majaj, &DiCarlo, 2016; Khaligh-Razavi & Kriegeskorte, 2014;Yamins et al., 2014). This parallel between a CNNand the ventral stream’s organization has not goneunnoticed (Kriegeskorte, 2015), and unit activationacross the layers of a CNN has even been used topredict neural activity recorded from brain areas inresponse to the same image content (Cadieu et al.,2014; Yamins et al., 2014). VsNet extends this workby making the architecture of its layers also braininspired, each modelled after a speciﬁc brain area inthe primate ventral stream. In contrast, existing neurocomputational eﬀorts have used either AlexNet (Krizhevsky et al., 2012) or one of its feed-forwardvariants (Simonyan & Zisserman, 2015; Szegedy, Liu,Jia, Sermanet, & Reed, 2015; Zeiler & Fergus, 2014),which are pre-trained CNNs designed purely to 2012 challenge, also known as ImageNet, Russakovsky et al., 2015) without regard for the structuraland functional organization of the primate ventralvisual system. The same disregard for neurobiologicalconstraint applies to later generations of deep networks using diﬀerent architectures (He et al., 2016;Huang, Liu, Van Der Maaten, & Weinberger, 2017;Zagoruyko & Komodakis, 2016). Determining howVsNet’s performance compares to less brain-inspiredCNNs is one broad aim of our study, with our hypothesis being that a model’s predictive success willimprove as its architecture becomes more like thatof the primate brain.A second broad aim is to predict people’s goaldirected allocation of overt attention as they searchfor categories of objects. CNNs have been used topredict the bottom-up allocation of attention inscenes (Huang, Shen, Boix, & Zhao, 2015; Li & Yu,2015; Wang & Shen, 2017), but they have only juststarted to be used to model the top-down controlof attention (Adeli & Zelinsky, 2018; Zhang et al.,2018). We operationally deﬁne attention control asthe degree that eye movements from human participants are guided to targets in a categorical searchtask. The spatial locations ﬁxated via eye movementsare an ideal behavioural ground truth for ourpurpose, as an eye movement is the most basicobservable behaviour linked to a covert shift ofspatial attention (Deubel & Schneider, 1996). Ourfocus on categorical search is similarly perfect. Categorical search, the search for an object designated onlyby its category name, can be contrasted with exemplar search, the more common task where participants are cued with an image showing the exactobject that they are to search for. Categoricalsearch therefore blends a highly nontrivial objectclassiﬁcation task with a gold-standard measure ofattention control, the oculomotor guidance of gazeto a target (Zelinsky, 2008).While historically a neglected task for studyingattention control (see Zelinsky, Peng, Berg, &Samaras, 2013, for discussion), interest in categoricalsearch has accelerated in recent years (e.g., Cohen,Alvarez, Nakayama, & Konkle, 2016; Hout, Robbins,Godwin, Fitzsimmons, & Scarince, 2017; Nako, Wu, &Eimer, 2014; Peelen & Kastner, 2011), a growthfuelled by several key observations: (1) that attentioncan be guided to target categories, as exempliﬁedby the above-chance direction of initial search saccades to target category exemplars in search arrays(Yang & Zelinsky, 2009), (2) that the strength of thecontrol signal guiding attention to categoricaltargets depends on the amount of target-deﬁninginformation provided in the category cue (e.g., stronger guidance for “work boot” than “footwear”;Schmidt & Zelinsky, 2009), (3) that search is guidedto distractors that are visually similar to the target category (guidance to a hand fan when searching for abutterﬂy; Zelinsky, Peng, & Samaras, 2013), (4) that guidance improves with target typicality (stronger guidance to an oﬃce chair than a lawn chair; Maxﬁeld,Stalder, & Zelinsky, 2014), and (5) that guidancebecomes weaker as targets climb the category hierarchy (the guidance to “race car” is greater than the

VISUAL COGNITIONguidance to “car,” which is greater than the guidanceto “vehicle”; Maxﬁeld & Zelinsky, 2012). It is this lattereﬀect of category hierarchy on attention control thatwas the manipulation of interest in the present study.MethodsBehavioural data collectionBehavioural data were obtained from Yu et al. (2016)and were collected using the SBU-68 dataset. Thisdataset consisted of crossly-cropped images of 68object categories that were distributed across threelevels of a category hierarchy. There were 48 subordinate-level categories, which were grouped into 16basic-level categories, which were grouped into 4superordinate-level categories. A categorical searchtask was used, and the participants were 26 StonyBrook University undergraduates. On each trial atext cue designating the target category was displayed for 2500 ms, followed by a 500 ms centralﬁxation cross and then a six-item search display consisting of objects positioned on a circle surroundingstarting ﬁxation. Distractors were from random nontarget categories and on target-present trials thetarget was selected from one of the 48 subordinatelevel categories. Participants responded “present” or“absent” as quickly as possible while maintainingaccuracy, and there were 144 target-present and144 target-absent trials presented in random order.For each target-present trial, a participant’s goaldirected attention guidance was measured as thetime taken to ﬁrst ﬁxate the cued target. Refer toYu et al. (2016) for full details of the behaviouralstimuli and procedure.Category-consistent featuresPrevious work used a generative model to predict thestrength of categorical search guidance across thesubordinate (e.g., taxi), basic (e.g., car), and superordinate (e.g., vehicle) levels of a category hierarchy (Yuet al., 2016). Brieﬂy, its pipeline was as follows. SIFT(Lowe, 2004) and colour histogram features wereextracted from 100 image exemplars of 48 object categories, and the Bag-of-Words (BoW; Csurka, Dance,Fan, Willamowski, & Bray, 2004) method was used toput these features into a common feature space. Thefeatures most visually representative of each of3these categories were then selected, what wetermed to be their Category-Consistent Features(CCFs). Speciﬁcally, responses were obtained for eachBoW feature to all the images of each of a category’sexemplars, and these responses were averaged overthe exemplars and then divided by the standard deviation in the responses to obtain a feature-speciﬁcSignal-to-Noise Ratio (SNR). A feature having a highSNR would therefore be one that occurred both frequently and consistently across a category’s exemplars. CCFs for each of the categories were obtainedby clustering the features’ SNRs and selecting thehighest.This BoW-CCF model was able to predict howbehavioural performance was aﬀected by target speciﬁcation at the three levels of the category hierarchy.For example, one speciﬁc ﬁnding was that the time ittook gaze to ﬁrst land on the target (time-to-target)increased with movement up the hierarchy, whatwas termed the “subordinate-level advantage.” BoWCCF modelled almost perfectly the observed subordinate-level advantage as a simple count of the numberof CCFs extracted for object categories at each hierarchical level; more CCFs were selected for categoriesat the subordinate level than either the basic or superordinate levels. This result was interpreted as evidencethat attention control improves with the number ofCCFs used to represent a target category (Yu et al.,2016, should be consulted for more details). Thepresent method adopts the SNR deﬁnition of CCFsfrom Yu et al. (2016), but critically uses VsNet toextract these features (see next section). Also borrowed from the previous work is the method of predicting search guidance from the number ofextracted CCFs, a measure that we ﬁnd desirable inthat it is relatively simple and intuitive (more CCFs better attention control).Extracting CNN-CCFsThe CCF method selects representative features(which may or may not be discriminative) thatappear both frequently and consistently across theexemplars of an object category, but the methoditself is largely feature independent. In previous work(Yu et al., 2016) these CCFs were selected from alarge pool of BoW features; in our current adaptationwe select CCFs from the even larger pool of featuresfrom a trained CNN, where each trained convolutionalﬁlter is considered a feature and a potential CCF. We

4C.-P. YU ET AL.hypothesize that the more powerful CNN-CCF featureswill represent more meaningful visual dimensions ofan object category. For example, whereas BoW-CCFsmight have coded the fact that many taxis areyellow and represented the various intensity gradients associated with their shape, a CNN-CCF representation of taxis might additionally capture tires,headlights, and the signs typically mounted to theirroofs. We further hypothesize that these richerfeature representations, to the extent that they arepsychologically meaningful, will enable better predictions of attention control.The speciﬁc CNN-CCF selection process is illustratedin Figure 1 for the taxi category and a hypotheticalnetwork. Given an object category with n exemplarsof size m m, and a trained CNN with L convolutionallayers each containing K ﬁlters, we forward pass allexemplars through the network to obtain an activation proﬁle of size m m n for every convolutionalﬁlter, Y (l)k , where l and k are indices to the layer andﬁlter number, respectively. To remove border artefactsintroduced by input padding, the outer 15% of eachm m activation map is set to zero. Each Y (l)k is thenreduced to a 1 n vector, yk(l) , by performing globalsum-pooling over each image’s m m activationmap. This pooling yields the overall activation ofeach ﬁlter in response to an exemplar image. Havingthese exemplar-speciﬁc ﬁlter responses, we thenborrow from the BoW-CCF pipeline and compute aSNR for each ﬁlter:SNR(l)k mean (yk(l) )std (yk(l) ),(1)where the mean and standard deviation are computed over the exemplars. Applying this equation tothe activation proﬁle from each ﬁlter produces a distribution of SNRs. Higher SNRs would indicate strongerand more consistent ﬁlter responses, making theseﬁlters good candidates for being CCFs. To identifythese CCFs we ﬁt a two-component Gamma-MixtureModel to the SNR distribution, a method similar toParametric Graph Partitioning (Yu, Hua, Samaras, &Zelinsky, 2013; Yu, Le, Zelinsky, & Samaras, 2015). Weuse a Gamma distribution because it has beenshown to model spiking neuron activity (Li et al.,2017; Li & Tsien, 2017), and we observed that itdescribes our CNN SNR distributions very well. TheCCFs are then deﬁned as the ﬁlters having SNRshigher than the crossover point of the two Gammacomponents. This pipeline for extracting CNN-CCFswas applied on each convolutional layer independently, as ﬁlter activations have diﬀerent ranges atdiﬀerent layers. Of the 500 training and 50 validationFigure 1. Pipeline of the CNN-CCF extraction method. (A) A set of category exemplars, in this case images of taxis, are input into atrained CNN. (B) Activation maps (or feature maps) in response to each exemplar are obtained for every convolutional ﬁlter at eachlayer. Shown are 64-cell activation maps in a hypothetical layer, where each cell indicates a convolutional ﬁlter’s response to agiven exemplar. In this example, 64 SNRs would be computed (12 shown) by analyzing activation map values for each of the 64ﬁlters across the taxi exemplars. (C) A two-component Gamma mixture model is ﬁt to the distribution of SNRs, (D) and the crossover point determines the CCF selection threshold. (E) Filters having SNRs above this threshold are retained as the CCFs for a givencategory ( ); ﬁlters having below-threshold SNRs are dropped ( ).

VISUAL COGNITIONimages that were used for each of the 48 tested categories (see the ImageNet Training section fordetails), only the 50 validation images were used toextract a given category’s CCFs. The training imageswere therefore used to learn the ﬁlters, whereas thevalidation images were used to extract the CCFs.Designing and comparing brain-inspired CNNsTo date, the design of neural network architectureshas focused on improving network performanceacross a range of applications, the vast majority ofwhich are non-biological. Design choices have therefore been largely ad hoc and not informed by eitherthe voluminous work on the organization and functionof the primate visual system, or by the equally voluminous literature on visual attention and its role in controlling behaviour. Our broad perspective is that, tothe extent one’s goal is to understand the primatevisual system by building a computational model, itis a good idea to use these literatures to inform thedesign of new model architectures so as to be moreclosely aligned with what is known about theprimate brain. This is particularly true for the primatevisual attention system, where there are rich theoretical foundations in the behavioural and neuroscienceliteratures that are relatively easy to connect to CNNmodelling methods.VsNet is a rough ﬁrst attempt to build such a braininspired deep neural network, and its detailed pipelineis shown in Figure 2 (top). This eﬀort is “rough”because the neural constraints that we introducerelate only to the gross organization of brain areasalong the primate ventral visual stream. There are farmore detailed levels of system organization that wecould had also considered, but as a ﬁrst pass wedecided to focus on only the gross network architecture. In our opinion this level would likely reveal thegreatest beneﬁt of a brain-inspired design, with theexpectation that future, more detailed brain-inspiredmodels would only improve prediction of attentioncontrol.Speciﬁcally, we designed VsNet to reﬂect fourwidely accepted and highly studied properties of theventral pathway. First, VsNet’s ﬁve convolutionallayers are mapped to the ﬁve major ventral brainstructures (DiCarlo & Cox, 2007; Kobatake & Tanaka,1994; Kravitz, Kadharbatcha, Baker, Ungerleider, &Mishkin, 2013; Mishkin, Ungerleider, & Macko, 1983;Serre, Kreiman, et al., 2007). VsNet has a V1, a V2, a5combined hV4 and LOC1/2 layer that we refer to asV4-like, a PIT, and a CIT/AIT layer, with these ﬁve convolutional layers followed by two fully-connectedclassiﬁcation layers. Second, the number of ﬁlters ineach of VsNet’s ﬁve convolutional layers are proportional to the number of neurons, estimated bybrain surface area (Orban, Zhu, & Vanduﬀel, 2014;Van Essen et al., 2001), in the corresponding ﬁvebrain structures. Third, the range of ﬁlter sizes ateach layer is informed by the range of receptive ﬁeldsizes for visually responsive neurons in the corresponding structures. And fourth, VsNet diﬀers fromother strictly feedforward architectures in that itadopts a brain-inspired implementation of bypassconnections based on known connectivity betweenlayers in the primate ventral visual stream. SeeFigure 2 and the VsNet Design section for additionalarchitectural design details.Our CNN-CCF extraction algorithm is general, andcan be applied to the ﬁlter responses from any pretrained CNN. This makes model comparison possible.In addition to extracting CNN-CCFs from VsNet, weused the identical algorithm to extract CNN-CCFsfrom two other CNNs. One of these was AlexNet (Krizhevsky et al., 2012), a widely used CNN also consisting of ﬁve convolutional and two fully-connectedlayers. Although AlexNet’s design was not braininspired, it has been used with good success inrecent computational neuroscience studies (Cadieuet al., 2014; Hong et al., 2016; Khaligh-Razavi & Kriegeskorte, 2014) and is therefore of potential interest.More fundamentally, it will serve as a baseline againstwhich the more brain-inspired networks can be compared, which is important to gauge broadly how theinclusion of neural constraints in a CNN’s designtranslates into improved prediction performance.We also extracted CNN-CCFs from a model that weare calling Deep-HMAX, our attempt to create aCNN version of the inﬂuential HMAX model ofobject recognition (Serre, Oliva, & Poggio, 2007).HMAX was designed to be a biologically plausiblemodel of how the recognition of visually complexobjects might be implemented in ventral brain circuitry (Riesenhuber & Poggio, 1999; Tarr, 1999), but itcannot be fairly compared to more recent and powerful convolutional network architectures. Our DeepHMAX model keeps the basic architectural designelements of HMAX intact, with the most centralamong these being the inclusion of simple and

6C.-P. YU ET AL.Figure 2. The architectures of VsNet and Deep-HMAX. Each blue box represents a convolutional layer, with the corresponding ventralpathway area labelled above. Pink circles are Depth-Concat layers that concatenate the input maps from the depth dimension. Arrowsindicate input to output direction, dashed arrows represent max-pooling layers and their kernel sizes and strides, yellow arrows represent dimensionality reduction via 1 1 ﬁlters, and blue arrows are skip connections that can be either a direct copy (dark blue) or adimensionality-reduced copy (light blue) via 1 1 ﬁlters. Green rectangles within each layer represent a set of ﬁlters, where the numberof ﬁlters is in red, followed by the ﬁlter size, stride size, and the corresponding receptive ﬁeld (RF) size in visual angle shown in parentheses (assuming 1 spans 5 pixels). Note that both VsNet and Deep-HMAX attempt to match the RF sizes of the convolutional ﬁltersin each layer to the range of the RF size estimates in each of the ﬁve human ventral visual pathway areas. These target RF size ranges areindicated at the bottom of each VsNet layer (see the Receptive Field Size section for details on how these estimates were obtained). Eachconvolutional ﬁlter is followed by a Batch Normalization layer (BatchNorm; Ioﬀe & Szegedy, 2015) and a Rectiﬁed Linear activation layer(ReLU).complex cell units, but replaces the originally handcrafted units with convolutional layers that learn thesimple and complex cell responses from visualinput, thereby making possible a more direct comparison to VsNet. Figure 2 (bottom) shows the architecture of Deep-HMAX, and additional details can befound in the ImageNet Training section. Broadly, themodel has a very diﬀerent architecture than VsNet,with one example being that it uses 10 convolutionaland two fully-connected layers. By comparing DeepHMAX and VsNet it is therefore possible to see howa fairly gross level of brain organization might aﬀectnetwork performance. Note also that VsNet was computationally disadvantaged in these comparisonsbecause it used the smallest number of convolutionalﬁlters to predict attention control; AlexNet has 1152ﬁlters, Deep-HMAX 1760, but VsNet only 726 (excluding 1 1 dimensionality-reduction ﬁlters). This conservative design means that, to the extent thatVsNet better predicts attention control than theother models, this beneﬁt would likely be due to itsbrain-inspired architecture and not simply greatercomputational power.2Vsnet designVsNet is brain-inspired in three key respects: thenumber of ﬁlters at each convolutional layer is proportional to the estimated number of neurons in thecorresponding brain structure, the sizes of ﬁlters ateach layer are proportional to neuron receptive ﬁeldsizes in corresponding structures, and the gross connectivity between its layers is informed by connectivity between structures in the primate ventralvisual stream. Each of these brain-inspired constraintswill be discussed in more detail. With respect toVsNet’s broad mapping of convolutional layers tobrain structures, the mappings of its ﬁrst layer to V1and its second layer to V2 are relatively noncontroversial. However, we wanted VsNet’s third convolutionallayer to map to V4, a macaque brain area, and identifying a homolog to V4 in humans is less

VISUAL COGNITIONstraightforward. A structure has been identiﬁed as“human V4” (hv4), and neurons in this structure areorganized retinotopically (Brewer, Liu, Wade, &Wandell, 2005; Fize et al., 2003; McKeefry & Zeki,1997) like macaque V4, but their feature selectivitiesare somewhat diﬀerent. Macaque V4 neurons areselective to colour, shape, and boundary conformation(Cadieu et al., 2007; Desimone, Schein, Moran, &Ungerleider, 1985; Pasupathy & Connor, 2002),whereas neurons in hV4 respond mainly to justcolour and occupy a proportionally much smaller cortical surface area (Brewer et al., 2005; Larsson &Heeger, 2006; McKeefry & Zeki, 1997). For humans,shape and boundary and other object-related processing likely occurs in lateral occipital areas 1 and 2 (LO1/2; Larsson & Heeger, 2006). LO1/2 is also retinotopically organized and is anatomically adjacent to hV4(Van Essen et al., 2001). In an eﬀort to obtain asuﬃciently large number of learnable mid-level features, we therefore map VsNet’s third convolutionallayer to a combination of hV4 and LO1/2, referred tohere as “V4-like.” Our intent was to map VsNet’sdeeper layers to IT, and decisions had to be madeabout these mappings as well. To keep congruencewith the monkey neurophysiology literature, wespeciﬁcally wanted to identify human homologs tomacaque TEO and TE. For VsNet’s fourth layer wesettled on a structure anterior to hV4, termed“human TEO” in (Beck, Pinsk, & Kastner, 2005;Kastner et al., 2001; Kastner, Weerd, Desimone, &Ungerleider, 1998) and PIT elsewhere (Orban et al.,2014), and for its ﬁfth layer we chose central andanterior inferotemporal cortex (CIT AIT; Rajimehr,Young, & Tootell, 2009), roughly macaque TE. We willshow that CNN-CCFs extracted from VsNet, a CNNhaving this more primate-like architecture, better predicts primate behaviour.Ventral stream surface areasThe numbers of convolutional ﬁlters in VsNet’s layerswere based on estimates of human brain surfaceareas in the mapped structures. Speciﬁcally, V1, V2and V4-like surface areas were estimated to be2323 mm2, 2102 mm2, and 2322 mm2, respectively(Larsson & Heeger, 2006). For PIT and CIT AIT, weestimated their surface areas to be approximately 9times larger than the surface areas in the corresponding macaque structures (TEO and TE, respectively;Orban et al., 2014), based on reported diﬀerences in7cortical size between macaque and human (VanEssen et al., 2001). This resulted in an estimate of PIThaving a surface area of 3510 mm2, and of CIT AIThaving a surface area of 3420 mm2. Having thesesurface area estimates, one approach might make proportional allocations of convolutional ﬁlters at eachlayer, but this would ignore the fact that some ofthese structures have a retinotopic organization. Retinotopy requires that the RFs of neurons having similarfeature selectivities are tiled across the visual ﬁeld inorder to obtain location-speciﬁc information, andthis duplication of neurons is a major factor determining the surface area of some brain structures. CNNshave no retinotopy; their ﬁlters are convolved with avisual input rather than duplicated and tiled over animage. To equate the two, we derive a duplicationfactor that estimates the latent number of uniquelyselective neurons within each brain structure, andthen makes the number of convolutional ﬁlters inthe corresponding layer proportional to this estimate.In doing this we make a simple assumption. If theaverage RF size for a neuron in a ventral stream structure is as large as the entire visual ﬁeld, then therewould be no need for the retinotopic duplication ofthis type of neuron for the purpose of capturing information from across the visual ﬁeld. This would lead toa duplication factor of 1. However, if in this examplethe average RF size for a neuron covers only aquarter of the visual ﬁeld, then there would minimallyneed to be four neurons of this type organized retinotopically to cover the entire visual ﬁeld. This wouldlead to a duplication factor of 4. More gene

Modelling attention control using a convolutional neural network designed after the ventral visual pathway Chen-Ping Yua,c, Huidong Liua, Dimitrios Samarasa and Gregory J. Zelinskya,b aDepartment of Computer Science, Stony Brook University, Stony Brook, NY, USA; bDepartment of Psychology, Stony Brook University, Stony Brook, NY, USA; cD

Related Documents:

Rockin' crystals engineering themed activity pack

5. Who can grow the largest crystal from solution? Modelling crystals 15 . 1. Modelling a salt crystal using marshmallows 2. Modelling crystals using cardboard shapes 3. Modelling diamond and graphite 4. Modelling crystal growth using people. More about crystals 21 . 1. Crystalline or plastic? 2. Make a crystal garden. Putting crystals to use .

12 Views

11m ago

Business Process Modelling in Production Logistics: Complementary Use ...

follow using state-of-the- art modeling tool of BPMN 2.0 and UML. Key words: Computer-aided systems Production logistics Business process modelling BPMN 2.0 UML Modelling techniques INTRODUCTION Business Process Execution Language for web Business Process Modelling (BPM) as the main core Business Process Modelling Notation (BPMN) to

16 Views

5m ago

Simplified FEM modelling for the collapse assessment of a masonry vault

and simplified method to describe masonry vaults in global seismic analyses of buildings. Fig. 1 summarizes three different modelling techniques for ma sonry modelling, respectively, mi cro- , macro- and simplified micro modelling. In the case a micro modelling approach is take n, the challenge is to describe the complex behavior of the

31 Views

1y ago

Agile Modelling in Software Engineering - TUM

Agile Modelling is a concept invented in 1999 by Scott Ambler as a supplement to Extreme Pro-gramming (XP) [Source: Agile Modelling Values]. Strictly deﬁned, Agile Modelling (AM) is a chaordic, practices-based methodology for eﬀective modelling and documentation [Source: Interview with SA by Clay Shannon].

20 Views

1y ago

Integratingpartandassemblymodelling - TU Delft

equately support part modelling, i.e. modelling of product elements that are manufactured in one piece. Modelling is here based on requirements from part-oriented applica-tions, such as a minimal width for a slot in order to be able to manufacture it. Part modelling systems have evolved for some time now, and different modelling concepts have

21 Views

1y ago

s l a t n e a d n u F g n i l e d o M l a i c n Financial Modelling a n ...

Financial Statements Modelling www.bestpracticemodelling.com Page 5 of 40 Financial Statements Module Location 1.2. Financial Statements Modelling Overview The modelling of the financial statements components of an entity is a unique area of spreadsheet modelling, because it involves the systematic linking in of information from

12 Views

11m ago

CIBSE Application Manual AM11 'Building Performance Modelling' Chapter ...

CIBSE Application Manual AM11 'Building Performance Modelling' Chapter 8: Modelling of plant and renewable energy systems AM11 Overview Seminar: March 15th 2016 3 Theme 1 -simplified modelling using control functions The plant is not modelled -energy and ventilation flows are modelled using band-limited deviations from set points

5 Views

4m ago

Individual-Based Modelling

BACKGROUND AND EMERGENCE OF INDIVIDUAL-BASED strong MODELLING /strong Individual-based strong modelling /strong (IBM) is a topic that has been receiving rapidly increasing attention in ecology for more than 10 years. The introduction of IBM extended the set of available strong modelling /strong techniques. Before the 1990s, the differential equationŒbased approach was widely

28 Views

3y ago

Recent Views

A Message From Prosecutor Walsh Domestic Violence Awareness

The Summit ounty Prosecutor's Office has a new campaign to help inform the public about what it's like to be a prosecutor. Each month, one of our assistant prosecutors explains why they chose to be a prosecutor. This month Assistant Prosecutor Elliot Kolkovich discusses the reasons why he is a prosecutor. prosecutor is holding people

1y ago

114 Views

Check Your IPAC Prosecutor Page for Accuracy, Current Content - in

IPAC web site Find Your Prosecutor page? The main Find Your Prosecutor page lists the name, contact information and web site (if any) for each county. It also links to a page for each prosecutor that has space for a photograph and biography. The public uses IPAC's Find Your Prosecutor pages to find out about their local prosecutor. In the 4th .

1y ago

132 Views

Carolyn A. Murray Acting Essex County Prosecutor

2013 Annual Report of Essex County Prosecutor's Office Executive Staff Left to right, first row: Acting Essex County Prosecutor Carolyn A. Murray, New Jersey Attorney General Jeffery S. Chiesa, First Assistant Prosecutor Robert D. Laurino. Second row: Chief Assistant Prosecutor Keith Harvest, Public Information Officer

1y ago

115 Views

Office of the Public Prosecutor CODE OF ETHICS - Gov

the prosecutor has demonstrated actual bias or prejudice towards an accused, complainant or witness; ii) the prosecutor previously served as counsel for the other party, or . was a material witness in the prosecution; iii) the prosecutor, or a member of the prosecutor's family, has an interest in the outcome of the prosecution;

1y ago

107 Views

Role, Functions, and Duties of the Prosecutor & How to Succeed

prosecutor's office should exercise sound discretion and independent judgment in the performance of the prosecution function. (b) The primary duty of the prosecutor is to seek justice within the bounds of the law, not merely to convict. The prosecutor serves the public interest and should act with integrity and balanced judgment to increase

1y ago

115 Views

Mahoning County Prosecutor'S Office Annual Report

Chief Assistant Prosecutor Chief, Civil Division 330-740-2330 v 3 Karen Gaglione Assistant Chief, Civil Division 330-740-2330 Mahoning County Prosecutor's Office kgaglione@mahoningcountyoh.gov 21 W. Boardman Street, 6th Floor, Youngstown, OH 44503 (T) 330-740-2330 (F) 330-740-2008 Website: prosecutor .

1y ago

118 Views

The Kansas Prosecutor

The Associate Member Prosecutor of the Year Award is presented to a prosecutor for outstanding prosecution of a case or cases throughout the year from an office other than a County or District Attorney's office. Nominations may be made by either the prosecutor himself/herself or by a colleague. The nominee must be an associate member of the .

1y ago

121 Views

The Prosecutor - Montgomery County, Ohio

The Prosecutor is published as a public service by the Montgomery County Prosecutor's Office. For questions or comments about articles appearing in The Prosecutor, or to recommend topics you'd like to see, please contact: Mr. Greg Flannagan, Public Information Officer at 937-225-5610 or e-mail info@mcpo.com Office Staff Updates

1y ago

104 Views

A Message From Prosecutor Walsh The Role Of A Prosecutor

Assistant Prosecutor with the ivil Division and has worked with my office for over 14 years. Annie Spitali began her career in the hild Support Division in 1996 and is currently a hild Support Supervisor. And Heaven Guest has been with my office since 2003 and is currently an Appellate Prosecutor. Your awards are well deserved!

1y ago

111 Views

How Prosecutor Elections Fail Us - Ohio State University

the prosecutor has applied the criminal law according to public values. This article surveys the typical rhetoric in prosecutor election campaigns, drawing on a new database that collects news accounts of candidate statements during prosecutor elections. Those statements reflect the candidates' claims about

1y ago

111 Views

TRIAL STRATEGIES FOR THE PROSECUTION OF SEXUAL ABUSE IN .

Trial Preparation Sherry Sullivan is transported to the prosecutor’s office for a trial prep session. Sherry says she doesn’t want to talk about the rapes; she just wants the prosecutor to talk to her about what will happen during the trial. The prosecutor spends 45 minutes talking about the process. Sherry says she will talk to

3y ago

128 Views

English .: ICC-01/18 Date

No. ICC-01/18 2/30 16 March 2020 Document to be notified in accordance with regulation 31 of the Regulations of the Court to: The Office of the Prosecutor Ms. Fatou Bensouda, Prosecutor Mr James Stewart, Deputy Prosecutor

3y ago

130 Views

Paul C. Dedinsky, J.D., Ph.D. 5737 North Kent Avenue .

Sensitive Crimes Prosecutor, 1999–2001, Assistant to E. Michael McCann, serving as a Sexual Assault prosecutor. Misdemeanor, Domestic Violence and Delinquency Prosecutor, 1997–1999 Private Practice Law Offices, Milwaukee, WI, 1994–1997 Principal Attorney, specializing in criminal defe

2y ago

122 Views

Open letter to the Chief Prosecutor of the International .

whether that meant that the Prosecutor would defer to a national investigation or that the Prosecutor would at some point resume the investigation, either after the settlement negotiations were successful or if they failed, and what

2y ago

116 Views

DOCUMENT RESUME ED 188 049 Prosecutor's Responsibility

DOCUMENT RESUME. ED 188 049. CG 014 443. Prosecutor's Responsibility in Spouse Abuse Cases. INSTITUTION National District Attorneys Association, Chicago, . The role of the prosecutor in spouse assault cases was the subjct of a conference organized by the National-District Att

2y ago

185 Views

Modelling Attention Control Using A Convolutional Neural .

It looks like you're using an ad-blocker