Shape perception

形状感知
  • 文章类型: Journal Article
    尽管跨多种感官的信息集成可以增强记忆中的对象表示,多感官信息如何影响类别的形成是不确定的。特别是,目前尚不清楚多感官信息形成的类别在多大程度上有利于对象识别,而不是单感官输入。两个实验研究了新颖的听觉和视觉对象的分类,类别由空间相似性定义,并测试了对新范例的概括。参与者学会了根据仅视觉(几何形状)对样本进行分类,仅听觉(空间定义的音景)或视听空间线索。然后在相同的感官学习条件下测试学习的分类以及新颖的示例。对于所有学习方式,归类推广到新颖的范例。然而,没有证据表明学习的多感官样本的分类表现得到了增强.充其量,双峰性能接近最精确的单峰条件,尽管仅在类别中的一部分样本中观察到了这种情况。这些发现提供了对类别形成所涉及的感知过程的见解,并且与理解支撑这些类别的对象表示的感官性质有关。
    Although the integration of information across multiple senses can enhance object representations in memory, how multisensory information affects the formation of categories is uncertain. In particular, it is unclear to what extent categories formed from multisensory information benefit object recognition over unisensory inputs. Two experiments investigated the categorisation of novel auditory and visual objects, with categories defined by spatial similarity, and tested generalisation to novel exemplars. Participants learned to categorise exemplars based on visual-only (geometric shape), auditory-only (spatially defined soundscape) or audio-visual spatial cues. Categorisation to learned as well as novel exemplars was then tested under the same sensory learning conditions. For all learning modalities, categorisation generalised to novel exemplars. However, there was no evidence of enhanced categorisation performance for learned multisensory exemplars. At best, bimodal performance approximated that of the most accurate unimodal condition, although this was observed only for a subset of exemplars within a category. These findings provide insight into the perceptual processes involved in the formation of categories and have relevance for understanding the sensory nature of object representations underpinning these categories.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Case Reports
    人类对物体的视觉体验包括视觉特征的组合,比如颜色,position,和形状。空间注意力被认为在创造连贯的感知体验中发挥作用,整合来自给定位置的视觉信息,但是这个过程背后的机制还没有完全理解。空间注意力的缺陷,这种整合过程通常不会发生,比如忽视,可以提供有关视觉对象识别中空间注意力机制的见解。在这项研究中,我们描述了对一个人进行的一系列实验,DH.DH表现出对单个物体左侧缺乏意识的特征,通过不良的物体和面部识别来证明,和受损的单词阅读。然而,他在他无法识别的相同物体的边界内表现出对颜色的完整识别。此外,他也可以报告的方向和位置的着色区域在被忽视的左侧,尽管缺乏意识的形状的区域。总的来说,尽管在同一空间位置对基本视觉特征进行了完整的处理,但DH仍显示出选择性的形状意识不足。DH的表现提出了关于空间注意力在连贯的对象感知和视觉意识形成中的作用的有趣问题和挑战。
    Human visual experience of objects comprises a combination of visual features, such as color, position, and shape. Spatial attention is thought to play a role in creating a coherent perceptual experience, integrating visual information coming from a given location, but the mechanisms underlying this process are not fully understood. Deficits of spatial attention in which this integration process does not occur normally, such as neglect, can provide insights regarding the mechanisms of spatial attention in visual object recognition. In this study, we describe a series of experiments conducted with an individual with neglect, DH. DH presents characteristic lack of awareness of the left side of individual objects, evidenced by poor object and face recognition, and impaired word reading. However, he exhibits intact recognition of color within the boundaries of the same objects he fails to recognize. Furthermore, he can also report the orientation and location of a colored region on the neglected left side despite lack of awareness of the shape of the region. Overall, DH shows selective lack of awareness of shape despite intact processing of basic visual features in the same spatial location. DH\'s performance raises intriguing questions and challenges about the role of spatial attention in the formation of coherent object percepts and visual awareness.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    视觉拥挤是指在被其他刺激(干扰物)包围时,容易孤立地识别的目标对象变得难以识别的现象。许多心理物理学研究已经调查了这种现象,并提出了潜在机制的替代模型。一个突出的假设,尽管有混合的心理物理学支持,假设拥挤是由于在皮质视觉处理的早期阶段对目标和干扰物刺激的特征进行汇总编码而导致的信息丢失。然而,神经生理学研究尚未严格检验这一假设。我们研究了猕猴(一只雄性,一名女性)区域V4,腹侧的中间阶段,对象处理路径,参数化设计的拥挤显示器及其纹理统计匹配的同色异谱对应物。我们的调查揭示了拥挤参数之间的惊人相似之处,例如,number,距离,和分心者的位置,影响人类心理物理性能和V4形状选择性。重要的是,我们还发现,增强目标刺激的显著性可以减轻高度杂乱场景中的拥挤效应,这可能在时间上延长,反映了一个动态过程。因此,附近刺激的汇总编码无法解释观察到的反应,我们提出了一种替代模型,其中V4神经元优先在拥挤的显示器中编码显着的刺激。总的来说,我们得出的结论是,拥挤效应的大小不仅取决于干扰物的数量和目标-干扰物的分离,还取决于目标与干扰物的相对显著性,这是基于它们的特征属性-干扰物的相似性,以及目标刺激和干扰刺激之间的对比。意义陈述心理物理学家长期以来一直研究视觉拥挤的现象,但潜在的神经机制是未知的。我们的结果揭示了中层视觉皮层区V4中神经元的反应与心理物理演示之间的惊人相关性,揭示了拥挤不仅受干扰物的数量和空间排列的影响,而且受目标和干扰物特征相似性的影响,以及干扰者本身。总的来说,我们的研究提供了强有力的证据,证明视觉系统使用策略来优先编码视觉场景中的显著特征,大概是为了有效地处理视觉信息.当附近的多个刺激同样显著时,拥挤的现象随之而来。
    Visual crowding refers to the phenomenon where a target object that is easily identifiable in isolation becomes difficult to recognize when surrounded by other stimuli (distractors). Many psychophysical studies have investigated this phenomenon and proposed alternative models for the underlying mechanisms. One prominent hypothesis, albeit with mixed psychophysical support, posits that crowding arises from the loss of information due to pooled encoding of features from target and distractor stimuli in the early stages of cortical visual processing. However, neurophysiological studies have not rigorously tested this hypothesis. We studied the responses of single neurons in macaque (one male, one female) area V4, an intermediate stage of the object-processing pathway, to parametrically designed crowded displays and texture statistics-matched metameric counterparts. Our investigations reveal striking parallels between how crowding parameters-number, distance, and position of distractors-influence human psychophysical performance and V4 shape selectivity. Importantly, we also found that enhancing the salience of a target stimulus could alleviate crowding effects in highly cluttered scenes, and this could be temporally protracted reflecting a dynamical process. Thus, a pooled encoding of nearby stimuli cannot explain the observed responses, and we propose an alternative model where V4 neurons preferentially encode salient stimuli in crowded displays. Overall, we conclude that the magnitude of crowding effects is determined not just by the number of distractors and target-distractor separation but also by the relative salience of targets versus distractors based on their feature attributes-the similarity of distractors and the contrast between target and distractor stimuli.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    分类是一个基本的认知和感知过程,这是自发发生的。然而,早期的研究往往忽略了这一过程的自发性,主要采用行为或神经影像学范式中的明确任务。这里,我们在22名健康人类参与者(男性和女性)的脑电图(EEG)过程中使用频率标记(FT)作为确定自发视觉分类处理的直接方法.从示意性的自然视觉刺激开始,我们创建了包含11个相等步骤的变形序列。反映行为分类感知歧视范式,我们采用了FT-EEG奇怪的范例,评估刺激类别内和之间同等大小差异的神经敏感性。同样,反映行为类别分类范式,我们管理了一个扫描FT-EEG奇怪的范例,从变形序列的一端扫到另一端,从而使我们能够客观地确定神经类别的边界。我们发现FT-EEG可以隐式地测量分类处理和辨别。更具体地说,我们可以得出所需水平的客观神经指数来区分这两个类别,这个神经指数显示了分类感知的典型标记(即,与类别内相比,跨类别的歧视更强)。还使用显式行为任务验证了隐式范式的神经发现。这些结果提供了证据,表明FT-EEG可以用作衡量歧视和分类的客观工具,并且人脑固有地和自发地(没有任何有意识或决策过程)使用更高级别的有意义的分类信息来解释歧义(变形)形状。重要性声明每当我们遇到新的图像或物体时,我们将自动将其与以前存储的类别相关联。这种分类过程使我们能够有效地对新信息做出反应,并影响我们的感知。分类感知的行为标志要求我们比类别内更清楚地感知类别之间的差异。先前的研究主要研究了使用显式任务的分类处理。这里,我们使用带有隐式的精心控制的变体,神经脑电图测量以评估自发分类处理。我们发现“之间”-比“内”-类别变形对的神经振幅更高,建立分类感知的神经关联。这提供了证据,表明大脑固有地和自动地使用更高级别的有意义的分类信息来解释模糊的形状。
    Categorization is an essential cognitive and perceptual process, which happens spontaneously. However, earlier research often neglected the spontaneous nature of this process by mainly adopting explicit tasks in behavioral or neuroimaging paradigms. Here, we use frequency-tagging (FT) during electroencephalography (EEG) in 22 healthy human participants (both male and female) as a direct approach to pinpoint spontaneous visual categorical processing. Starting from schematic natural visual stimuli, we created morph sequences comprising 11 equal steps. Mirroring a behavioral categorical perception discrimination paradigm, we administered a FT-EEG oddball paradigm, assessing neural sensitivity for equally sized differences within and between stimulus categories. Likewise, mirroring a behavioral category classification paradigm, we administered a sweep FT-EEG oddball paradigm, sweeping from one end of the morph sequence to the other, thereby allowing us to objectively pinpoint the neural category boundary. We found that FT-EEG can implicitly measure categorical processing and discrimination. More specifically, we could derive an objective neural index of the required level to differentiate between the two categories, and this neural index showed the typical marker of categorical perception (i.e., stronger discrimination across as compared with within categories). The neural findings of the implicit paradigms were also validated using an explicit behavioral task. These results provide evidence that FT-EEG can be used as an objective tool to measure discrimination and categorization and that the human brain inherently and spontaneously (without any conscious or decisional processes) uses higher-level meaningful categorization information to interpret ambiguous (morph) shapes.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    为了调查在没有意识的情况下局部元素是否被分组为全局形状,我们介绍了两种不同的蒙面灌注设计(例如,经典的分离范式和试验式探针和主要歧视任务),并收集了这两个目标(即,基于表现)和主观(使用感知意识量表[PAS])意识度量。使用三种不同的prime-mask刺激开始异步(SOA)和未掩盖条件来操纵prime可见性。我们的结果表明,评估主要能见度试验严重干扰了掩蔽启动,阻止了任何主要促进作用。贝叶斯回归模型的实现,预测意识水平处于偶然水平的参与者的启动效应,提供了有力的证据来支持以下假设:在没有意识到超过50毫秒的SOA的情况下,局部元素组成整体形状,这表明prime-maskSOA是在没有意识的情况下处理全局形状的关键因素。
    To investigate whether local elements are grouped into global shapes in the absence of awareness, we introduced two different masked priming designs (e.g., the classic dissociation paradigm and a trial-wise probe and prime discrimination task) and collected both objective (i.e., performance based) and subjective (using the perceptual awareness scale [PAS]) awareness measures. Prime visibility was manipulated using three different prime-mask stimulus onset asynchronies (SOAs) and an unmasked condition. Our results showed that assessing prime visibility trial-wise heavily interfered with masked priming preventing any prime facilitation effect. The implementation of Bayesian regression models, which predict priming effects for participants whose awareness levels are at chance level, provided strong evidence in favor of the hypothesis that local elements group into global shape in the absence of awareness for SOAs longer than 50 ms, suggesting that prime-mask SOA is a crucial factor in the processing of the global shape without awareness.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    目的:尽管卷积神经网络(CNN)和变压器在许多医学图像分割任务中表现良好,他们依赖于大量的标记数据进行训练。医学图像数据的标注既昂贵又耗时,因此,通常使用半监督学习方法来使用少量标记数据和大量未标记数据来提高医学成像分割的性能。
    方法:这项工作旨在使用具有形状感知和多尺度一致性正则化的三重老师交叉学习半监督医学图像分割来增强医学图像的分割性能。为了有效地利用来自未标记数据的信息,设计了一种基于形状感知的三教师交叉学习多尺度半监督方法,称为半TMS。这三个教师模型相互交叉学习,老师A和老师C利用CNN架构,而B老师采用了变压器模型。由老师A和老师C组成的交叉学习模块捕获本地和全球信息,生成伪标签,并使用预测结果进行交叉学习。将多尺度一致性正则化分别应用于CNN和变换器以提高精度。此外,来自老师A或老师C的低不确定性输出概率被用作老师B的输入,提高了先验知识的利用率和整体分割的鲁棒性。
    结果:对两个公共数据集的实验评估表明,所提出的方法优于一些现有的半分割模型,隐含地捕获形状信息,通过多尺度一致性有效提高未标记数据的利用率和准确性。
    结论:随着医学影像在临床诊断中的广泛应用,我们的方法有望成为一个潜在的辅助工具,协助临床医生和医学研究人员进行诊断。
    Objective. Although convolutional neural networks (CNN) and Transformers have performed well in many medical image segmentation tasks, they rely on large amounts of labeled data for training. The annotation of medical image data is expensive and time-consuming, so it is common to use semi-supervised learning methods that use a small amount of labeled data and a large amount of unlabeled data to improve the performance of medical imaging segmentation.Approach. This work aims to enhance the segmentation performance of medical images using a triple-teacher cross-learning semi-supervised medical image segmentation with shape perception and multi-scale consistency regularization. To effectively leverage the information from unlabeled data, we design a multi-scale semi-supervised method for three-teacher cross-learning based on shape perception, called Semi-TMS. The three teacher models engage in cross-learning with each other, where Teacher A and Teacher C utilize a CNN architecture, while Teacher B employs a transformer model. The cross-learning module consisting of Teacher A and Teacher C captures local and global information, generates pseudo-labels, and performs cross-learning using prediction results. Multi-scale consistency regularization is applied separately to the CNN and Transformer to improve accuracy. Furthermore, the low uncertainty output probabilities from Teacher A or Teacher C are utilized as input to Teacher B, enhancing the utilization of prior knowledge and overall segmentation robustness.Main results. Experimental evaluations on two public datasets demonstrate that the proposed method outperforms some existing semi-segmentation models, implicitly capturing shape information and effectively improving the utilization and accuracy of unlabeled data through multi-scale consistency.Significance. With the widespread utilization of medical imaging in clinical diagnosis, our method is expected to be a potential auxiliary tool, assisting clinicians and medical researchers in their diagnoses.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    我们环境中的许多物体和材料都会发生改变其形状的变化。例如,树枝在风中弯曲,冰融化,和纸屑。尽管如此,我们在这些变化中识别物体和材料,这表明我们可以区分物体的原始特征和那些由变换(“形状分裂”)引起的特征。然而,如果我们真正理解转换,我们不仅应该能够识别它们的签名,而且还应该积极地将转换应用于新对象(即,通过想象或心理模拟)。这里,我们使用绘图任务研究了这种能力。在平板电脑上,参与者查看了样本轮廓及其转换版本,并被要求通过绘制转换后的测试形状的外观来将相同的转换应用于测试轮廓。因此,他们必须(I)从形状差异中推断转变,(ii)设想将其应用于测试形状,(iii)得出结果。我们的发现表明,与原始测试形状相比,图纸更类似于地面真实转换的测试形状,从而证明了观察到的转换的推断和再现。然而,这只观察到相对简单的形状。该能力还通过变换类型和大小而不是通过样品和测试形状之间的相似性来调节。一起,我们的发现表明,我们可以区分原始物体形状的表示和它们的变换,并且可以使用视觉图像在心理上对观察到的对象应用非刚性转换,展示我们如何不仅感知,而且“理解”形状。
    Many objects and materials in our environment are subject to transformations that alter their shape. For example, branches bend in the wind, ice melts, and paper crumples. Still, we recognize objects and materials across these changes, suggesting we can distinguish an object\'s original features from those caused by the transformations (\"shape scission\"). Yet, if we truly understand transformations, we should not only be able to identify their signatures but also actively apply the transformations to new objects (i.e., through imagination or mental simulation). Here, we investigated this ability using a drawing task. On a tablet computer, participants viewed a sample contour and its transformed version, and were asked to apply the same transformation to a test contour by drawing what the transformed test shape should look like. Thus, they had to (i) infer the transformation from the shape differences, (ii) envisage its application to the test shape, and (iii) draw the result. Our findings show that drawings were more similar to the ground truth transformed test shape than to the original test shape-demonstrating the inference and reproduction of transformations from observation. However, this was only observed for relatively simple shapes. The ability was also modulated by transformation type and magnitude but not by the similarity between sample and test shapes. Together, our findings suggest that we can distinguish between representations of original object shapes and their transformations, and can use visual imagery to mentally apply nonrigid transformations to observed objects, showing how we not only perceive but also \'understand\' shape.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 我们对罕见综合征视觉形式失认症(VFA)进行了全面审查。我们首先记录它的历史,包括这个词的起源,第一个案例研究标记为VFA。该综合征的定义特征,正如其他人之前定义的那样,然后描述。损伤,保留的视觉感知方面,并详细描述了21例符合这些定义特征的患者的脑损伤区域,包括哪些测试用于验证关键症状的存在或不存在。由此,我们注意到患者之间的重要相似性以及明显的差异.枕叶损伤(20/21),无法识别线条图(19/21),保留的色彩视觉(14/21),视野缺损(16/21)是大多数病例的一致性区域.我们发现,在检查VFA患者的感知能力时,区分形状和形式作为不同的构造是有用的。我们的观察表明,这些患者在处理简化形式时经常遇到困难。加工方向和尺寸的缺陷并不常见。尽管通常被引用为综合征的定义特征,但运动知觉和视觉图像并未得到广泛的测试-尽管在所描述的样本中,运动知觉从未被发现是一个缺陷。此外,视力问题(例如,视力低下和视野中存在半尖牙/暗点瘤)比我们想象的更常见,也可能导致VFA患者的知觉障碍。我们得出的结论是,VFA是一种感知障碍,其中视觉系统出于理解图像整体代表的目的而将线条合成在一起的能力降低。
    We present a comprehensive review of the rare syndrome visual form agnosia (VFA). We begin by documenting its history, including the origins of the term, and the first case study labelled as VFA. The defining characteristics of the syndrome, as others have previously defined it, are then described. The impairments, preserved aspects of visual perception, and areas of brain damage in 21 patients who meet these defining characteristics are described in detail, including which tests were used to verify the presence or absence of key symptoms. From this, we note important similarities along with notable areas of divergence between patients. Damage to the occipital lobe (20/21), an inability to recognise line drawings (19/21), preserved colour vision (14/21), and visual field defects (16/21) were areas of consistency across most cases. We found it useful to distinguish between shape and form as distinct constructs when examining perceptual abilities in VFA patients. Our observations suggest that these patients often exhibit difficulties in processing simplified versions of form. Deficits in processing orientation and size were uncommon. Motion perception and visual imagery were not widely tested for despite being typically cited as defining features of the syndrome - although in the sample described, motion perception was never found to be a deficit. Moreover, problems with vision (e.g., poor visual acuity and the presence of hemianopias/scotomas in the visual fields) are more common than we would have thought and may also contribute to perceptual impairments in patients with VFA. We conclude that VFA is a perceptual disorder where the visual system has a reduced ability to synthesise lines together for the purposes of making sense of what images represent holistically.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    Deep convolutional neural networks (DCNNs) have attracted considerable interest as useful devices and as possible windows into understanding perception and cognition in biological systems. In earlier work, we showed that DCNNs differ dramatically from human perceivers in that they have no sensitivity to global object shape. Here, we investigated whether those findings are symptomatic of broader limitations of DCNNs regarding the use of relations. We tested learning and generalization of DCNNs (AlexNet and ResNet-50) for several relations involving objects. One involved classifying two shapes in an otherwise empty field as same or different. Another involved enclosure. Every display contained a closed figure among contour noise fragments and one dot; correct responding depended on whether the dot was inside or outside the figure. The third relation we tested involved a classification that depended on which of two polygons had more sides. One polygon always contained a dot, and correct classification of each display depended on whether the polygon with the dot had a greater number of sides. We used DCNNs that had been trained on the ImageNet database, and we used both restricted and unrestricted transfer learning (connection weights at all layers could change with training). For the same-different experiment, there was little restricted transfer learning (82.2%). Generalization tests showed near chance performance for new shapes. Results for enclosure were at chance for restricted transfer learning and somewhat better for unrestricted (74%). Generalization with two new kinds of shapes showed reduced but above-chance performance (≈66%). Follow-up studies indicated that the networks did not access the enclosure relation in their responses. For the relation of more or fewer sides of polygons, DCNNs showed successful learning with polygons having 3-5 sides under unrestricted transfer learning, but showed chance performance in generalization tests with polygons having 6-10 sides. Experiments with human observers showed learning from relatively few examples of all of the relations tested and complete generalization of relational learning to new stimuli. These results using several different relations suggest that DCNNs have crucial limitations that derive from their lack of computations involving abstraction and relational processing of the sort that are fundamental in human perception.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    众所周知,观察者可以使用所谓的视觉集合的汇总统计来简化感知处理。假设是视觉系统提取视觉集合的均值和方差,而不是详细表示特征分布。但是最近使用称为特征分布学习的方法进行隐式测试的证据表明,保留了比摘要统计文献所显示的更多的分布细节。观察者还编码高阶统计信息,例如方向和颜色的特征分布的峰度。但是这种学习还没有被证明是视觉信息的更复杂的方面。在这里,我们测试了分心组合的形状学习,使用特征分布学习方法。使用线性化的圆形空间,我们发现,对于这个形状空间,不会发生形状的详细分布的学习,而观察者能够学习分布的均值和范围。先前的特征分布学习演示涉及比此处测试的更复杂的形状空间更简单的特征维度,因此,我们的发现可以揭示特征分布学习的重要边界条件。
    It is well known that observers can use so-called summary statistics of visual ensembles to simplify perceptual processing. The assumption has been that instead of representing feature distributions in detail the visual system extracts the mean and variance of visual ensembles. But recent evidence from implicit testing using a method called feature distribution learning showed that far more detail of the distributions is retained than the summary statistic literature indicates. Observers also encode higher-order statistics such as the kurtosis of feature distributions of orientation and color. But this sort of learning has not been shown for more intricate aspects of visual information. Here we tested the learning of distractor ensembles for shape, using the feature distribution learning method. Using a linearized circular shape space, we found that learning of detailed distributions of shape does not occur for this shape space while observers were able to learn the mean and range of the distributions. Previous demonstrations of feature distribution learning involved simpler feature dimensions than the more complex shape space tested here, and our findings may therefore reveal important boundary conditions of feature distribution learning.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号