Natural images

  • Article type: Journal Article
    Human pose, defined as the spatial relationships between body parts, carries instrumental information supporting the understanding of motion and action of a person. A substantial body of previous work has identified cortical areas responsive to images of bodies and different body parts. However, the neural basis underlying the visual perception of body part relationships has received less attention. To broaden our understanding of body perception, we analyzed high-resolution fMRI responses to a wide range of poses from over 4,000 complex natural scenes. Using ground-truth annotations and an application of three-dimensional (3D) pose reconstruction algorithms, we compared similarity patterns of cortical activity with similarity patterns built from human pose models with different levels of depth availability and viewpoint dependency. Targeting the challenge of explaining variance in complex natural image responses with interpretable models, we achieved statistically significant correlations between pose models and cortical activity patterns (though performance levels are substantially lower than the noise ceiling). We found that the 3D view-independent pose model, compared with two-dimensional models, better captures the activation from distinct cortical areas, including the right posterior superior temporal sulcus (pSTS). These areas, together with other pose-selective regions in the LOTC, form a broader, distributed cortical network with greater view-tolerance in more anterior patches. We interpret these findings in light of the computational complexity of natural body images, the wide range of visual tasks supported by pose structures, and possible shared principles for view-invariant processing between articulated objects and ordinary, rigid objects.

  • Article type: Journal Article
    Pareidolia are perceptions of recognizable images or meaningful patterns where none exist. In recent years, this phenomenon has been increasingly studied in healthy subjects and patients with neurological or psychiatric diseases. The current study examined pareidolia production in a group of 53 stroke patients and 82 neurologically healthy controls who performed a natural images task. We found a significant reduction of absolute pareidolia production in left- and right-hemispheric stroke patients, with right-hemispheric patients producing the fewest pareidolic outputs overall. Responses were categorized into 28 distinct categories, with 'Animal', 'Human', 'Face', and 'Body parts' being the most common, accounting for 72% of all pareidolia. Regarding the percentages of the different categories of pareidolia, we found a significant reduction in the percentage of 'Body parts' pareidolia in the left-hemispheric patient group as compared to the control group, while the percentage of this pareidolia type was not significantly reduced in right-hemispheric patients compared to healthy controls. These results support the hypothesis that pareidolia production may be influenced by local-global visual processing, with the left hemisphere being involved in local and detailed analytical visual processing to a greater extent. As such, a lesion to the right hemisphere, which is believed to be critical for global visual processing, might explain why right-hemispheric patients produced the fewest pareidolic outputs overall.

  • Article type: Journal Article
    Human photoreceptors consist of cones, rods, and melanopsin-expressing intrinsically photosensitive retinal ganglion cells (ipRGCs). First studied in circadian regulation and pupillary control, ipRGCs project to a variety of brain centers, suggesting a broader involvement beyond non-visual functions. IpRGC responses are stable and long-lasting, and have a particular codification of photoreceptor signals. In comparison with the transient and adaptive nature of cone and rod signals, ipRGCs' signaling might provide an ecological advantage to different attributes of color vision. Previous studies have indicated melanopsin's influence on visual responses, yet its contribution to color perception in humans remains debated. We summarized evidence and hypotheses (from physiology, psychophysics, and natural image statistics) about direct and indirect involvement of ipRGCs in human color vision, by first briefly assessing the current knowledge about the role of melanopsin and ipRGCs in vision and codification of spectral signals. We then approached the question of whether melanopsin activation elicits a color percept, discussing studies using the silent substitution method. Finally, we explored various avenues through which ipRGCs might impact color perception indirectly, such as through involvement in peripheral color matching, post-receptoral pathways, color constancy, long-term chromatic adaptation, and chromatic induction. While there is consensus about the role of ipRGCs in brightness perception, confirming their direct contribution to human color perception requires further investigation. We proposed potential approaches for future research, emphasizing the need for empirical validation and methodological thoroughness to elucidate the exact role of ipRGCs in human color vision.

  • Article type: Journal Article
    The cortical visual area, V4, has been considered to code contours that contribute to the intermediate-level representation of objects. The neural responses to the complex contour features intrinsic to natural contours are expected to clarify the essence of the representation. To approach the cortical coding of natural contours, we investigated the simultaneous coding of multiple contour features in monkey (Macaca fuscata) V4 neurons and their population-level representation. A substantial number of neurons showed significant tuning for two or more features such as curvature and closure, indicating that a substantial number of V4 neurons simultaneously code multiple contour features. A large portion of the neurons responded vigorously to acutely curved contours that surrounded the center of the classical receptive field, suggesting that V4 neurons tend to code prominent features of object contours. The analysis of mutual information (MI) between the neural responses and each contour feature showed that most neurons exhibited similar magnitudes for each type of MI, indicating that the responses of many neurons depended on multiple contour features. We next examined the population-level representation by using multidimensional scaling analysis. The neural preferences for the multiple contour features and for natural stimuli compared with silhouette stimuli increased along the primary and secondary axes, respectively, indicating the contribution of the multiple contour features and surface textures to the population responses. Our analyses suggested that V4 neurons simultaneously code multiple contour features in natural images and represent contour and surface properties at the population level.
    Significance statement: The contours of natural objects are often complex, yet the visual system extracts their features and represents their shapes efficiently. Neurons in the intermediate-level visual cortex, V4, play a crucial role in the representation of natural contours. Analyzing electrophysiological data, we found that V4 neurons simultaneously code multiple contour features, such as curvature, closure, and orientation, and represent prominent contours such as corners and bulges. For example, many neurons responded to acute curvature enclosing the extent of an object. Mutual information and population analyses showed that the responses of many neurons depend on multiple contour features and that the population represents both contours and surfaces. A population of V4 neurons appears to code complex yet prominent contours to represent natural objects.

  • Article type: Journal Article
    Constructing computational decoding models to account for the cortical representation of semantic information plays a crucial role in understanding visual perception. The human visual system processes interactive relationships among different objects when perceiving the semantic contents of natural visions. However, the existing semantic decoding models commonly regard categories as completely separate and independent visually and semantically and rarely consider the relationships from prior information. In this work, a novel semantic graph learning model was proposed to decode multiple semantic categories of perceived natural images from brain activity. The proposed model was validated on the functional magnetic resonance imaging data collected from five normal subjects while viewing 2750 natural images comprising 52 semantic categories. The results showed that the Graph Neural Network-based decoding model achieved higher accuracies than other deep neural network models. Moreover, the co-occurrence probability among semantic categories showed a significant correlation with the decoding accuracy. Additionally, the results suggested that semantic content organized in a hierarchical way with higher visual areas was more closely related to the internal visual experience. Together, this study provides a superior computational framework for multi-semantic decoding that supports the visual integration mechanism of semantic processing.

  • Article type: Journal Article
    Despite the natural occurrence of global and local daylight changes in natural scenes, the human visual system typically adapts well to these changes and develops stable colour perception. In a previous study, the influence of daylight characterized by its Correlated Colour Temperatures (CCT) on different chromatic descriptors was analysed (Ojeda et al., 2017). The results showed that chromatic information is almost constant for CCT values above 14,000 K, with local extremes occurring in the range of low CCTs. The aim of this work is to extend the analysis of the CCT dependence of the illuminant to those that consider the spatio-chromatic structure, including second order descriptors (gradients, spectral slope, spectral signature, and PCA) and higher order descriptors (kurtosis, skewness, and number of relevant colours). Our results show that most of the descriptors exhibit horizontal asymptotic behaviour for CCTs above 15,000 K and local extremes in the range of 3,900 K-9,600 K. For those descriptors that could be analysed in CIELAB space, sufficient statistical evidence was obtained to consider skewness, kurtosis, and the independent spectral slopes of the L* channel as equal in the range of CCTs used. However, the slight variations in spectral signatures and the directions of the principal components when applying PCA to image patches are not statistically significant and cannot be considered equal under different illuminants. The number of relevant colours (NRC) exhibits sensitivity to temperature variations and behaves similarly to the other descriptors, due to its small number.

  • Article type: Journal Article
    Binocular disparity is an important cue to three-dimensional shape. We assessed the contribution of this cue to the reliability and consistency of depth in stereoscopic photographs of natural scenes. Observers viewed photographs of cluttered scenes while adjusting a gauge figure to indicate the apparent three-dimensional orientation of the surfaces of objects. The gauge figure was positioned on the surfaces of objects at multiple points in the scene, and settings were made under monocular and binocular, stereoscopic viewing. Settings were used to create a depth relief map, indicating the apparent three-dimensional structure of the scene. We found that binocular cues increased the magnitude of apparent depth, the reliability of settings across repeated measures, and the consistency of perceived depth across participants. These results show that binocular cues make an important contribution to the precise and accurate perception of depth in natural scenes that contain multiple pictorial cues.

  • Article type: Journal Article
    The primary visual cortex signals the onset of light and dark stimuli with ON and OFF cortical pathways. Here, we demonstrate that both pathways generate similar response increments to large homogeneous surfaces and their response average increases with surface brightness. We show that, in cat visual cortex, response dominance from ON or OFF pathways is bimodally distributed when stimuli are smaller than one receptive field center but unimodally distributed when they are larger. Moreover, whereas small bright stimuli drive opposite responses from ON and OFF pathways (increased versus suppressed activity), large bright surfaces drive similar response increments. We show that this size-brightness relation emerges because strong illumination increases the size of light surfaces in nature and both ON and OFF cortical neurons receive input from ON thalamic pathways. We conclude that visual scenes are perceived as brighter when the average response increments from ON and OFF cortical pathways become stronger.

  • Article type: Journal Article
    Neural mechanisms of face perception are predominantly studied in well-controlled experimental settings that involve random stimulus sequences and fixed eye positions. Although powerful, the employed paradigms are far from what constitutes natural vision. Here, we demonstrate the feasibility of ecologically more valid experimental paradigms using natural viewing behaviour, by combining a free viewing paradigm on natural scenes, free of photographer bias, with advanced data processing techniques that correct for overlap effects and co-varying non-linear dependencies of multiple eye movement parameters. We validate this approach by replicating classic N170 effects in neural responses, triggered by fixation onsets (fixation event-related potentials [fERPs]). Importantly, besides finding a strong correlation between the two experiments, our more natural stimulus paradigm yielded smaller variability between subjects than the classic setup. Moving beyond classic temporal and spatial effect locations, our experiment furthermore revealed previously unknown signatures of face processing: these include category-specific modulation of the event-related potential (ERP) amplitude even before fixation onset, as well as adaptation effects across subsequent fixations depending on their history.

  • Article type: Journal Article
    Perceiving 3D structure in natural images is an immense computational challenge for the visual system. While many previous studies focused on the perception of rigid 3D objects, we applied a novel method to a common set of non-rigid objects: static images of the human body in the natural world. We investigated to what extent human ability to interpret 3D poses in natural images depends on the typicality of the underlying 3D pose and the informativeness of the viewpoint. Using a novel 2AFC pose matching task, we measured how well subjects were able to match a target natural pose image with one of two synthetic comparison body images from a different viewpoint: one was rendered with the same 3D pose parameters as the target, while the other was a distractor rendered with noise added to the joint angles. We found that performance for typical poses was measurably better than for atypical poses; however, we found no significant difference between informative and less informative viewpoints. Further comparisons of 2D and 3D pose matching models on the same task showed that 3D body knowledge is particularly important when interpreting images of atypical poses. These results suggested that human ability to interpret 3D poses depends on pose typicality but not viewpoint informativeness, and that humans probably use prior knowledge of 3D pose structures.