spatial attention

空间注意力
  • 文章类型: Journal Article
    医学图像的手动注释对于临床专家来说是耗时的;因此,可靠的自动分割将是处理大型医疗数据集的理想方法。在本文中,我们对妊娠早期超声(US)扫描中两个基本测量的检测和分割感兴趣:Nuchal半透明(NT)和冠部长度(CRL)。形状可能会有很大的变化,胎儿超声扫描中解剖结构的位置或大小。我们提出了一种新方法,即早期妊娠超声CRL和NT分割的密集注意感知网络(DA2Net),依靠强大的注意力机制和密集连接的网络来编码特征大小的变化。我们的结果表明,提出的D2ANet与专家手册注释提供了高像素协议(平均JSC=84.21)。
    Manual annotation of medical images is time consuming for clinical experts; therefore, reliable automatic segmentation would be the ideal way to handle large medical datasets. In this paper, we are interested in detection and segmentation of two fundamental measurements in the first trimester ultrasound (US) scan: Nuchal Translucency (NT) and Crown Rump Length (CRL). There can be a significant variation in the shape, location or size of the anatomical structures in the fetal US scans. We propose a new approach, namely Densely Attentional-Aware Network for First Trimester Ultrasound CRL and NT Segmentation (DA2Net), to encode variation in feature size by relying on the powerful attention mechanism and densely connected networks. Our results show that the proposed D2ANet offers high pixel agreement (mean JSC = 84.21) with expert manual annotations.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    可变形图像配准(DIR)在许多临床任务中起着重要作用,在过去的几年中,深度学习在DIR方面取得了重大进展。
    提出一种用于单峰图像配准的快速多尺度无监督可变形图像配准(称为FMIRNet)方法。
    我们设计了一个多尺度融合模块,通过组合和细化三个尺度的变形场,来估计大位移场。在我们的融合模块中采用了空间注意力机制来逐个像素地加权位移场。除均方误差(MSE)外,我们还在训练阶段增加了结构相似性(ssim)度量,以增强变形图像和固定图像之间的结构一致性。
    我们的注册方法在EchoNet上进行了评估,混沌和SLIVER,在SSIM方面确实有了性能改进,NCC和NMI得分。此外,我们将FMIRNet集成到分段网络中(FCN,UNet)在我们的联合学习框架中使用很少的手动注释来增强数据集上的分段任务。实验结果表明,联合分割方法在Dice,HD和ASSD评分。
    我们提出的FMIRNet对于大变形估计是有效的,并且其注册能力在联合注册和分割框架中具有通用性和鲁棒性,可以为训练分割任务生成可靠的标签。
    UNASSIGNED: Deformable image registration (DIR) plays an important part in many clinical tasks, and deep learning has made significant progress in DIR over the past few years.
    UNASSIGNED: To propose a fast multiscale unsupervised deformable image registration (referred to as FMIRNet) method for monomodal image registration.
    UNASSIGNED: We designed a multiscale fusion module to estimate the large displacement field by combining and refining the deformation fields of three scales. The spatial attention mechanism was employed in our fusion module to weight the displacement field pixel by pixel. Except mean square error (MSE), we additionally added structural similarity (ssim) measure during the training phase to enhance the structural consistency between the deformed images and the fixed images.
    UNASSIGNED: Our registration method was evaluated on EchoNet, CHAOS and SLIVER, and had indeed performance improvement in terms of SSIM, NCC and NMI scores. Furthermore, we integrated the FMIRNet into the segmentation network (FCN, UNet) to boost the segmentation task on a dataset with few manual annotations in our joint leaning frameworks. The experimental results indicated that the joint segmentation methods had performance improvement in terms of Dice, HD and ASSD scores.
    UNASSIGNED: Our proposed FMIRNet is effective for large deformation estimation, and its registration capability is generalizable and robust in joint registration and segmentation frameworks to generate reliable labels for training segmentation tasks.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    在这项研究中,我们评估了可预测性是否影响面部表情的早期处理.为了实现这一点,我们测量了与视觉处理相关的侧方潜伏期早期和中期事件相关电位.22名参与者被展示成对的双边恐惧,快乐,生气,或者打乱的脸。参与者被要求在空间参与的一侧识别愤怒的面孔,而忽略快乐,恐惧,和乱七八糟的脸。每个块都以“HAPPY或FEARFUL”一词开头,该单词告知参与者这些面孔出现的可能性。发现了侧向化P1的注意力效应,这表明情绪不会差异地调节P1,与情绪有关的预测也是如此。成对比较表明,当空间无人看管时,与预测的恐惧面孔和未预测的快乐面孔相比,未预测的恐惧面孔产生了更大的横向N170振幅。最后,对面部的关注增加了侧向EPN振幅,恐惧的表达和低可预测性也是如此。因此,我们证明了N170和EPN对与面部表情相关的自上而下的预测很敏感,并且低可预测性似乎在无人看管时特别影响恐惧面孔的早期编码,可能是为了开始注意力捕捉。
    In this study, we assessed whether predictability affected the early processing of facial expressions. To achieve this, we measured lateralised early- and mid-latency event-related potentials associated with visual processing. Twenty-two participants were shown pairs of bilaterally presented fearful, happy, angry, or scrambled faces. Participants were required to identify angry faces on a spatially attended side whilst ignoring happy, fearful, and scrambled faces. Each block began with the word HAPPY or FEARFUL which informed participants the probability at which these faces would appear. Attention effects were found for the lateralised P1, suggesting that emotions do not modulate the P1 differentially, nor do predictions relating to emotions. Pairwise comparisons demonstrated that, when spatially unattended, unpredicted fearful faces produced larger lateralised N170 amplitudes compared to predicted fearful faces and unpredicted happy faces. Finally, attention towards faces increased lateralised EPN amplitudes, as did both fearful expressions and low predictability. Thus, we demonstrate that the N170 and EPN are sensitive to top-down predictions relating to facial expressions and that low predictability appears to specifically affect the early encoding of fearful faces when unattended, possibly to initiate attentional capture.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    奖励价值和选择性注意都可以在处理的最早阶段增强感官刺激的代表性。奖励驱动和注意机制是否以及如何相互作用以影响感知仍在争论中。在这里,我们问奖励值和选择性注意力之间的相互作用是否取决于传达奖励信息的感觉方式。人类参与者首先在调节阶段学习了单模态视觉和听觉刺激的奖励值。随后,他们对包含先前奖励的刺激的双峰刺激执行了目标检测任务,两者,或者两种模式都没有。此外,要求参与者将注意力集中在一侧,只报告参加一侧的目标。我们的结果表明,空间注意力对视觉和听觉事件相关电位(ERP)有很强的调节作用。我们没有发现奖励值的主要影响,但重要的是,我们发现了交互作用,因为ERP的注意力调节强度受到奖励值的显着影响。当针对每种模态分别检查奖励效应时,发现听觉价值驱动的注意力调制主导了ERP效应,而视觉奖励价值本身没有影响,可能是由于其对目标处理的干扰。这些结果激发了一个两阶段模型,其中首先在特定于每种感官模态的局部优先级图上增强了高回报刺激的显著性,在第二阶段,奖励值和自上而下的注意机制被整合到整个感官模式中,以影响感知。
    Reward value and selective attention both enhance the representation of sensory stimuli at the earliest stages of processing. It is still debated whether and how reward-driven and attentional mechanisms interact to influence perception. Here we ask whether the interaction between reward value and selective attention depends on the sensory modality through which the reward information is conveyed. Human participants first learned the reward value of uni-modal visual and auditory stimuli during a conditioning phase. Subsequently, they performed a target detection task on bimodal stimuli containing a previously rewarded stimulus in one, both, or neither of the modalities. Additionally, participants were required to focus their attention on one side and only report targets on the attended side. Our results showed a strong modulation of visual and auditory event-related potentials (ERPs) by spatial attention. We found no main effect of reward value but importantly we found an interaction effect as the strength of attentional modulation of the ERPs was significantly affected by the reward value. When reward effects were examined separately with respect to each modality, auditory value-driven modulation of attention was found to dominate the ERP effects whereas visual reward value on its own led to no effect, likely due to its interference with the target processing. These results inspire a two-stage model where first the salience of a high reward stimulus is enhanced on a local priority map specific to each sensory modality, and at a second stage reward value and top-down attentional mechanisms are integrated across sensory modalities to affect perception.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    过去的工作揭示了视觉工作记忆中空间注意力和存储之间的紧密关系。但是在空间上参与一个项目等同于工作记忆编码吗?在这里,在两项成人研究(N=39;N=33)中,我们追踪了脑电图(EEG)的空间注意力和工作记忆编码特征,同时独立操作记忆项目的数量和注意力的空间范围.空间注意力的神经测量跟踪了所关注区域的位置和大小,而与编码到工作记忆中的个性化项目的数量无关。同时,存储在工作记忆中的项目数的多变量解码对空间注意力的广度和位置的变化不敏感。最后,代表性相似性分析为对存储物品的空间范围不敏感的纯负载信号提供了收敛证据。因此,尽管空间注意力是视觉工作记忆的持久伙伴,它在功能上与工作记忆中个性化表示的选择和维护分离。
    Past work reveals a tight relationship between spatial attention and storage in visual working memory. But is spatially attending an item tantamount to working memory encoding? Here, we tracked electroencephalography (EEG) signatures of spatial attention and working memory encoding while independently manipulating the number of memory items and the spatial extent of attention in two studies of adults (N = 39; N = 33). Neural measures of spatial attention tracked the position and size of the attended area independent of the number of individuated items encoded into working memory. At the same time, multivariate decoding of the number of items stored in working memory was insensitive to variations in the breadth and position of spatial attention. Finally, representational similarity analyses provided converging evidence for a pure load signal that is insensitive to the spatial extent of the stored items. Thus, although spatial attention is a persistent partner of visual working memory, it is functionally dissociable from the selection and maintenance of individuated representations in working memory.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    有证据表明,皮层下结构在高级认知功能中起作用,例如空间注意力的分配。虽然人类有大量证据表明后α波段振荡是由空间注意力调制的,关于皮质下区域如何促成这些振荡调制,特别是在不同的认知挑战条件下。在这项研究中,我们结合了MEG和结构MRI数据,通过采用具有不同感知负荷水平的提示空间注意力范式,研究了皮质下结构在控制注意力资源分配中的作用.我们询问丘脑和基底神经节的体积测量的半球偏侧化是否可以预测α带功率的半球调制。苍白球的横向不对称,尾状核,和丘脑预测后验α振荡的注意力相关调制。当感知负荷被施加到目标上并且牵张器是显着的尾状核不对称性时,可以预测α带调制。当任一目标具有高负载时,苍白球可以预测α带调制,或者分心者很突出,但不是两者都有。最后,当任务的两个组成部分都没有感知要求时,丘脑的不对称性预测了α带调制。除了提供对皮层下电路的新见解,用空间注意力控制阿尔法振荡,我们的发现也可能具有临床应用价值.我们提供了一个框架,可用于检测与神经系统疾病相关的皮层下区域的结构变化如何反映在振荡脑活动的调节中。
    Evidence suggests that subcortical structures play a role in high-level cognitive functions such as the allocation of spatial attention. While there is abundant evidence in humans for posterior alpha band oscillations being modulated by spatial attention, little is known about how subcortical regions contribute to these oscillatory modulations, particularly under varying conditions of cognitive challenge. In this study, we combined MEG and structural MRI data to investigate the role of subcortical structures in controlling the allocation of attentional resources by employing a cued spatial attention paradigm with varying levels of perceptual load. We asked whether hemispheric lateralization of volumetric measures of the thalamus and basal ganglia predicted the hemispheric modulation of alpha-band power. Lateral asymmetry of the globus pallidus, caudate nucleus, and thalamus predicted attention-related modulations of posterior alpha oscillations. When the perceptual load was applied to the target and the distractor was salient caudate nucleus asymmetry predicted alpha-band modulations. Globus pallidus was predictive of alpha-band modulations when either the target had a high load, or the distractor was salient, but not both. Finally, the asymmetry of the thalamus predicted alpha band modulation when neither component of the task was perceptually demanding. In addition to delivering new insight into the subcortical circuity controlling alpha oscillations with spatial attention, our finding might also have clinical applications. We provide a framework that could be followed for detecting how structural changes in subcortical regions that are associated with neurological disorders can be reflected in the modulation of oscillatory brain activity.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    皮质深处结构大小的不对称解释了大脑中的α振荡如何对注意力的转移做出反应。
    Asymmetries in the size of structures deep below the cortex explain how alpha oscillations in the brain respond to shifts in attention.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    我们学习环境中嵌入的规律性的能力是我们认知系统的一个基本方面。这种统计学习依赖于注意力吗?关于这个主题的研究很少,并且产生了不同的发现。在这项预先注册的研究中,我们研究了空间注意力在统计学习中的作用,特别是在学过的干扰物位置抑制中。这种现象是指在视觉搜索过程中,与低概率位置相比,参与者在高概率位置忽略显著干扰因素方面表现得更好,即在概率失衡停止后很长时间内,这种偏差会持续存在.参与者搜索形状单例目标,有时会出现颜色单例干扰物。在学习阶段,与低概率位置相比,彩色单例干扰物更可能出现在高概率位置。至关重要的是,我们通过让实验组在搜索显示之前将注意力集中在目标位置来操纵空间注意力,使用100%信息的空间前序,而对照组是中性的,无信息的提示。在随后的测试阶段,彩色单例干扰物同样可能出现在任何位置,并且没有提示。不出所料,中性线索组的结果重复了之前的发现.至关重要的是,对于信息提示组,当注意力从它转移时(在学习期间),来自干扰物的干扰是最小的,并且在测试期间没有观察到统计学习。审判间启动占了学习过程中发现的小统计学习效果。这些发现表明,视觉搜索中的统计学习需要注意。
    Our ability to learn the regularities embedded in our environment is a fundamental aspect of our cognitive system. Does such statistical learning depend on attention? Research on this topic is scarce and has yielded mixed findings. In this preregistered study, we examined the role of spatial attention in statistical learning, and specifically in learned distractor-location suppression. This phenomenon refers to the finding that during visual search, participants are better at ignoring a salient distractor at a high-probability location than at low-probability locations - a bias persisting long after the probability imbalance has ceased. Participants searched for a shape-singleton target and a color-singleton distractor was sometimes present. During the learning phase, the color-singleton distractor was more likely to appear in the high-probability location than in the low-probability locations. Crucially, we manipulated spatial attention by having the experimental group focus their attention on the target\'s location in advance of the search display, using a 100%-informative spatial precue, while the control group was presented with a neutral, uninformative cue. During the subsequent test phase, the color-singleton distractor was equally likely to appear at any location and there were no cues. As expected, the results for the neutral-cue group replicated previous findings. Crucially, for the informative-cue group, interference from the distractor was minimal when attention was diverted from it (during learning) and no statistical learning was observed during test. Intertrial priming accounted for the small statistical-learning effect found during learning. These findings show that statistical learning in visual search requires attention.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    傅里叶重叠显微术(FPM)是一种基于光学原理的显微成像技术。它采用傅立叶光学来分离和组合来自样品的不同光学信息。然而,在成像过程中引入的噪声往往导致重建图像的分辨率较差。本文设计了一种基于残差局部混合网络的方法来提高傅立叶重叠重建图像的质量。通过将通道注意力和空间注意力纳入FPM重建过程,提高了网络重构的效率,减少了重构时间。此外,高斯扩散模型的引入进一步减少了相干伪影,提高了图像重建质量。对比实验结果表明,该网络具有较好的重建质量,在主观观察和客观定量评价方面都优于现有方法。
    Fourier Ptychographic Microscopy (FPM) is a microscopy imaging technique based on optical principles. It employs Fourier optics to separate and combine different optical information from a sample. However, noise introduced during the imaging process often results in poor resolution of the reconstructed image. This article has designed an approach based on a residual local mixture network to improve the quality of Fourier ptychographic reconstruction images. By incorporating channel attention and spatial attention into the FPM reconstruction process, the network enhances the efficiency of the network reconstruction and reduces the reconstruction time. Additionally, the introduction of the Gaussian diffusion model further reduces coherent artifacts and improves image reconstruction quality. Comparative experimental results indicate that this network achieves better reconstruction quality, and outperforming existing methods in both subjective observation and objective quantitative evaluation.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    这项工作旨在通过开发用于实际临床CBCT投影数据的深度学习(DL)方法来改善有限角度(LA)锥形束计算机断层扫描(CBCT),这是第一个基于临床投影数据的LA-CBCT的可行性研究,据我们所知.在放射治疗(RT)中,CBCT通常用作患者设置的机载成像模态。与诊断性CT相比,CBCT具有较长的采集时间,例如,一个完整的360°旋转60秒,受到运动伪影的影响。因此,LA-CBCT,如果可以实现,对RT的目的非常感兴趣,除了辐射剂量外,它还按比例减少了扫描时间。然而,LA-CBCT遭受严重的楔形伪影和图像失真。针对真实的临床预测数据,我们已经探索了各种DL方法,例如图像/数据/混合域方法,并最终开发了一种所谓的结构增强注意力网络(SEA-Net)方法,该方法在我们实施的DL方法中具有来自临床投影数据的最佳图像质量。具体来说,提出的SEA-Net采用专门的结构增强子网络来促进纹理保存。观察到重建图像中楔形伪影的分布是不均匀的,空间注意模块用于强调相关区域,而忽略不相关区域,这导致更准确的纹理恢复。
    This work aims to improve limited-angle (LA) cone beam computed tomography (CBCT) by developing deep learning (DL) methods for real clinical CBCT projection data, which is the first feasibility study of clinical-projection-data-based LA-CBCT, to the best of our knowledge. In radiation therapy (RT), CBCT is routinely used as the on-board imaging modality for patient setup. Compared to diagnostic CT, CBCT has a long acquisition time, e.g., 60 seconds for a full 360° rotation, which is subject to the motion artifact. Therefore, the LA-CBCT, if achievable, is of the great interest for the purpose of RT, for its proportionally reduced scanning time in addition to the radiation dose. However, LA-CBCT suffers from severe wedge artifacts and image distortions. Targeting at real clinical projection data, we have explored various DL methods such as image/data/hybrid-domain methods and finally developed a so-called Structure-Enhanced Attention Network (SEA-Net) method that has the best image quality from clinical projection data among the DL methods we have implemented. Specifically, the proposed SEA-Net employs a specialized structure enhancement sub-network to promote texture preservation. Based on the observation that the distribution of wedge artifacts in reconstruction images is non-uniform, the spatial attention module is utilized to emphasize the relevant regions while ignores the irrelevant ones, which leads to more accurate texture restoration.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号