Signal Detection Theory

信号检测理论
  • 文章类型: Journal Article
    唤醒和动机相互作用,深刻地影响行为。例如,经验告诉我们,当我们有适当的动机时,我们有一定的能力来控制我们的唤醒,例如在驾驶机动车时保持清醒。然而,对唤醒和动机如何共同影响决策计算知之甚少,包括动物是否和如何,如啮齿动物,使他们的觉醒状态适应他们的需要。这里,我们开发并显示了听觉结果,基于特征的,具有间歇性转移任务效用的持续注意力任务。我们使用瞳孔大小来估计各种状态的唤醒,并应用量身定制的信号检测理论,危险函数,和大量小鼠中的累积至结合建模方法。我们发现,与瞳孔相关的唤醒和任务效用都对任务绩效的多个方面产生重大影响。尽管在公用事业条件下持续存在大量的唤醒波动,当任务效用较高时,小鼠的唤醒部分稳定在中等和最佳水平附近。行为分析表明,在高任务效用期间,行为的多个要素得到改善,唤醒会影响一些,但不是全部,他们的。具体来说,唤醒影响感觉证据积累的可能性和时间尺度,但不影响参加时每个时间步长积累的证据数量。总之,结果建立了唤醒的特定决策计算特征,动机,以及他们在注意力上的互动。这么做,我们提供了一个实验和分析框架,用于研究神经典型大脑和注意力缺陷/多动障碍等疾病中的唤醒自我调节。
    Arousal and motivation interact to profoundly influence behavior. For example, experience tells us that we have some capacity to control our arousal when appropriately motivated, such as staying awake while driving a motor vehicle. However, little is known about how arousal and motivation jointly influence decision computations, including if and how animals, such as rodents, adapt their arousal state to their needs. Here, we developed and show results from an auditory, feature-based, sustained-attention task with intermittently shifting task utility. We use pupil size to estimate arousal across a wide range of states and apply tailored signal-detection theoretic, hazard function, and accumulation-to-bound modeling approaches in a large cohort of mice. We find that pupil-linked arousal and task utility both have major impacts on multiple aspects of task performance. Although substantial arousal fluctuations persist across utility conditions, mice partially stabilize their arousal near an intermediate and optimal level when task utility is high. Behavioral analyses show that multiple elements of behavior improve during high task utility and that arousal influences some, but not all, of them. Specifically, arousal influences the likelihood and timescale of sensory evidence accumulation but not the quantity of evidence accumulated per time step while attending. In sum, the results establish specific decision-computational signatures of arousal, motivation, and their interaction in attention. So doing, we provide an experimental and analysis framework for studying arousal self-regulation in neurotypical brains and in diseases such as attention-deficit/hyperactivity disorder.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    Carragher和Hancock(2023)研究了在自动面部识别系统(AFRS)的帮助下,个人在一对一面部匹配任务中的表现。在五个预先注册的实验中,他们发现了辅助性能欠佳的证据,由于AFRS辅助的个人始终无法达到AFRS单独实现的绩效水平。当前的研究重新分析了这些数据(卡拉格和汉考克,2023),将自动化辅助性能与一系列协同决策的统计模型进行基准测试,跨越一系列的效率水平。使用贝叶斯分层信号检测模型的分析表明,协作性能非常低效,最接近所测试的自动化依赖的最次优模型。这种结果模式概括了以前关于一系列视觉搜索中次优的人类与自动化交互的报告,目标检测,感官辨别,和数值估计决策任务。当前的研究是第一个在一对一面部匹配任务中提供自动化辅助性能基准的研究。
    Carragher and Hancock (2023) investigated how individuals performed in a one-to-one face matching task when assisted by an Automated Facial Recognition System (AFRS). Across five pre-registered experiments they found evidence of suboptimal aided performance, with AFRS-assisted individuals consistently failing to reach the level of performance the AFRS achieved alone. The current study reanalyses these data (Carragher and Hancock, 2023), to benchmark automation-aided performance against a series of statistical models of collaborative decision making, spanning a range of efficiency levels. Analyses using a Bayesian hierarchical signal detection model revealed that collaborative performance was highly inefficient, falling closest to the most suboptimal models of automation dependence tested. This pattern of results generalises previous reports of suboptimal human-automation interaction across a range of visual search, target detection, sensory discrimination, and numeric estimation decision-making tasks. The current study is the first to provide benchmarks of automation-aided performance in the one-to-one face matching task.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    几十年来,目击者记忆研究的目标是最大限度地减少无辜嫌疑人被错误识别的机会。然而,这不是唯一的目标。部分接收器操作特性(ROC)曲线提供了一种识别阵容程序的方式,该阵容程序保持低的误报率,同时还最大化命中率。最近,已经尝试将ROC曲线扩展到有意设计公平阵容以避免的高误报率区域。这些新的完整的ROC可以为警察提供一种方法,以公平的阵容规避填充物提供的保护。此外,这些扩展ROC曲线的尝试并非基于潜在诊断信号的数学相干模型.在这篇文章中,我们通过实证证明了这种缺乏坚实的基础如何导致可疑的结论,例如目击者拥有预见性,能够可靠地识别他们将来会看到的犯罪的人。
    For decades, eyewitness memory research has had the worthy goal of minimizing the chances that an innocent suspect is falsely identified. However, this is not the only goal. Partial receiver operating characteristic (ROC) curves provide a way to identify lineup procedures that keep the false alarm rate low while also maximizing the hit rate. Recently, there have been attempts to extend the ROC curve into high false alarm rate regions that fair lineups are intentionally designed to avoid. These new full ROCs could provide a way for the police to circumvent the protections offered by fillers in a fair lineup. Moreover, these attempts to extend the ROC curve are not based on a mathematically coherent model of latent diagnostic signals. In this article, we empirically demonstrate how this lack of a solid foundation can lead to dubious conclusions, such as eyewitnesses possessing precognition and being able to reliably identify the person they will see commit a crime in the future.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    人类在他们的决定中体验到自信的感觉。在感知中,这些感觉通常是准确的-我们倾向于对正确的决定更有信心。人们对决策准确性的洞察力被称为元认知敏感度。当前流行的估计元认知敏感度的方法受到解释歧义的影响,因为它们假设人们在反复暴露于单个输入时具有不同经验的正常形状分布。如果违反了这个常态假设,计算会错误地低估元认知敏感度。这里,我们描述了一种估计元认知敏感度的方法,该方法对违反正态假设更稳健。这种改进的方法可以很容易地添加到标准的行为实验中,和作者提供的Matlab代码来帮助研究人员实现这些分析和实验程序。
    Humans experience feelings of confidence in their decisions. In perception, these feelings are typically accurate - we tend to feel more confident about correct decisions. The degree of insight people have into the accuracy of their decisions is known as metacognitive sensitivity. Currently popular methods of estimating metacognitive sensitivity are subject to interpretive ambiguities because they assume people have normally shaped distributions of different experiences when they are repeatedly exposed to a single input. If this normality assumption is violated, calculations can erroneously underestimate metacognitive sensitivity. Here, we describe a means of estimating metacognitive sensitivity that is more robust to violations of the normality assumption. This improved method can easily be added to standard behavioral experiments, and the authors provide Matlab code to help researchers implement these analyses and experimental procedures.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    当展示阵容时,证人的任务是指认罪犯或表明罪犯不在场。然后,证人对该决定进行信心判断。但是证人如何做出这些决定和判断?根据绝对判断模型,目击者确定哪个阵容成员提供了最强的记忆匹配,并根据这个MAX阵容成员的绝对实力确定他们的身份决定和信心判断。相反,相对判断模型建议证人确定哪个阵容成员提供了最强的匹配记忆,然后将他们的身份决定和信心判断基于MAX阵容成员相对于其余阵容成员的相对强度。我们采取了一种关键的测试方法来测试这两个模型的预测。正如绝对判断模型所预测的那样,但与相对判断模型的预测相反,证人更有可能正确地拒绝低相似性阵容比高相似性阵容(实验1),与公平阵容相比,更有可能拒绝有偏见的阵容(实验2)。同样,证人拒绝低相似性阵容的信心高于高相似性阵容(实验1),拒绝偏见阵容的信心高于公平阵容(实验2)。只有一个模式与相对模型一致,而与绝对模型不一致:来自偏倚阵容的可疑识别比来自公平阵容的可疑识别具有更大的置信度(实验2)。结果表明,绝对判断模型比相对判断模型更好地预测证人决策,而纯粹的相对判断模型是不可行的。
    When presented with a lineup, the witness is tasked with identifying the culprit or indicating that the culprit is not present. The witness then qualifies the decision with a confidence judgment. But how do witnesses go about making these decisions and judgments? According to absolute-judgment models, witnesses determine which lineup member provides the strongest match to memory and base their identification decision and confidence judgment on the absolute strength of this MAX lineup member. Conversely, relative-judgment models propose that witnesses determine which lineup member provides the strongest match to memory and then base their identification decision and confidence judgment on the relative strength of the MAX lineup member compared to the remaining lineup members. We took a critical test approach to test the predictions of both models. As predicted by the absolute-judgment model, but contrary to the predictions of the relative-judgment model, witnesses were more likely to correctly reject low-similarity lineups than high-similarity lineups (Experiment 1), and more likely to reject biased lineups than fair lineups (Experiment 2). Likewise, witnesses rejected low-similarity lineups with greater confidence than high-similarity lineups (Experiment 1) and rejected biased lineups with greater confidence than fair lineups (Experiment 2). Only a single pattern was consistent with the relative model and inconsistent with the absolute model: suspect identifications from biased lineups were made with greater confidence than suspect identifications from fair lineups (Experiment 2). The results suggest that absolute-judgment models better predict witness decision-making than do relative-judgment models and that pure relative-judgment models are unviable.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    判别性度量d\'在心理学中被广泛用于独立于反应偏差来估计灵敏度。估计d'的常规方法涉及从命中率和误报率的变换。当性能完美时,必须应用校正方法来计算d',但是这些修正扭曲了估计。在三个模拟研究中,我们表明,d'估计中的失真可能来自实验设计的其他属性(试验次数,样本量,样本方差,任务难度),当结合校正方法的应用时,使任何特定实验设计中的d\'失真复杂,并可能在最坏的情况下(I型和II型错误)误导统计推断。为了解决这个问题,我们建议研究人员模拟d'估计来探索设计选择的影响,给定预期或观察到的数据。介绍了一种RShiny应用程序,用于估计d'失真,为研究人员提供识别失真并采取措施将其影响降至最低的手段。
    The discriminability measure d \' is widely used in psychology to estimate sensitivity independently of response bias. The conventional approach to estimate d \' involves a transformation from the hit rate and the false-alarm rate. When performance is perfect, correction methods must be applied to calculate d \' , but these corrections distort the estimate. In three simulation studies, we show that distortion in d \' estimation can arise from other properties of the experimental design (number of trials, sample size, sample variance, task difficulty) that, when combined with application of the correction method, make d \' distortion in any specific experiment design complex and can mislead statistical inference in the worst cases (Type I and Type II errors). To address this problem, we propose that researchers simulate d \' estimation to explore the impact of design choices, given anticipated or observed data. An R Shiny application is introduced that estimates d \' distortion, providing researchers the means to identify distortion and take steps to minimize its impact.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    研究人员对高配错误的原因提出异议,低NoGo目标检测任务,比如持续关注响应任务(SART)。一些研究人员认为SART中的错误是由于感知解耦,参与者不知道刺激身份。这种缺乏外部意识会导致错误的反应。其他研究人员认为,SART中的大多数错误是由于反应宽大处理,不是感知脱钩。响应延迟可能会使最初不知道刺激身份的参与者,感知解耦,意识到刺激的身份,或在感知上重新耦合。如果,然而,将刺激呈现时间缩短到刺激识别所需的最小值,并且用结构化的掩模破坏刺激,那么即使有反应延迟,也应该没有时间使感知重新耦合。从感知脱钩的角度来看,在这种情况下,响应延迟对性能没有影响。或者,如果响应偏差是关键的,即使在这种情况下,响应延迟也可能影响性能。在这项研究中,我们缩短了刺激呈现时间,并增加了一个结构化的面具。我们检查了响应延迟是否影响SART和SART响应格式反转的任务的性能。我们预计响应延迟只会影响信号检测理论偏差,C,在SART中,回应宽大处理是一个问题。在反向格式化的SART中,由于偏见预计不会宽大处理,我们预计反应延迟对反应偏差没有影响或影响最小.这些预测得到了验证。响应偏差在理解SART性能方面更为重要,而不是感知解耦,如果它发生在SART中,这是罕见的。
    Researchers dispute the cause of errors in high Go, low No Go target detection tasks, like the Sustained Attention to Response Task (SART). Some researchers propose errors in the SART are due to perceptual decoupling, where a participant is unaware of stimulus identity. This lack of external awareness causes an erroneous response. Other researchers suggest the majority of the errors in the SART are instead due to response leniency, not perceptual decoupling. Response delays may enable a participant who is initially unaware of stimulus identity, perceptually decoupled, to become aware of stimulus identity, or perceptually recoupled. If, however, the stimulus presentation time is shortened to the minimum necessary for stimulus recognition and the stimulus is disrupted with a structured mask, then there should be no time to enable perception to recouple even with a response delay. From the perceptual decoupling perspective, there should be no impact of a response delay on performance in this case. Alternatively if response bias is critical, then even in this case a response delay may impact performance. In this study, we shortened stimulus presentation time and added a structured mask. We examined whether a response delay impacted performance in the SART and tasks where the SART\'s response format was reversed. We expected a response delay would only impact signal detection theory bias, c, in the SART, where response leniency is an issue. In the reverse formatted SART, since bias was not expected to be lenient, we expected no impact or minimal impact of a response delay on response bias. These predictions were verified. Response bias is more critical in understanding SART performance, than perceptual decoupling, which is rare if it occurs at all in the SART.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    在许多决策任务中,我们有一系列的替代选择,我们面临的问题是如何利用我们潜在的信念和偏好对每一个选择做出单一的选择。认知和决策模型通常假定信念和偏好被提炼成每种选择的标量潜在强度,但对人们如何利用这些潜在的优势来选择单一的替代方案进行建模也是至关重要的。大多数模型遵循两个传统之一来建立这种联系。现代心理物理学和记忆研究者利用信号检测理论,假设潜在的强度受到噪音的干扰,并选择最高的结果信号。相比之下,许多现代决策理论建模和机器学习方法都使用softmax函数(该函数基于Luce的选择公理;Luce,1959)给予非最大强度替代品一定的权重。尽管这两种选择理论很突出,目前的方法很少解决它们之间的联系,和选择一个或另一个似乎更多的动机在传统的相关文献比理论或经验的原因更喜欢一个理论。当前工作的目标是通过阐明这两个模型中的哪一个可以更好地表征m-alternative决策任务中的潜在过程来重新审视这一主题。特别关注内存任务。在一组视觉记忆实验中,我们证明,在相同的实验设计中,softmax参数β在m个备选方案中变化,而信号检测模型的参数d'是稳定的。一起,我们的发现表明,用信号检测链路模型代替softmax将在任务结构的变化中产生更普遍的预测。更有野心,信号检测模型参数在不同任务之间的不变性表明,这些模型的参数假设可能不仅仅是数学上的便利,但反映了人类决策的一些真实的东西。
    In many decision tasks, we have a set of alternative choices and are faced with the problem of how to use our latent beliefs and preferences about each alternative to make a single choice. Cognitive and decision models typically presume that beliefs and preferences are distilled to a scalar latent strength for each alternative, but it is also critical to model how people use these latent strengths to choose a single alternative. Most models follow one of two traditions to establish this link. Modern psychophysics and memory researchers make use of signal detection theory, assuming that latent strengths are perturbed by noise, and the highest resulting signal is selected. By contrast, many modern decision theoretic modeling and machine learning approaches use the softmax function (which is based on Luce\'s choice axiom; Luce, 1959) to give some weight to non-maximal-strength alternatives. Despite the prominence of these two theories of choice, current approaches rarely address the connection between them, and the choice of one or the other appears more motivated by the tradition in the relevant literature than by theoretical or empirical reasons to prefer one theory to the other. The goal of the current work is to revisit this topic by elucidating which of these two models provides a better characterization of latent processes in m -alternative decision tasks, with a particular focus on memory tasks. In a set of visual memory experiments, we show that, within the same experimental design, the softmax parameter β varies across m -alternatives, whereas the parameter d \' of the signal-detection model is stable. Together, our findings indicate that replacing softmax with signal-detection link models would yield more generalizable predictions across changes in task structure. More ambitiously, the invariance of signal detection model parameters across different tasks suggests that the parametric assumptions of these models may be more than just a mathematical convenience, but reflect something real about human decision-making.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    情感识别是亲社会互动的核心,能够推断心理和情感状态。已知犯有性犯罪的人表现出社会情感缺陷,文献中发现的四个动态风险评估维度之一。很少有研究集中在情感识别上。现有文献,专门针对在监狱中犯有性犯罪的个人,显示对比结果。有些人发现了一个全球性的(跨越所有情绪)或特定的(例如,愤怒,恐惧)情感识别中的缺陷。相比之下,其他人发现在监狱中犯有性犯罪的人和犯有非性犯罪的人之间没有区别。此外,尚未在表现出社会情感缺陷的法医住院患者中进行此类研究。这项研究旨在调查112名男性参与者对动态面部表情的识别,这些参与者分为三组:犯有性犯罪的法医住院患者(n=37),犯有非性犯罪的法医住院病人(n=25),和社区成员(n=50),使用信号检测理论指数:灵敏度(d')和响应偏差(c)。此外,与反应时间相关的措施,情感标签反射时间,任务容易,还收集了容易反射时间。非参数分析(Kruskall-Wallis\'H,随后是Mann-Whitney的U与Dunn-Bonferroni更正)强调,与社区成员相比,这两个法医住院组表现出情感识别缺陷。犯有性犯罪的法医住院患者在选择惊喜标签方面比社区成员更为保守。他们还花费了更多的时间来对刺激做出反应并选择情感标签。尽管情感识别缺陷,两个法医住院组报告的刺激容易性比社区成员多。
    Emotion recognition is central in prosocial interaction, enabling the inference of mental and affective states. Individuals who have committed sexual offenses are known to exhibit socio-affective deficits, one of the four dynamic risk assessment dimensions found in the literature. Few research focused on emotion recognition. The available literature, exclusively on individuals in prison who have committed sexual offenses, showed contrasting results. Some found a global (across all emotions) or specific (e.g., anger, fear) deficit in emotion recognition. In contrast, others found no difference between individuals in prison who have committed sexual offenses and those who have committed non-sexual offenses. In addition, no such study has been undertaken among forensic inpatients who exhibit socio-affective deficits. This study aims to investigate the recognition of dynamic facial expressions of emotion in 112 male participants divided into three groups: forensic inpatients who have committed sexual offenses (n = 37), forensic inpatients who have committed non-sexual offenses (n = 25), and community members (n = 50), using the Signal Detection Theory indices: sensitivity (d\') and response bias (c). In addition, measures related to reaction time, emotion labeling reflection time, task easiness, and easiness reflection time were also collected. Non-parametric analyses (Kruskall-Wallis\' H, followed by Mann-Whitney\'s U with Dunn-Bonferroni correction) highlighted that the two forensic inpatient groups exhibited emotion recognition deficits when compared to community members. Forensic inpatients who have committed sexual offenses were more conservative in selecting the surprise label than community members. They also took significantly more time to react to stimuli and to select an emotional label. Despite emotion recognition deficits, the two forensic inpatient groups reported more stimuli easiness than community members.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    先前的研究报告了在将同时阵容与两个不同的阵容程序进行比较时,在区分性和响应偏差方面,结果存在不同的模式,顺序停止规则阵容和英国阵容。在单个大样本实验中,我们比较了同时呈现的六项摄影阵容中的可辨别性和响应偏差,有一个停止规则,或者在进行识别并包括重新访问物品的能力之前,依次要求对物品进行两圈完整的检查,类似于英国的阵容程序。与顺序停止规则阵容相比,同时阵容的可识别性更大,尽管这两个程序之间的经验辨别性没有显著差异。将同时阵容与连续两个阵容以及连续两个圈阵容与连续停止规则阵容进行比较时,可判别性没有显着差异。对于连续的两圈阵容,回应是最宽容的,紧随其后的是同时阵容,其次是顺序阵容。这些结果表明,顺序项目呈现可能不会对辨别性和响应偏差产生很大影响。相反,序贯停止规则阵容和英国阵容中的可辨别性和反应偏差是由于序贯项目呈现与这些程序的其他方面的相互作用所致.
    Previous research has reported diverging patterns of results with respect to discriminability and response bias when comparing the simultaneous lineup to two different lineup procedures in which items are presented sequentially, the sequential stopping rule lineup and the UK lineup. In a single large sample experiment, we compared discriminability and response bias in six-item photographic lineups presented either simultaneously, sequentially with a stopping rule, or sequentially requiring two full laps through the items before making an identification and including the ability to revisit items, analogous to the UK lineup procedure. Discriminability was greater for the simultaneous lineup compared to the sequential stopping rule lineup, despite a non-significant difference in empirical discriminability between the procedures. There was no significant difference in discriminability when comparing the simultaneous lineup to the sequential two lineup and the sequential two lap lineup to the sequential stopping rule lineup. Responding was most lenient for the sequential two lap lineup, followed by the simultaneous lineup, followed by the sequential lineup. These results imply that sequential item presentation may not exert a large effect in isolation on discriminability and response bias. Rather, discriminability and response bias in the sequential stopping rule lineup and UK lineup result from the interaction of sequential item presentation with other aspects of these procedures.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号