Speech Acoustics

语音声学
  • 文章类型: Journal Article
    语言韵律在帕金森病(PD)中受影响,这暗示了基底神经节在韵律产生中的作用。然而,最近没有关于PD韵律损害的可用声学证据的系统综合。这项研究旨在确定在PD中始终受到影响的语言韵律的声学特征。
    作者系统地回顾了报道PD中韵律产生的声学特征的文章。文章关注基频(F0)及其变异性,强度及其变异性,语音和发音率,和暂停持续时间和比率。在总共648条记录中,36符合纳入和排除标准。对于每个声学测量和任务,将PD患者(PwPD)的数据与对照组的数据进行比较,以提取效应大小.使用稳健的贝叶斯分层回归模型估计集合效应大小。
    PD与F0变异性降低和暂停持续时间增加相关。PwPD中强度变异性和语速降低的证据有限。没有证据表明PD会影响发音率或停顿率。
    受PD影响的韵律的主要声学参数是F0变异性和停顿持续时间。这些声学参数的识别对于PD管理策略的选择具有重要的临床意义。F0变异性和停顿持续时间与PD的关联表明,控制这些参数的神经回路至少部分共享,并且可能包括基底神经节。虽然当前的研究集中在韵律线索的语音实现上,未来的研究应该检查PD是否以及如何在更高的加工水平下影响韵律。
    https://doi.org/10.23641/asha.25892923。
    UNASSIGNED: Linguistic prosody is affected in Parkinson\'s disease (PD), which implicates the basal ganglia\'s role in the production of prosody. However, there is no recent systematic synthesis of the available acoustic evidence of prosodic impairment in PD. This study aimed to identify the acoustic features of linguistic prosody that are consistently affected in PD.
    UNASSIGNED: The authors systematically reviewed articles that reported acoustic features of prosodic production in PD. Articles focused on fundamental frequency (F0) and its variability, intensity and its variability, speech and articulation rate, and pause duration and ratio. From a total of 648 records identified, 36 met criteria for inclusion and exclusion. For each acoustic measurement and task, data from people with PD (PwPD) were compared with those from controls to extract effect sizes. Pooled effect sizes were estimated using robust Bayesian hierarchical regression models.
    UNASSIGNED: PD was associated with decreased F0 variability and increased pause duration. There was limited evidence of reduced intensity variability and speech rate in PwPD. No evidence was found to suggest that PD affects articulation rate or pause ratio.
    UNASSIGNED: The primary acoustic parameters of prosody affected by PD are F0 variability and pause duration. The identification of these acoustic parameters has important clinical implications for the selection of PD management strategies. The association of F0 variability and pause duration with PD suggests that the neural circuits controlling these parameters are at least partly shared and might include the basal ganglia. While the current study focused on the phonetic realization of prosodic cues, future studies should examine whether and how PD affects prosody at higher levels of processing.
    UNASSIGNED: https://doi.org/10.23641/asha.25892923.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    目的:在早期声门鳞状细胞癌,在经口激光显微手术(TLM)和放射治疗(RT)之间的疾病控制方面也描述了类似的结果.在过去的二十年里,几项研究比较了独家RT和TLM的主观声乐结果,显示出随着时间的推移TLM结果改善的趋势。然而,在专用RT和TLM之间的频谱声学语音参数方面的客观差异很少被研究。这项系统评价的目的是评估TLM和RT治疗早期声门型喉癌后的语音质量。基于声学分析参数,包括抖动,shimmer,噪声谐波比,基频和最大发声时间。
    方法:在Pubmed,Scopus和Cochrane数据库遵循PRISMA指南。
    结果:从搜索中总共检索到441个标题。在全文筛选和应用纳入/排除标准后,包括12篇文章。我们发现TLM和RT治疗在考虑的声学分析参数中没有显着差异,除了Shimmer,在RT组中报告了更有利的值。
    结论:考虑到疾病的传播,并期望随着时间的推移,长期生存率得到改善,必须进行设计良好的多中心研究,并对更多人群进行长期随访,以更好地评估频谱声学语音参数方面的客观语音结果.
    OBJECTIVE: In early glottic squamous cell carcinoma, similar results have been described in terms of disease control between transoral laser microsurgery (TLM) and radiation therapy (RT). During the past two decades, several studies compared subjective vocal outcomes of exclusive RT with those of TLM, showing a trend towards improving results for TLM over time. However, the objective differences in terms of spectro-acoustic voice parameters between exclusive RT and TLM have been less frequently investigated. The aim of this systematic review with meta-analysis was to evaluate voice quality after TLM and RT treatment for early glottic carcinoma, based on acoustic analysis parameters including jitter, shimmer, noise to harmonic ratio, fundamental frequency and maximum phonation time.
    METHODS: A search of the English published literature was conducted on the Pubmed, Scopus and Cochrane databases following PRISMA guidelines.
    RESULTS: A total of 441 titles were retrieved from the search. After full-text screening and application of inclusion/exclusion criteria, 12 articles were included. We found no significant differences between TLM and RT treatment in the considered acoustic analysis parameters, except for Shimmer, with more favorable values reported in the RT group.
    CONCLUSIONS: Considering the spread of the disease and expecting an improvement in long-term survival over time, well-designed and multicentric studies involving larger populations with a long-term follow up are mandatory to better assess objective voice outcomes in terms of spectro-acoustic voice parameters.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    目的:由于近年来有关Wendler声门成形术(WG)对变性女性声音影响的新证据已在文献中获得,我们旨在进行更新的系统评价和荟萃分析,以确定WG在声带女性化过程中的实际安全性和有效性.
    方法:PubMed,Embase,和Cochrane被搜索到2023年7月4日之前发表的英语文章。如果研究人员评估了WG对跨性别女性的声学空气动力学措施和语音质量的影响,则研究合格。
    结果:确定了23项研究。由于数据不完整,排除了三项研究后,20项研究包括656例患者纳入荟萃分析。在WG之后,基频显著增加,说到基频,和频率范围的下限(p<0.001)。同时,观察到频率范围和最大发声时间显着降低(p<0.001)。术前和术后值之间没有发现显著差异,粗糙度,呼吸,Asthenia,和应变量表评分(p=0.339)。WG后,跨性别语音问卷(TWVQ)的总分显着提高(p<0.001)。
    结论:WG是跨性别女性的有效语音女性化方法,与高手术成功率和低术后并发症风险相关。术后TWVQ评分显著改善提示其对嗓音相关生活质量的积极影响。术后最大发声时间和频率范围的减少似乎不会显着影响发声的有效性。
    OBJECTIVE: Since new evidence regarding the impact of Wendler glottoplasty (WG) on the voice in transgender women became available in the literature in recent years, we aimed to perform an updated systematic review and meta-analysis to determine the actual safety and efficacy of WG in the process of vocal feminization.
    METHODS: PubMed, Embase, and Cochrane were searched for English-language articles published until July 4, 2023. Studies were found eligible if they evaluated the impact of WG on the acoustic-aerodynamic measures and quality of voice in transgender women.
    RESULTS: Twenty-three studies were identified. After exclusion of three studies due to incomplete data, 20 studies including 656 patients were included in the meta-analysis. After WG, there was a significant increase of fundamental frequency, speaking fundamental frequency, and lower limit of the frequency range (p < 0.001). Concurrently, a significant reduction of frequency range and maximum phonation time was observed (p < 0.001). No significant differences were found between the pre- and postoperative values regarding the Grade, Roughness, Breathiness, Asthenia, and Strain scale score (p = 0.339). The overall score in the Trans Woman Voice Questionnaire (TWVQ) significantly improved after WG (p < 0.001).
    CONCLUSIONS: WG is an effective voice feminization method in transgender women, associated with a high procedural success and low risk of postoperative complications. Significantly improved TWVQ score after surgery suggests its positive impact on the voice-related quality of life. Postoperative decrease of maximum phonation time and frequency range does not seem to significantly impact the effectiveness of voice production.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Meta-Analysis
    目的:肌张力发声障碍(MTD)是最常见的功能性嗓音障碍。行为嗓音疗法是MTD的一线治疗,喉部手动治疗可能是这种治疗的一部分。这项研究的目的是研究手动喉部治疗(MCT)对语音质量的声学标记(抖动,shimmer,和谐波噪声比)和声音功能(基频)通过荟萃分析进行系统评价。
    方法:从开始到2022年12月搜索了四个数据库,并进行了手动搜索。
    方法:应用了PRISMA扩展声明,用于报告包含医疗保健干预措施的荟萃分析的系统综述,并使用随机效应模型进行荟萃分析。
    结果:我们从30项研究中确定了6项符合条件的研究(无重复)。MCT方法对具有大效果尺寸(Cohen'sd>0.8)的声学非常有效。在抖动百分比方面获得了显着改善(平均差-.58;95%CI-1.00至0.16),闪烁百分比(平均差为-5.66;95%CI-8.16至3.17),和谐波噪声比,单位为dB(平均差为4.65;95%CI1.90-7.41),考虑到测量变异性,MCT继续显着改善后两种测量。
    结论:大多数临床研究通过评估抖动来证实MCT对MTD的疗效,shimmer,以及与语音质量相关的谐波噪声比。无法验证MCT对基频变化的影响。需要高质量随机对照试验的进一步贡献来支持喉科循证实践。喉镜,2023年。
    Muscle tension dysphonia (MTD) is the most common functional voice disorder. Behavioral voice therapy is the front-line treatment for MTD, and laryngeal manual therapy may be a part of this treatment. The objective of this study was to investigate the effect of manual circumlaryngeal therapy (MCT) on acoustic markers of voice quality (jitter, shimmer, and harmonics-to-noise ratio) and vocal function (fundamental frequency) through a systematic review with meta-analysis.
    Four databases were searched from inception to December 2022, and a manual search was performed.
    The PRISMA extension statement for reporting systematic reviews incorporating a meta-analysis of health care interventions was applied, and a random effects model was used for the meta-analyses.
    We identified 6 eligible studies from 30 studies (without duplicates). The MCT approach was highly effective on acoustics with large effect sizes (Cohen\'s d > 0.8). Significant improvements were obtained in jitter in percent (mean difference of -.58; 95% CI -1.00 to 0.16), shimmer in percent (mean difference of -5.66; 95% CI -8.16 to 3.17), and harmonics-to-noise ratio in dB (mean difference of 4.65; 95% CI 1.90-7.41), with the latter two measurements continuing to be significantly improved by MCT when measurement variability is considered.
    The efficacy of MCT for MTD was confirmed in most clinical studies by assessing jitter, shimmer, and harmonics-to-noise ratio related to voice quality. The effects of MCT on the fundamental frequency changes could not be verified. Further contributions of high-quality randomized control trials are needed to support evidence-based practice in laryngology. Laryngoscope, 134:18-26, 2024.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • Acoustic analysis of the voice, as an objective, quantitative, non-invasive and reproducible method for the evaluation of voice quality, can be used to detect and analyze the acoustic characteristics of normal, artistic or pathological voice. With the development of medicine, physics, statistics, and artificial intelligence technology, there are new advances in the study of voice acoustic analysis, especially in terms of acoustic parameters. In addition, artificial neural networks can be used to perform complex multi-parameter analysis, which greatly improves the efficiency of acoustic analysis. This paper provides an overview of the methods of acoustic analysis and its latest development.
    摘要: 嗓音声学分析可用于检测和分析正常嗓音、艺术嗓音和病理性嗓音的声学特征,是一种客观、定量、非侵入且可重复的嗓音质量评价方法。随着现代医学、物理学、统计学和人工智能技术的发展,近年来嗓音声学分析的研究,特别是在声学参数的开发和适用性上有了新的进展。同时利用人工神经网络等辅助计算方法进行复杂的多参数分析,大大提高了嗓音声学分析的效率。本文就嗓音声学分析的方法及其最新进展做一概述。.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Systematic Review
    When speaking to infants, adults often produce speech that differs systematically from that directed to other adults. To quantify the acoustic properties of this speech style across a wide variety of languages and cultures, we extracted results from empirical studies on the acoustic features of infant-directed speech. We analysed data from 88 unique studies (734 effect sizes) on the following five acoustic parameters that have been systematically examined in the literature: fundamental frequency (f0), f0 variability, vowel space area, articulation rate and vowel duration. Moderator analyses were conducted in hierarchical Bayesian robust regression models to examine how these features change with infant age and differ across languages, experimental tasks and recording environments. The moderator analyses indicated that f0, articulation rate and vowel duration became more similar to adult-directed speech over time, whereas f0 variability and vowel space area exhibited stability throughout development. These results point the way for future research to disentangle different accounts of the functions and learnability of infant-directed speech by conducting theory-driven comparisons among different languages and using computational models to formulate testable predictions.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    人类的声音随着神经系统疾病的进展和影响发音者的疾病的发作而变化,经常降低沟通的有效性。可以使用提取声学特征的信号处理技术客观地测量这些变化。测量声学特征时,通常有几个步骤和假设可能被声学和语音学专家所知,但对其他学科的透明度较低(例如,临床医学,言语病理学,工程,和数据科学)。本教程介绍了这些信号处理技术,明确概述了精确测量的基本步骤,并讨论了临床声学标志物的含义。
    我们用直截了当的术语建立词汇,提供可视化以实现共同点,并指导对声学和听觉信号处理领域之外的人的理解。在可能的情况下,我们重点介绍了测量临床声学标志物的最佳实践,并为获得和进一步理解这些测量提供了资源。
    The human voice changes with the progression of neurological disease and the onset of diseases that affect articulators, often decreasing the effectiveness of communication. These changes can be objectively measured using signal processing techniques that extract acoustic features. When measuring acoustic features, there are often several steps and assumptions that might be known to experts in acoustics and phonetics, but are less transparent for other disciplines (e.g., clinical medicine, speech pathology, engineering, and data science). This tutorial describes these signal processing techniques, explicitly outlines the underlying steps for accurate measurement, and discusses the implications of clinical acoustic markers.
    We establish a vocabulary using straightforward terms, provide visualizations to achieve common ground, and guide understanding for those outside the domains of acoustics and auditory signal processing. Where possible, we highlight the best practices for measuring clinical acoustic markers and suggest resources for obtaining and further understanding these measures.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Meta-Analysis
    目的:神经退行性运动疾病(NMD)对患者及其亲人的生活具有破坏性影响,部分原因是神经系统异常对言语的影响,这极大地限制了功能沟通。因此,临床语音研究人员花费了数十年的时间来研究患有NMD的人群的语音特征。鉴于其对清晰度的不利影响,关节功能受损的特征尤其令人感兴趣,它们编码各种不同运动障碍的能力,以及它们作为神经退行性疾病诊断指标的潜力。本次范围界定审查的目的是确定(1)衔接的哪些组成部分(即协调,一致性,速度,精度,和重复率)在有关NMD的声学文献中表现得最多;(2)哪些声学关节特征显示出最有可能检测NMD中的语音运动功能障碍;(3)每个NMD中哪些关节成分受损最多。方法:这篇综述研究了1976年至2020年发表的文献。使用预定义的关键搜索词从六个电子数据库中识别出研究。第一个研究目标是通过研究每个关节成分的频率计数来解决的。而第二个和第三个目标是使用荟萃分析解决的.结果:来自126项研究的结果表明,人们相当重视发音精度。在荟萃分析中包含的24个特征中,将NMD人口与对照组进行比较时,元音散度/距离和停差持续时间表现出最大的影响。荟萃分析还揭示了不同疾病类型的关节表现的不同模式,提供关节损伤独特特征的证据。结论:这篇综述说明了有关NMD声关节特征的文献现状。通过强调每个关节组成部分和疾病组中的需要领域,这项工作为临床研究人员提供了基础,演讲科学家,神经学家,和计算机科学工程师可以开发研究问题,这将扩大和加深对NMD中关节损伤的理解。
    Neurodegenerative motor diseases (NMDs) have devastating effects on the lives of patients and their loved ones, in part due to the impact of neurologic abnormalities on speech, which significantly limits functional communication. Clinical speech researchers have thus spent decades investigating speech features in populations suffering from NMDs. Features of impaired articulatory function are of particular interest given their detrimental impact on intelligibility, their ability to encode a variety of distinct movement disorders, and their potential as diagnostic indicators of neurodegenerative diseases. The objectives of this scoping review were to identify (1) which components of articulation (i.e. coordination, consistency, speed, precision, and repetition rate) are the most represented in the acoustic literature on NMDs; (2) which acoustic articulatory features demonstrate the most potential for detecting speech motor dysfunction in NMDs; and (3) which articulatory components are the most impaired within each NMD.
    This review examined literature published between 1976 and 2020. Studies were identified from six electronic databases using predefined key search terms. The first research objective was addressed using a frequency count of studies investigating each articulatory component, while the second and third objectives were addressed using meta-analyses.
    Findings from 126 studies revealed a considerable emphasis on articulatory precision. Of the 24 features included in the meta-analyses, vowel dispersion/distance and stop gap duration exhibited the largest effects when comparing the NMD population to controls. The meta-analyses also revealed divergent patterns of articulatory performance across disease types, providing evidence of unique profiles of articulatory impairment.
    This review illustrates the current state of the literature on acoustic articulatory features in NMDs. By highlighting the areas of need within each articulatory component and disease group, this work provides a foundation on which clinical researchers, speech scientists, neurologists, and computer science engineers can develop research questions that will both broaden and deepen the understanding of articulatory impairments in NMDs.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    此范围审查考虑了针对非本地听众的清晰语音寄存器的声学特征,称为外国人定向语音(FDS)。我们确定元音高发音和低语率是FDS最具代表性的声学特征;其他特征,包括宽间距范围和高强度,仍在辩论中。我们还讨论了可能影响FDS结果和特征的因素。我们首先研究住宿理论,概述了FDS可能通过帮助听众获得第二语言(L2)来发挥教学功能的原因。我们研究这个语音寄存器如何适应听众的身份和语言需求,这表明FDS也考虑了听众的L2熟练程度。为了确认FDS的教学功能,我们将其与其他清晰的语音寄存器进行比较,特别是婴儿演讲和伦巴第演讲。
    我们的评论显示,研究尚未确定FDS是否成功作为支持L2获取的教学工具。此外,一组复杂的因素决定了FDS的具体实现,这需要进一步探索。最后,我们总结了悬而未决的问题,并指出了未来研究的方向和建议。
    This scoping review considers the acoustic features of a clear speech register directed to nonnative listeners known as foreigner-directed speech (FDS). We identify vowel hyperarticulation and low speech rate as the most representative acoustic features of FDS; other features, including wide pitch range and high intensity, are still under debate. We also discuss factors that may influence the outcomes and characteristics of FDS. We start by examining accommodation theories, outlining the reasons why FDS is likely to serve a didactic function by helping listeners acquire a second language (L2). We examine how this speech register adapts to listeners\' identities and linguistic needs, suggesting that FDS also takes listeners\' L2 proficiency into account. To confirm the didactic function of FDS, we compare it to other clear speech registers, specifically infant-directed speech and Lombard speech.
    Our review reveals that research has not yet established whether FDS succeeds as a didactic tool that supports L2 acquisition. Moreover, a complex set of factors determines specific realizations of FDS, which need further exploration. We conclude by summarizing open questions and indicating directions and recommendations for future research.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    建模语音产生和语音衔接仍然是一个不断发展的研究课题。当前的一些核心问题是:控制语音发音的基本(神经)组织是什么?如何以有效且生物学上真实的方式对嘴唇和舌头等语音发音者及其运动进行建模?如何开发高质量的发音声学模型以实现高质量的发音语音合成?因此,一方面,计算机建模将帮助我们展开语音产生的潜在生物学和声学发音概念,另一方面,进一步的建模工作将帮助我们实现基于更详细的语音合成的高质量发音语音合成的目标。关于声道声学和语音发音的知识。目前,发音模型无法达到基于语料库的语音合成的质量水平。此外,基于生物力学和神经肌肉的方法很复杂,仍然无法用于句子级语音合成。本文列出了许多计算机实现的发音模型,并提供了将发音模型划分为不同类别的标准。最近的一个主要研究问题,即,详细讨论了如何以神经生物学适当的方式控制关节模型。可以得出结论,非常需要进一步开发发音声学模型,以便测试语音发音的基于神经生物学的定量控制概念,并揭示人类发音和声音信号生成中的其余细节。此外,这些努力可能有助于我们接近建立高质量的发音声学以及神经生物学基础的语音合成的目标。
    Modeling speech production and speech articulation is still an evolving research topic. Some current core questions are: What is the underlying (neural) organization for controlling speech articulation? How to model speech articulators like lips and tongue and their movements in an efficient but also biologically realistic way? How to develop high-quality articulatory-acoustic models leading to high-quality articulatory speech synthesis? Thus, on the one hand computer-modeling will help us to unfold underlying biological as well as acoustic-articulatory concepts of speech production and on the other hand further modeling efforts will help us to reach the goal of high-quality articulatory-acoustic speech synthesis based on more detailed knowledge on vocal tract acoustics and speech articulation. Currently, articulatory models are not able to reach the quality level of corpus-based speech synthesis. Moreover, biomechanical and neuromuscular based approaches are complex and still not usable for sentence-level speech synthesis. This paper lists many computer-implemented articulatory models and provides criteria for dividing articulatory models in different categories. A recent major research question, i.e., how to control articulatory models in a neurobiologically adequate manner is discussed in detail. It can be concluded that there is a strong need to further developing articulatory-acoustic models in order to test quantitative neurobiologically based control concepts for speech articulation as well as to uncover the remaining details in human articulatory and acoustic signal generation. Furthermore, these efforts may help us to approach the goal of establishing high-quality articulatory-acoustic as well as neurobiologically grounded speech synthesis.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号