topic modeling

主题建模
  • 文章类型: Journal Article
    机器学习和深度学习算法的最新进展,随着生成AI的出现,导致人工智能成为组织中的“新常态”。这种趋势已经扩展到CRM,导致了支持人工智能的CRM系统的开发,或AI-CRM。尽管越来越多地采用人工智能作为竞争战略的一部分,许多公司报告说,人工智能对绩效的积极影响很小,甚至没有。这项研究解决了以下研究问题:“AI-CRM系统的关键特征是什么?”和“这些特征如何影响组织竞争优势?”我们的目标是确定AI-CRM的关键特征,并评估它们对组织绩效的影响。在研究1中,我们利用BERTopic主题建模从用户评论中提取AI-CRM的关键特征。研究2采用PLS-SEM来检验这些特征如何影响组织竞争优势。研究1揭示了AI-CRM的四个主要特征(一般,市场营销,销售,和服务/支持),每个都包括不同的特征。研究2表明,这些特征不同地影响CRM能力,显著影响绩效和竞争优势。这些发现为在组织中有效使用人工智能的理论和实践提供了宝贵的见解。
    The recent advances in machine learning and deep learning algorithms, along with the advent of generative AI, have led AI to become the \"new normal\" in organizations. This trend has extended to CRM, resulting in the development of AI-enabled CRM systems, or AI-CRM. Despite the growing adoption of AI as part of competitive strategies, many firms report minimal or no positive effect of AI on performance. This study addresses the research questions: \"What are the critical features of AI-CRM systems?\" and \"How do these features impact organizational competitive advantage?\" To explore this, we aim to identify key characteristics of AI-CRM and assess their impact on organizational performance. In Study 1, we utilize BERTopic topic modeling to extract critical features of AI-CRM from user reviews. Study 2 employs PLS-SEM to examine how these features influence organizational competitive advantage. Study 1 reveals four main characteristics of AI-CRM (general, marketing, sales, and service/support), each comprising distinct features. Study 2 shows that these characteristics differentially impact CRM capability, significantly affecting performance and competitive advantage. The findings offer valuable insights for both theory and practice regarding the effective use of AI in organizations.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Editorial
    如今,许多研究讨论学术出版和相关的挑战,但是被劫持期刊的问题被忽视了。被劫持的期刊是模仿原始期刊但由网络犯罪分子管理的克隆网站。本研究使用主题建模方法来分析被劫持版本的医学期刊中已发表的论文。
    从医学领域的21种被劫持期刊下载了总共3384篇论文,并通过主题建模算法进行了分析。
    结果表明,被劫持的医学期刊在医学领域的大多数领域都有发表,并且通常尊重原始期刊的主要领域。
    学术界面对的是被劫持的第三代期刊,它们的检测可能比普通的更复杂。人工智能(AI)的使用可以成为处理这种现象的强大工具。
    UNASSIGNED: Nowadays, many studies discuss scholarly publishing and associated challenges, but the problem of hijacked journals has been neglected. Hijacked journals are cloned websites that mimic original journals but are managed by cybercriminals. The present study uses a topic modeling approach to analyze published papers in hijacked versions of medical journals.
    UNASSIGNED: A total of 3384 papers were downloaded from 21 hijacked journals in the medical domain and analyzed by topic modeling algorithm.
    UNASSIGNED: Results indicate that hijacked versions of medical journals are published in most fields of the medical domain and typically respect the primary domain of the original journal.
    UNASSIGNED: The academic world is faced with the third-generation of hijacked journals, and their detection may be more complex than common ones. The usage of artificial intelligence (AI) can be a powerful tool to deal with the phenomenon.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    微生物组,一个复杂的微生态系统,帮助宿主进行各种重要的生理过程。微生物组的改变(生态失调)与几种疾病有关,一般来说,在健康组和患者组之间进行差异丰度测试以鉴定重要的细菌。然而,向个体提供单一种类的细菌,因为治疗不如粪便微生物群移植治疗成功,健康个体的整个微生物组被转移。这些观察结果表明,细菌的组合可能对有益作用至关重要。在这里,我们提供了利用主题建模的框架,一种无监督的机器学习方法,识别与健康或疾病相关的细菌群落。具体来说,我们使用了以前发表的多发性硬化症(MS)患者的肠道微生物组数据,一种与肠道菌群失调有关的神经退行性疾病。我们确定了与MS相关的细菌群落,包括以前发现的属,还有其他会被差异丰度测试忽略的。这种方法可以是分析微生物组的有用工具,它应该与常用的差异丰度测试一起考虑,以更好地了解肠道微生物组在健康和疾病中的作用。
    The microbiome, a complex micro-ecosystem, helps the host with various vital physiological processes. Alterations of the microbiome (dysbiosis) have been linked with several diseases, and generally, differential abundance testing between the healthy and patient groups is performed to identify important bacteria. However, providing a singular species of bacteria to an individual as treatment has not been as successful as fecal microbiota transplant therapy, where the entire microbiome of a healthy individual is transferred. These observations suggest that a combination of bacteria might be crucial for the beneficial effects. Here we provide the framework to utilize topic modeling, an unsupervised machine learning approach, to identify a community of bacteria related to health or disease. Specifically, we used our previously published gut microbiome data of patients with multiple sclerosis (MS), a neurodegenerative disease linked to a dysbiotic gut microbiome. We identified communities of bacteria associated with MS, including genera previously discovered, but also others that would have been overlooked by differential abundance testing. This method can be a useful tool for analyzing the microbiome, and it should be considered along with the commonly utilized differential abundance tests to better understand the role of the gut microbiome in health and disease.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    为了应对医疗保健领域的持续挑战,纳入患者和医疗保健专业人员等利益相关者的第一手经验和观点至关重要。然而,当前的收集过程,分析和解释定性数据,比如面试,是缓慢和劳动密集型的。为了加快这一进程,提高效率,自动化方法旨在提取有意义的主题并加速解释,但是当前的方法,如主题建模,降低了原始数据的丰富性。这里,我们评估大型语言模型是否可以用于支持定性访谈数据的半自动解释。我们将基于LLM的新颖方法与主题建模方法进行比较,并在两个不同的定性访谈数据集中手动识别主题。这项探索性研究发现,LLM有可能支持在可持续医疗保健系统的发展中更广泛地纳入人类观点。
    To address the persistent challenges in healthcare, it is crucial to incorporate firsthand experiences and perspectives from stakeholders such as patients and healthcare professionals. However, the current process of collecting, analyzing and interpreting qualitative data, such as interviews, is slow and labor-intensive. To expedite this process and enhance efficiency, automated approaches aim to extract meaningful themes and accelerate interpretation, but current approaches such as topic modeling reduce the richness of the raw data. Here, we evaluate whether Large Language Models can be used to support the semi-automated interpretation of qualitative interview data. We compare a novel approach based on LLMs to topic modeling approaches and to manually identified themes across two different qualitative interview datasets. This exploratory study finds that LLMs have the potential to support incorporating human perspectives more widely in the advancement of sustainable healthcare systems.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    本文在流行的瑞典讨论论坛Flashback上探讨了多个线程中的流行主题。在各种各样的主题中,该论坛积极参与用户解决和辩论与COVID-19疫苗和疫苗接种有关的问题。通过在14个相关主题讨论的帖子中区分积极和消极的观点,我们雇佣了BERTopic,一个模块化的主题建模框架,它利用预先训练的语言模型并应用聚类技术来识别当前主题。这使我们能够对总体主题进行细致入微的探索,为瑞典关于COVID-19疫苗和疫苗接种的讨论的多面性提供有价值的见解。
    This paper explores the prevalent themes across multiple threads on the popular Swedish discussion forum Flashback. Among its diverse array of topics, the forum actively engages users in addressing and debating questions pertaining to COVID-19 vaccines and vaccination. Through distinguishing between positive and negative perspectives within posts across 14 relevant thread discussions, we employ BERTopic, a modular topic modeling framework, which utilizes pre-trained language models and applies clustering techniques to identify prevailing topics. This enables us to conduct a nuanced exploration of overarching themes, offering valuable insights into the multifaceted nature of the discussions regarding COVID-19 vaccines and vaccination in Sweden.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    投票建议应用程序(VAA)在潜在选民中的受欢迎程度证明了它们的相关性和重要性。平均而言,大约30%的选民在选举期间考虑了这些申请的建议。潜在选民和政党立场之间的比较是根据要求用户发表意见的VAA政策声明进行的。VAA设计人员投入大量时间和精力来分析国内和国际政治,以制定政策声明并选择要包含在应用程序中的声明。此过程涉及手动读取和评估大量公开可用的数据,主要是党的宣言。工作的一个问题是有限的时间框架。这项研究提出了一个系统,以协助VAA设计师制定,修改,并选择政策声明。使用预训练的语言模型和机器学习方法处理与政治相关的文本数据,该系统产生对应于相关VAA语句的一组建议。实验是使用日本的政党宣言和YouTube评论进行的,结合六个日本VAA和两个欧洲VAA的VAA政策声明。系统中使用的技术方法基于BERT语言模型,它以其捕获文档中单词上下文的能力而闻名。尽管系统的输出并没有完全消除人工评估的需要,它为更新VAA政策声明提供了宝贵的建议,即,无偏置,基础。
    The relevance and importance of voting advice applications (VAAs) are demonstrated by their popularity among potential voters. On average, around 30% of voters take into account the recommendations of these applications during elections. The comparison between potential voters\' and parties\' positions is made on the basis of VAA policy statements on which users are asked to express opinions. VAA designers devote substantial time and effort to analyzing domestic and international politics to formulate policy statements and select those to be included in the application. This procedure involves manually reading and evaluating a large volume of publicly available data, primarily party manifestos. A problematic part of the work is the limited time frame. This study proposes a system to assist VAA designers in formulating, revising, and selecting policy statements. Using pre-trained language models and machine learning methods to process politics-related textual data, the system produces a set of suggestions corresponding to relevant VAA statements. Experiments were conducted using party manifestos and YouTube comments from Japan, combined with VAA policy statements from six Japanese and two European VAAs. The technical approaches used in the system are based on the BERT language model, which is known for its capability to capture the context of words in the documents. Although the output of the system does not completely eliminate the need for manual human assessment, it provides valuable suggestions for updating VAA policy statements on an objective, i.e., bias-free, basis.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:本研究调查了TikTok平台内关于焦虑管理策略的论述,分析了一年来收集的45,639条评论的大量数据集。
    目的:主要目的是探索用户用来管理焦虑的各种策略,重点关注如何在TikTok上讨论和分享这些策略。
    方法:采用混合方法,将潜在狄利克雷分配(LDA)主题建模与定性分析相结合。这种方法允许确定九个不同的主题,进一步分为三大类:自我授权和应对策略,社区支持和社会连通性,以及识别和导航触发器。
    结果:分析揭示了用户用来管理焦虑的各种策略,跨越个人应对机制,社会支持网络,以及触发器的识别和缓解。这些发现强调了TikTok作为一个动态共享空间的角色,探索,并验证与焦虑管理相关的经验。
    结论:TikTok在焦虑的挑战中提供了独特的身份建构和社区支持机会。然而,这项研究承认局限性,例如基于关键词的数据收集中的潜在偏见以及在平台上捕获多模态话语的复杂性。结论强调需要进一步完善数字心理健康平台,呼吁复杂的算法解决方案,以增强用户支持和内容相关性。
    Background: This study investigates the discourse on anxiety management strategies within the TikTok platform, analyzing a substantial dataset of 45,639 comments collected over a year.
    Aims: The primary aim is to explore the various strategies users employ to manage anxiety, focusing on how these strategies are discussed and shared on TikTok.
    Methods: A mixed-method approach was utilized, combining Latent Dirichlet Allocation (LDA) for topic modeling with qualitative analysis. This methodology allowed for the identification of nine distinct topics, which were further grouped into three main categories: Self-Empowerment and Coping Strategies, Community Support and Social Connectivity, and Recognizing and Navigating Triggers.
    Results: The analysis revealed a diverse range of strategies users employ to manage anxiety, spanning personal coping mechanisms, social support networks, and the recognition and mitigation of triggers. These findings underscore TikTok\'s role as a dynamic space for sharing, exploring, and validating experiences related to anxiety management.
    Conclusions: TikTok offers unique opportunities for identity construction and community support amidst the challenges of anxiety. However, the study acknowledges limitations, such as potential biases in keyword-based data collection and the complexity of capturing multimodal discourse on the platform. The conclusion emphasizes the need for further refinement of digital mental health platforms, calling for sophisticated algorithmic solutions to enhance user support and content relevance.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    本研究调查了医疗器械行业监管事务的复杂性,影响市场准入和患者护理的关键因素。
    通过定性研究,我们寻求专家见解,以了解导致这种复杂性的因素。该研究涉及对来自医疗器械公司的28名专业人士的半结构化访谈,专门从事监管事务的各个方面。这些访谈是使用定性编码和自然语言处理(NLP)技术进行分析的。
    这些发现揭示了监管环境中复杂性的关键来源,分为五个领域:(1)监管语言复杂性,(2)监管过程中的复杂性,(3)全球层面的复杂性,(4)数据库相关的考虑,(5)产品层面的问题。
    与会者强调了简化监管合规战略的必要性,加强监管机构和行业参与者之间的互动,并为快速的技术进步开发适应性框架。强调跨学科合作和提高透明度,该研究得出结论,这些要素对于在医疗器械行业建立连贯和有效的监管程序至关重要。
    UNASSIGNED: This study investigates the complexity of regulatory affairs in the medical device industry, a critical factor influencing market access and patient care.
    UNASSIGNED: Through qualitative research, we sought expert insights to understand the factors contributing to this complexity. The study involved semi-structured interviews with 28 professionals from medical device companies, specializing in various aspects of regulatory affairs. These interviews were analyzed using a mix of qualitative coding and natural language processing (NLP) techniques.
    UNASSIGNED: The findings reveal key sources of complexity within the regulatory landscape, divided into five domains: (1) regulatory language complexity, (2) intricacies within the regulatory process, (3) global-level complexities, (4) database-related considerations, and (5) product-level issues.
    UNASSIGNED: The participants highlighted the need for strategies to streamline regulatory compliance, enhance interactions between regulatory bodies and industry players, and develop adaptable frameworks for rapid technological advancements. Emphasizing interdisciplinary collaboration and increased transparency, the study concludes that these elements are vital for establishing coherent and effective regulatory procedures in the medical device sector.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:心力衰竭(HF)是一项重大的全球临床和公共卫生挑战,影响全球6430万人。为了解决捐赠器官的稀缺问题,左心室辅助装置(LVAD)植入已成为治疗终末期HF的关键干预措施,作为心脏移植的桥梁或作为目的地治疗。基于网络的健康论坛,如MyLVAD.com,作为HF症状患者及其护理人员的可靠信息来源,起着至关重要的作用。
    目的:我们的目标是发现用户在MyLVAD.com网站上分享的帖子中潜在的主题。
    方法:使用潜在的Dirichlet分配算法和可视化工具,我们的目标是在MyLVAD.com网站上分享的帖子中发现潜在的主题。通过应用主题建模技术,我们分析了2015年至2023年LVAD接受者及其家庭成员撰写的459篇帖子.
    结果:这项研究揭示了LVAD患者及其家人关注的5个突出主题。这些主题包括家庭支持(39.5%的体重值),涵盖子主题,如家庭护理角色和情感或实际支持;服装(23.9%重量值),与舒适相关的子主题,正常状态,和功能;感染(18.2%体重值),涵盖传动系统感染,预防,和护理;功率(12%重量值),涉及与权力依赖相关的挑战;和自我护理维护,监测,和管理(6.3%重量值),其中包括血液测试等子主题,监测,警报,和设备管理。
    结论:这些发现有助于更好地了解植入LVAD患者的经历和需求,为医疗保健专业人员提供有价值的见解,以提供量身定制的支持和护理。通过使用潜在的Dirichlet分配来分析来自MyLVAD.com论坛的帖子,这项研究揭示了用户讨论的关键主题,促进改善患者护理和加强患者与提供者的沟通。
    BACKGROUND: Heart failure (HF) is a significant global clinical and public health challenge, impacting 64.3 million individuals worldwide. To address the scarcity of donor organs, left ventricular assist device (LVAD) implantation has become a crucial intervention for managing end-stage HF, serving as a bridge to heart transplantation or as a destination therapy. Web-based health forums, such as MyLVAD.com, play a vital role as trusted sources of information for individuals with HF symptoms and their caregivers.
    OBJECTIVE: We aim to uncover the latent topics within the posts shared by users on the MyLVAD.com website.
    METHODS: Using the latent Dirichlet allocation algorithm and a visualization tool, our objective was to uncover latent topics within the posts shared on the MyLVAD.com website. Through the application of topic modeling techniques, we analyzed 459 posts authored by recipients of LVAD and their family members from 2015 to 2023.
    RESULTS: This study unveiled 5 prominent themes of concern among patients with LVAD and their family members. These themes included family support (39.5% weight value), encompassing subthemes such as family caregiving roles and emotional or practical support; clothing (23.9% weight value), with subthemes related to comfort, normalcy, and functionality; infection (18.2% weight value), covering driveline infections, prevention, and care; power (12% weight value), involving challenges associated with power dependency; and self-care maintenance, monitoring, and management (6.3% weight value), which included subthemes such as blood tests, monitoring, alarms, and device management.
    CONCLUSIONS: These findings contribute to a better understanding of the experiences and needs of patients implanted with LVAD, providing valuable insights for health care professionals to offer tailored support and care. By using latent Dirichlet allocation to analyze posts from the MyLVAD.com forum, this study sheds light on key topics discussed by users, facilitating improved patient care and enhanced patient-provider communication.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    随着在线市场的快速增长,人们在购买产品时更多地依赖产品评论。因此,许多公司和研究人员对分析产品评论感兴趣,产品评论本质上是一种文本数据。在目前的文献中,通常只使用文本分析工具来分析文本数据集。但在我们的工作中,我们提出了一种方法,利用文本分析方法,如主题建模和统计网络模型,在个体之间建立网络,发现有趣的社区。我们引入了一个有前途的框架,该框架结合了主题建模技术来定义个体之间的边缘并形成网络,并使用随机块模型(SBM)来查找社区。我们提出的方法的强大功能已在Amazon产品评论数据集的实际应用中得到证明。
    As the online market grows rapidly, people are relying more on product review when they purchase the product. Hence, many companies and researchers are interested in analyzing product review which essentially a text data. In the current literature, it is common to use only text analysis tools to analyze text dataset. But in our work, we propose a method that utilizes both text analysis method such as topic modeling and statistical network model to build network among individuals and find interesting communities. We introduce a promising framework that incorporates topic modeling technique to define the edges among the individuals and form a network and uses stochastic blockmodels (SBM) to find the communities. The power of our proposed method is demonstrated in real-world application to Amazon product review dataset.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号