{Reference Type}: Journal Article
{Title}: Identifying symptom etiologies using syntactic patterns and large language models.
{Author}: Taub-Tabib H;Shamay Y;Shlain M;Pinhasov M;Polak M;Tiktinsky A;Rahamimov S;Bareket D;Eyal B;Kassis M;Goldberg Y;Kaminski Rosenberg T;Vulfsons S;Ben Sasson M;
{Journal}: Sci Rep
{Volume}: 14
{Issue}: 1
{Year}: 2024 07 13
{Factor}: 4.996
{DOI}: 10.1038/s41598-024-65645-6
{Abstract}: Differential diagnosis is a crucial aspect of medical practice, as it guides clinicians to accurate diagnoses and effective treatment plans. Traditional resources, such as medical books and services like UpToDate, are constrained by manual curation, potentially missing out on novel or less common findings. This paper introduces and analyzes two novel methods to mine etiologies from scientific literature. The first method employs a traditional Natural Language Processing (NLP) approach based on syntactic patterns. By using a novel application of human-guided pattern bootstrapping patterns are derived quickly, and symptom etiologies are extracted with significant coverage. The second method utilizes generative models, specifically GPT-4, coupled with a fact verification pipeline, marking a pioneering application of generative techniques in etiology extraction. Analyzing this second method shows that while it is highly precise, it offers lesser coverage compared to the syntactic approach. Importantly, combining both methodologies yields synergistic outcomes, enhancing the depth and reliability of etiology mining.