%0 Journal Article %T Natural language processing pipeline to extract prostate cancer-related information from clinical notes. %A Nakai H %A Suman G %A Adamo DA %A Navin PJ %A Bookwalter CA %A LeGout JD %A Chen FK %A Wellnitz CV %A Silva AC %A Thomas JV %A Kawashima A %A Fan JW %A Froemming AT %A Lomas DJ %A Humphreys MR %A Dora C %A Korfiatis P %A Takahashi N %J Eur Radiol %V 0 %N 0 %D 2024 Jun 6 %M 38842692 %F 7.034 %R 10.1007/s00330-024-10812-6 %X OBJECTIVE: To develop an automated pipeline for extracting prostate cancer-related information from clinical notes.
METHODS: This retrospective study included 23,225 patients who underwent prostate MRI between 2017 and 2022. Cancer risk factors (family history of cancer and digital rectal exam findings), pre-MRI prostate pathology, and treatment history of prostate cancer were extracted from free-text clinical notes in English as binary or multi-class classification tasks. Any sentence containing pre-defined keywords was extracted from clinical notes within one year before the MRI. After manually creating sentence-level datasets with ground truth, Bidirectional Encoder Representations from Transformers (BERT)-based sentence-level models were fine-tuned using the extracted sentence as input and the category as output. The patient-level output was determined by compilation of multiple sentence-level outputs using tree-based models. Sentence-level classification performance was evaluated using the area under the receiver operating characteristic curve (AUC) on 15% of the sentence-level dataset (sentence-level test set). The patient-level classification performance was evaluated on the patient-level test set created by radiologists by reviewing the clinical notes of 603 patients. Accuracy and sensitivity were compared between the pipeline and radiologists.
RESULTS: Sentence-level AUCs were ≥ 0.94. The pipeline showed higher patient-level sensitivity for extracting cancer risk factors (e.g., family history of prostate cancer, 96.5% vs. 77.9%, p < 0.001), but lower accuracy in classifying pre-MRI prostate pathology (92.5% vs. 95.9%, p = 0.002) and treatment history of prostate cancer (95.5% vs. 97.7%, p = 0.03) than radiologists, respectively.
CONCLUSIONS: The proposed pipeline showed promising performance, especially for extracting cancer risk factors from patient's clinical notes.
CONCLUSIONS: The natural language processing pipeline showed a higher sensitivity for extracting prostate cancer risk factors than radiologists and may help efficiently gather relevant text information when interpreting prostate MRI.
CONCLUSIONS: When interpreting prostate MRI, it is necessary to extract prostate cancer-related information from clinical notes. This pipeline extracted the presence of prostate cancer risk factors with higher sensitivity than radiologists. Natural language processing may help radiologists efficiently gather relevant prostate cancer-related text information.