Utilizing Open-Source Large Language Models to Extract Genitourinary Symptoms from Clinical Notes

  • Yunbing Bai
  • , Wanting Cui
  • , Joseph Finkelstein

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Accurately identifying patient signs and symptoms from clinical notes is essential for effective diagnosis, treatment planning, and medical research. In this study, we evaluated the performance of the Meta Llama model in extracting signs and symptoms related to the genitourinary system, along with their corresponding ICD-10 codes, from urological clinical notes in the MTSamples dataset. The dataset was manually annotated to compare the extraction results of large language models (LLMs) output. We utilized Llama 3.3-70B and performed prompt engineering. The findings suggest that the best performance was achieved when the prompt included a predefined list of definitions of corresponding ICD-10 codes and restricted the model from making assumptions. Under these conditions, Llama 3.3-70B achieved an average recall of 0.96, precision of 0.89, and F1-score of 0.92 for S&S extraction, as well as an average recall of 0.93, precision of 0.85, and F1-score of 0.89 for ICD-10 code generation.

Original languageEnglish
Title of host publicationGlobal Healthcare Transformation in the Era of Artificial Intelligence and Informatics
EditorsJohn Mantas, Arie Hasman, Parisis Gallos, Emmanouil Zoulias, Konstantinos Karitis
PublisherIOS Press BV
Pages16-20
Number of pages5
ISBN (Electronic)9781643686004
DOIs
StatePublished - 26 Jun 2025
Externally publishedYes
Event23rd Annual International Conference on Informatics, Management, and Technology in Healthcare, ICIMTH 2025 - Athens, Greece
Duration: 4 Jul 20256 Jul 2025

Publication series

NameStudies in Health Technology and Informatics
Volume328
ISSN (Print)0926-9630
ISSN (Electronic)1879-8365

Conference

Conference23rd Annual International Conference on Informatics, Management, and Technology in Healthcare, ICIMTH 2025
Country/TerritoryGreece
CityAthens
Period4/07/256/07/25

Keywords

  • Large Language Models
  • Llama Models
  • Natural Language Processing
  • Symptom Extraction

Fingerprint

Dive into the research topics of 'Utilizing Open-Source Large Language Models to Extract Genitourinary Symptoms from Clinical Notes'. Together they form a unique fingerprint.

Cite this