Selection Bias from Data Processing in N3C

N3C Consortium

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This study investigates potential selection bias in outcome prediction within the National COVID Cohort Collaborative (N3C) resulting from arbitrarily made decisions. In the processing of health data, decisions regarding cohort criteria and variable selection are often arbitrarily made, potentially introducing selection bias. This work explores if such decisions affect results of data analysis and potential conclusions of research studies. An experiment is conducted in which four arbitrary decisions are made. Results demonstrate significant differences in the obtained datasets and indicate a high potential for bias based on inclusion or exclusion decisions. The findings contribute to informed healthcare policies, better decision-making, and improved patient outcomes, emphasizing the necessity for testing assumptions and decisions in ongoing research that uses clinical data.

Original languageEnglish
Title of host publicationProceedings - 2024 IEEE 12th International Conference on Healthcare Informatics, ICHI 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages234-241
Number of pages8
ISBN (Electronic)9798350383737
DOIs
StatePublished - 2024
Event12th IEEE International Conference on Healthcare Informatics, ICHI 2024 - Orlando, United States
Duration: 3 Jun 20246 Jun 2024

Publication series

NameProceedings - 2024 IEEE 12th International Conference on Healthcare Informatics, ICHI 2024

Conference

Conference12th IEEE International Conference on Healthcare Informatics, ICHI 2024
Country/TerritoryUnited States
CityOrlando
Period3/06/246/06/24

Keywords

  • Data Processing
  • National COVID Cohort Collaborative (N3C)
  • Prediction
  • Selection Bias

Cite this