Surprising and novel multivariate sequential patterns using odds ratio for temporal evolution in healthcare

Isidoro J. Casanova, Manuel Campos, Jose M. Juarez, Antonio Gomariz, Bernardo Canovas-Segura, Marta Lorente-Ros, Jose A. Lorente

Research output: Contribution to journalArticlepeer-review

Abstract

Background: Pattern mining techniques are helpful tools when extracting new knowledge in real practice, but the overwhelming number of patterns is still a limiting factor in the health-care domain. Current efforts concerning the definition of measures of interest for patterns are focused on reducing the number of patterns and quantifying their relevance (utility/usefulness). However, although the temporal dimension plays a key role in medical records, few efforts have been made to extract temporal knowledge about the patient’s evolution from multivariate sequential patterns. Methods: In this paper, we propose a method to extract a new type of patterns in the clinical domain called Jumping Diagnostic Odds Ratio Sequential Patterns (JDORSP). The aim of this method is to employ the odds ratio to identify a concise set of sequential patterns that represent a patient’s state with a statistically significant protection factor (i.e., a pattern associated with patients that survive) and those extensions whose evolution suddenly changes the patient’s clinical state, thus making the sequential patterns a statistically significant risk factor (i.e., a pattern associated with patients that do not survive), or vice versa. Results: The results of our experiments highlight that our method reduces the number of sequential patterns obtained with state-of-the-art pattern reduction methods by over 95%. Only by achieving this drastic reduction can medical experts carry out a comprehensive clinical evaluation of the patterns that might be considered medical knowledge regarding the temporal evolution of the patients. We have evaluated the surprisingness and relevance of the sequential patterns with clinicians, and the most interesting fact is the high surprisingness of the extensions of the patterns that become a protection factor, that is, the patients that recover after several days of being at high risk of dying. Conclusions: Our proposed method with which to extract JDORSP generates a set of interpretable multivariate sequential patterns with new knowledge regarding the temporal evolution of the patients. The number of patterns is greatly reduced when compared to those generated by other methods and measures of interest. An additional advantage of this method is that it does not require any parameters or thresholds, and that the reduced number of patterns allows a manual evaluation.

Original languageEnglish
Article number165
JournalBMC Medical Informatics and Decision Making
Volume24
Issue number1
DOIs
StatePublished - Dec 2024
Externally publishedYes

Keywords

  • Burn units
  • Data mining
  • Discriminative patterns
  • Interestingness measures
  • Knowledge discovery in databases
  • Odds ratio
  • Sequential patterns

Fingerprint

Dive into the research topics of 'Surprising and novel multivariate sequential patterns using odds ratio for temporal evolution in healthcare'. Together they form a unique fingerprint.

Cite this