TY - JOUR
T1 - Jointly modelling single nucleotide polymorphisms with longitudinal and time-to-event trait
T2 - An application to type 2 diabetes and fasting plasma glucose
AU - Canouil, Mickaël
AU - Balkau, Beverley
AU - Roussel, Ronan
AU - Froguel, Philippe
AU - Rocheleau, Ghislain
N1 - Publisher Copyright:
© 2018 Canouil, Balkau, Roussel, Froguel and Rocheleau.
PY - 2018/6/14
Y1 - 2018/6/14
N2 - In observational cohorts, longitudinal data are collected with repeated measurements at predetermined time points for many biomarkers, along with other variables measured at baseline. In these cohorts, time until a certain event of interest occurs is reported and very often, a relationship will be observed between some biomarker repeatedly measured over time and that event. Joint models were designed to efficiently estimate statistical parameters describing this relationship by combining a mixed model for the longitudinal biomarker trajectory and a survival model for the time until occurrence of the event, using a set of random effects to account for the relationship between the two types of data. In this paper, we discuss the implementation of joint models in genetic association studies. First, we check model consistency based on different simulation scenarios, by varying sample sizes, minor allele frequencies and number of repeated measurements. Second, using genotypes assayed with the Metabochip DNA arrays (Illumina) from about 4,500 individuals recruited in the French cohort D.E.S.I.R. (Data from an Epidemiological Study on the Insulin Resistance syndrome), we assess the feasibility of implementing the joint modelling approach in a real high-throughput genomic dataset. An alternative model approximating the joint model, called the Two-Step approach (TS), is also presented. Although the joint model shows more precise and less biased estimators than its alternative counterpart, the TS approach results in much reduced computational times, and could thus be used for testing millions of SNPs at the genome-wide scale.
AB - In observational cohorts, longitudinal data are collected with repeated measurements at predetermined time points for many biomarkers, along with other variables measured at baseline. In these cohorts, time until a certain event of interest occurs is reported and very often, a relationship will be observed between some biomarker repeatedly measured over time and that event. Joint models were designed to efficiently estimate statistical parameters describing this relationship by combining a mixed model for the longitudinal biomarker trajectory and a survival model for the time until occurrence of the event, using a set of random effects to account for the relationship between the two types of data. In this paper, we discuss the implementation of joint models in genetic association studies. First, we check model consistency based on different simulation scenarios, by varying sample sizes, minor allele frequencies and number of repeated measurements. Second, using genotypes assayed with the Metabochip DNA arrays (Illumina) from about 4,500 individuals recruited in the French cohort D.E.S.I.R. (Data from an Epidemiological Study on the Insulin Resistance syndrome), we assess the feasibility of implementing the joint modelling approach in a real high-throughput genomic dataset. An alternative model approximating the joint model, called the Two-Step approach (TS), is also presented. Although the joint model shows more precise and less biased estimators than its alternative counterpart, the TS approach results in much reduced computational times, and could thus be used for testing millions of SNPs at the genome-wide scale.
KW - Diabetes
KW - Genetic
KW - Glycaemia
KW - Joint modelling
KW - Longitudinal biomarker
KW - Survival analysis
UR - http://www.scopus.com/inward/record.url?scp=85048636385&partnerID=8YFLogxK
U2 - 10.3389/fgene.2018.00210
DO - 10.3389/fgene.2018.00210
M3 - Article
AN - SCOPUS:85048636385
SN - 1664-8021
VL - 9
JO - Frontiers in Genetics
JF - Frontiers in Genetics
IS - JUN
M1 - 210
ER -