gmvjoint
gmvjoint
?gmvjoint
allows the user to fit joint models of survival
and multivariate longitudinal data, where the longitudinal sub-models
are specified by generalised linear mixed models (GLMMs). The joint
models are fit via maximum likelihood using an approximate EM algorithm
first proposed by Bernhardt et al. (2015). The GLMMs are
specified using the same syntax as for package glmmTMB
(Brooks et al., 2017). The joint models themselves are then the
flexible extensions to those in e.g. Wulfsohn and Tsiatis (1997). The
user is able to simulate data under many different response types.
Currently, five families can be fit: Gaussian; Poisson; binomial; Gamma and generalised Poisson.
The package in current incantation is relatively skeletal, as such not a lot of post-hoc analyses on fitted joint models is possible. As such, an immediate to-do list currently looks like
No promises are made w.r.t timescale of these being implemented: Currently I am a PhD student and little cache is awarded for production or maintenance of R packages!
To fit a joint model, we first need to specify the longitudinal and survival sub-models.
The longitudinal sub-model must be a list which
contains the specification of the longitudinal process along with its
random effects structure in the same syntax as a glmmTMB model
(which itself is the same as the widely-used lme4
). As an
example, suppose we want to fit a trivariate model on the oft-used PBC
data, with a linear time-drug interaction term on albumin, a spline term
on (logged) serum bilirubin and a linear fit on spiders, we specify
$serBilir <- log(PBC$serBilir)
PBC<- list(
long.formulas ~ drug * time + (1 + time|id),
albumin ~ drug * splines::ns(time, 3) + (1 + splines::ns(time, 3)|id),
serBilir ~ drug * time + (1|id)
spiders )
where we note interactions and spline-time fits are possible.
Currently, transformations on variables (e.g. log(Y)
) must
be done before this setup of formulae.
The survival sub-model must be set-up using Surv()
from
the survival
package e.g.
<- Surv(survtime, status) ~ drug surv.formula
Currently interaction terms in the survival sub-model specification are unsupported.
Now we can do the joint model call through the main workhorse
function joint
. This notably take a list of family
arguments which must match-up in the desired order as
the longitudinal process list. We call our fit
via
<- joint(long.formulas = long.formulas, surv.formula = surv.formula, data = PBC,
fit family = list("gaussian", "gaussian", "binomial"))
where extra control arguments are documented in ?joint
.
Numerous S3 methods exist for the class of object joint
creates, for example summary()
, logLik()
, and
fixef()
.
Bernhardt PW, Zhang D and Wang HJ. A fast EM Algorithm for Fitting Joint Models of a Binary Response to Multiple Longitudinal Covariates Subject to Detection Limits. Computational Statistics and Data Analysis 2015; 85; 37–53
Mollie E. Brooks, Kasper Kristensen, Koen J. van Benthem, Arni Magnusson, Casper W. Berg, Anders Nielsen, Hans J. Skaug, Martin Maechler and Benjamin M. Bolker (2017). glmmTMB Balances Speed and Flexibility Among Packages for Zero-inflated Generalized Linear Mixed Modeling. The R Journal, 9(2), 378-400.
Murray, J and Philipson P. A fast approximate EM algorithm for joint models of survival and multivariate longitudinal data. Computational Statistics and Data Analysis 2022
Wulfsohn MS, Tsiatis AA. A joint model for survival and longitudinal data measured with error. Biometrics. 1997; 53(1), 330-339.