IPD meta-analysis

Fitting one-stage IPD meta-analysis model

We demonstrate how to run a one-stage individual patient data (IPD) meta-analysis using this package. First, let’s generate sample IPD for illustration.

#devtools::install_github("MikeJSeo/bipd")
library(bipd) 

##load in data
ds <- generate_ipdma_example(type = "continuous")
ds2 <- generate_ipdma_example(type = "binary")
head(ds)
#>   studyid treat         z1         z2  y
#> 1       1     0  0.3184625 -0.1616561 11
#> 2       1     1 -1.1532349  0.9836978 10
#> 3       1     1  1.3808782  0.5557304  8
#> 4       1     1  0.6179567  0.5831149  9
#> 5       1     0 -1.7102353 -0.7682330 10
#> 6       1     1 -1.0360358 -0.9662960  8

The main function for setting up a one-stage IPD meta-analysis model is ipdma.model.onestage; refer to help(ipdma.model.onestage) for details. Briefly, “y” is the outcome of the study; “study” is a vector indicating which study each patient belongs to, coded as a numerical sequence (i.e. 1, 2, 3, etc.); “treat” is a vector indicating which treatment each patient was assigned to (i.e. 1 for treatment, 0 for placebo); “X” is a matrix of covariates for each patient; “response” is the outcome type, either “normal” or “binomial”.
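
The argument descriptions above translate into a few simple checks on the input data. The sketch below uses a small made-up data frame (`toy`, not generated by the package) and base R only. The checks are illustrative assumptions about what well-formed input looks like, not validation performed by the package itself.

```r
# Hypothetical toy data laid out like the example dataset (not produced by bipd)
toy <- data.frame(
  studyid = c(1, 1, 2, 2, 3, 3),
  treat   = c(0, 1, 0, 1, 1, 0),
  z1      = c(0.3, -1.2, 1.4, 0.6, -1.7, -1.0),
  z2      = c(-0.2, 1.0, 0.6, 0.6, -0.8, -1.0),
  y       = c(11, 10, 8, 9, 10, 8)
)

# study: labels should form the numerical sequence 1, 2, 3, ...
study_ids <- sort(unique(toy$studyid))
study_ok  <- all(study_ids == seq_along(study_ids))

# treat: 1 for treatment, 0 for placebo
treat_ok <- all(toy$treat %in% c(0, 1))

# X: one row of covariates per patient
X    <- with(toy, cbind(z1, z2))
x_ok <- is.matrix(X) && nrow(X) == length(toy$y)
```

If all three flags are TRUE, the columns can be passed to the model function in the same way as in the call below.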

Another important parameter is “shrinkage”, which controls whether shrinkage is applied to the effect modifiers (treatment-covariate interactions). To specify an IPD meta-analysis without shrinkage, we set shrinkage = “none”.

# continuous outcome
ipd <- with(ds, ipdma.model.onestage(y = y, study = studyid, treat = treat, X = cbind(z1, z2), response = "normal", shrinkage = "none"))

To view the JAGS code used to run the model, we can run the following command. Note that “alpha” is the study intercept, “beta” is the coefficient for the main effects of the covariates, “gamma” is the coefficient for the effect modifiers (treatment-covariate interactions), and “delta” is the average treatment effect.

cat(ipd$code)
#> model {
#> 
#> ########## IPD-MA model
#> for (i in 1:Np) {
#>  y[i] ~ dnorm(mu[i], sigma)
#>  mu[i] <- alpha[studyid[i]] + inprod(beta[], X[i,]) +
#>      (1 - equals(treat[i],1)) * inprod(gamma[], X[i,]) + d[studyid[i],treat[i]]
#> }
#> sigma ~ dgamma(0.001, 0.001)
#> 
#> #####treatment effect
#> for(j in 1:Nstudies){
#>  d[j,1] <- 0
#>  d[j,2] ~ dnorm(delta[2], tau)
#> }
#> sd ~ dnorm(0, 1)T(0,)
#> tau <- pow(sd, -2)
#> 
#> ## prior distribution for the average treatment effect
#> delta[1] <- 0
#> delta[2] ~ dnorm(0, 0.001)
#> 
#> ## prior distribution for the study intercept
#> for (j in 1:Nstudies){
#>  alpha[j] ~ dnorm(0, 0.001)
#> }
#> 
#> ## prior distribution for the main effect of the covariates
#> for(k in 1:Ncovariate){
#>  beta[k] ~ dnorm(0, 0.001)
#> }
#> ## prior distribution for the effect modifiers under no shrinkage
#> for(k in 1:Ncovariate){
#>  gamma[k] ~ dnorm(0, 0.001) 
#> }
#> }
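
One detail worth keeping in mind when reading this model: JAGS parameterizes `dnorm` by the precision (one over the variance), so `sigma` in `dnorm(mu[i], sigma)` and the `0.001` in `dnorm(0, 0.001)` are precisions, whereas R's `rnorm` takes a standard deviation. The base-R sketch below converts between the two and draws from the half-normal prior on `sd` by taking absolute values of standard normal draws. The helper names are hypothetical and not part of the package.

```r
# Hypothetical helpers (not part of bipd): convert between precision and sd
precision_to_sd <- function(prec) 1 / sqrt(prec)
sd_to_precision <- function(s) s^(-2)   # same as tau <- pow(sd, -2) in the model

# Standard deviation implied by the vague prior dnorm(0, 0.001); about 31.6
prior_sd <- precision_to_sd(0.001)

set.seed(1)
# sd ~ dnorm(0, 1)T(0,) is a half-normal: |Z| with Z standard normal
sd_draws  <- abs(rnorm(1000))
tau_draws <- sd_to_precision(sd_draws)  # implied heterogeneity precision
```

Reading the priors on the standard deviation scale makes it easier to judge how vague they are relative to the scale of the outcome.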

Once the model is set up with ipdma.model.onestage, we use the ipd.run function to run it. help(ipd.run) describes the parameters that can be specified.

samples <- ipd.run(ipd, n.chains = 3, n.burnin = 500, n.iter = 5000)
#> Compiling model graph
#>    Resolving undeclared variables
#>    Allocating nodes
#> Graph information:
#>    Observed stochastic nodes: 600
#>    Unobserved stochastic nodes: 19
#>    Total graph size: 6034
#> 
#> Initializing model

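samples <- samples[,-3] # drops the third monitored column, which here is alpha[3] (not delta[1]); see the summary below
# to drop the constant delta[1] instead, select the column by name:
# samples <- samples[, coda::varnames(samples) != "delta[1]"]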
summary(samples)
#> 
#> Iterations = 1501:6500
#> Thinning interval = 1 
#> Number of chains = 3 
#> Sample size per chain = 5000 
#> 
#> 1. Empirical mean and standard deviation for each variable,
#>    plus standard error of the mean:
#> 
#>             Mean      SD  Naive SE Time-series SE
#> alpha[1] 10.9537 0.05753 0.0004698      0.0008377
#> alpha[2]  8.0067 0.05818 0.0004750      0.0008475
#> alpha[4]  9.6032 0.05603 0.0004575      0.0007846
#> alpha[5] 12.9351 0.05992 0.0004892      0.0009440
#> alpha[6] 15.7736 0.05343 0.0004362      0.0006800
#> beta[1]   0.2142 0.02409 0.0001967      0.0003627
#> beta[2]   0.3195 0.02334 0.0001906      0.0003314
#> delta[1]  0.0000 0.00000 0.0000000      0.0000000
#> delta[2] -2.4927 0.19728 0.0016108      0.0017744
#> gamma[1] -0.5028 0.03304 0.0002698      0.0005001
#> gamma[2]  0.5978 0.03300 0.0002695      0.0004702
#> 
#> 2. Quantiles for each variable:
#> 
#>             2.5%     25%     50%     75%   97.5%
#> alpha[1] 10.8421 10.9143 10.9536 10.9924 11.0665
#> alpha[2]  7.8916  7.9675  8.0064  8.0460  8.1199
#> alpha[4]  9.4950  9.5654  9.6024  9.6416  9.7130
#> alpha[5] 12.8195 12.8939 12.9350 12.9758 13.0542
#> alpha[6] 15.6680 15.7371 15.7738 15.8097 15.8769
#> beta[1]   0.1674  0.1979  0.2140  0.2304  0.2612
#> beta[2]   0.2731  0.3035  0.3196  0.3353  0.3652
#> delta[1]  0.0000  0.0000  0.0000  0.0000  0.0000
#> delta[2] -2.8789 -2.6024 -2.4935 -2.3848 -2.0926
#> gamma[1] -0.5684 -0.5254 -0.5027 -0.4804 -0.4373
#> gamma[2]  0.5331  0.5757  0.5977  0.6199  0.6627
#plot(samples) #traceplot and posterior of parameters
#coda::gelman.plot(samples) #gelman diagnostic plot

We can obtain a patient-specific treatment effect using the treatment.effect function. To do this, we specify the covariate values of the patient for whom we want to predict the treatment effect.

treatment.effect(ipd, samples, newpatient = c(1,0.5))
#>     0.025       0.5     0.975 
#> -3.125380 -2.731708 -2.328367
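
Conceptually, under the model shown earlier, the treatment effect for a patient with covariate values x is the average treatment effect plus the effect-modifier terms, delta[2] + sum over k of gamma[k] * x[k], evaluated for each posterior draw. The sketch below illustrates that calculation on simulated draws (the numbers are made up, not output from the fitted model). It assumes the covariates are on the same scale as those used in the model. The package may preprocess covariates internally, so this is not expected to reproduce the treatment.effect output exactly.

```r
set.seed(42)
n_draws <- 2000

# Simulated stand-ins for posterior draws (hypothetical values)
delta2 <- rnorm(n_draws, mean = -2.5, sd = 0.2)
gamma  <- cbind(rnorm(n_draws, -0.5, 0.03),
                rnorm(n_draws,  0.6, 0.03))

# Covariate values of the new patient
new_x <- c(1, 0.5)

# One treatment-effect value per draw: delta2 + gamma %*% new_x
effect_draws <- delta2 + as.vector(gamma %*% new_x)

# Median and 95% interval, in the same layout as the output above
effect_summary <- quantile(effect_draws, probs = c(0.025, 0.5, 0.975))
```

Summarizing draw by draw, rather than plugging in posterior means, carries the joint uncertainty in delta and gamma through to the interval.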

Incorporating shrinkage and variable selection

For the second example, let’s use the same data, but apply shrinkage (i.e. Bayesian LASSO) to the effect modifiers (i.e. treatment-covariate interactions). We can specify Bayesian LASSO by setting shrinkage = “laplace”. Lambda is the shrinkage parameter, and we can set its prior using the lambda.prior parameter. The default lambda prior for Bayesian LASSO is λ⁻¹ ∼ dunif(0, 5).

ipd <- with(ds, ipdma.model.onestage(y = y, study = studyid, treat = treat, X = cbind(z1, z2), response = "normal", shrinkage = "laplace"))
samples <- ipd.run(ipd, pars.save = c("beta", "gamma", "delta", "lambda", "tt"), n.chains = 3, n.burnin = 500, n.iter = 5000)
#> Compiling model graph
#>    Resolving undeclared variables
#>    Allocating nodes
#> Graph information:
#>    Observed stochastic nodes: 600
#>    Unobserved stochastic nodes: 20
#>    Total graph size: 6039
#> 
#> Initializing model
summary(samples)
#> 
#> Iterations = 1501:6500
#> Thinning interval = 1 
#> Number of chains = 3 
#> Sample size per chain = 5000 
#> 
#> 1. Empirical mean and standard deviation for each variable,
#>    plus standard error of the mean:
#> 
#>             Mean      SD  Naive SE Time-series SE
#> beta[1]   0.2126 0.02458 0.0002007      0.0004191
#> beta[2]   0.3214 0.02329 0.0001902      0.0003668
#> delta[1]  0.0000 0.00000 0.0000000      0.0000000
#> delta[2] -2.4949 0.18841 0.0015383      0.0016602
#> gamma[1] -0.5004 0.03339 0.0002726      0.0006060
#> gamma[2]  0.5942 0.03308 0.0002701      0.0005555
#> lambda    0.3477 0.14606 0.0011926      0.0015335
#> tt        2.1763 0.91192 0.0074458      0.0094975
#> 
#> 2. Quantiles for each variable:
#> 
#>             2.5%     25%     50%     75%   97.5%
#> beta[1]   0.1635  0.1961  0.2125  0.2291  0.2605
#> beta[2]   0.2762  0.3055  0.3213  0.3371  0.3675
#> delta[1]  0.0000  0.0000  0.0000  0.0000  0.0000
#> delta[2] -2.8752 -2.6013 -2.4952 -2.3875 -2.1171
#> gamma[1] -0.5649 -0.5231 -0.5003 -0.4778 -0.4343
#> gamma[2]  0.5299  0.5717  0.5942  0.6167  0.6594
#> lambda    0.2040  0.2421  0.3029  0.4072  0.7325
#> tt        1.2457  1.5219  1.8999  2.5477  4.6009
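
The default prior λ⁻¹ ∼ dunif(0, 5) bounds λ from below at 1/5 = 0.2, which is consistent with the 2.5% quantile of `lambda` in the summary above sitting just over 0.2. A quick base-R simulation from the prior alone (no data involved) illustrates this:

```r
set.seed(123)

# Prior draws of 1 / lambda, then of lambda itself
inv_lambda <- runif(10000, min = 0, max = 5)
lambda     <- 1 / inv_lambda

# lambda cannot fall below 1/5 under this prior
lower_bound_holds <- min(lambda) >= 0.2

# The prior median of lambda is 1 / 2.5 = 0.4
median_lambda <- median(lambda)
```

The prior places no upper bound on λ, so very strong shrinkage remains possible when the data support it.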

We can also use SSVS (stochastic search variable selection) by setting shrinkage = “SSVS”. This time let’s use the binary-outcome dataset (ds2). “Ind” is the indicator for assigning a slab prior (instead of a spike prior), i.e. the indicator for including a covariate. “eta” is the standard deviation of the slab prior.

ipd <- with(ds2, ipdma.model.onestage(y = y, study = studyid, treat = treat, X = cbind(w1, w2), response = "binomial", shrinkage = "SSVS"))
samples <- ipd.run(ipd, pars.save = c("beta", "gamma", "delta", "Ind", "eta"), n.chains = 3, n.burnin = 500, n.iter = 5000)
#> Compiling model graph
#>    Resolving undeclared variables
#>    Allocating nodes
#> Graph information:
#>    Observed stochastic nodes: 600
#>    Unobserved stochastic nodes: 21
#>    Total graph size: 6649
#> 
#> Initializing model
summary(samples)
#> 
#> Iterations = 1501:6500
#> Thinning interval = 1 
#> Number of chains = 3 
#> Sample size per chain = 5000 
#> 
#> 1. Empirical mean and standard deviation for each variable,
#>    plus standard error of the mean:
#> 
#>               Mean      SD  Naive SE Time-series SE
#> Ind[1]    0.191467 0.39347 0.0032127       0.009454
#> Ind[2]    0.172733 0.37803 0.0030866       0.009903
#> beta[1]   0.029325 0.11695 0.0009549       0.001806
#> beta[2]   0.059431 0.11477 0.0009371       0.001574
#> delta[1]  0.000000 0.00000 0.0000000       0.000000
#> delta[2]  0.164182 0.27141 0.0022161       0.008372
#> eta       1.902760 1.47873 0.0120738       0.046271
#> gamma[1] -0.012050 0.10031 0.0008190       0.002010
#> gamma[2] -0.008618 0.09577 0.0007820       0.001811
#> 
#> 2. Quantiles for each variable:
#> 
#>              2.5%      25%       50%     75%  97.5%
#> Ind[1]    0.00000  0.00000  0.000000 0.00000 1.0000
#> Ind[2]    0.00000  0.00000  0.000000 0.00000 1.0000
#> beta[1]  -0.19659 -0.04907  0.027275 0.10455 0.2630
#> beta[2]  -0.16315 -0.01699  0.056502 0.13505 0.2870
#> delta[1]  0.00000  0.00000  0.000000 0.00000 0.0000
#> delta[2] -0.36621 -0.01386  0.165207 0.33513 0.7041
#> eta       0.03412  0.53564  1.637826 3.09967 4.7866
#> gamma[1] -0.25155 -0.04739 -0.001440 0.03001 0.1859
#> gamma[2] -0.21967 -0.04321 -0.001131 0.02990 0.1848
treatment.effect(ipd, samples, newpatient = c(1,0.5)) # binary outcome reports odds ratio
#>     0.025       0.5     0.975 
#> 0.6544364 1.1625977 2.0456160
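
Because “Ind” equals 1 when an interaction is drawn from the slab and 0 when it is drawn from the spike, its posterior mean can be read as a posterior inclusion probability (about 0.19 and 0.17 for the two covariates in the summary above). The sketch below shows that calculation on a made-up vector of indicator draws, together with prior draws from a spike-and-slab mixture. The spike standard deviation and the 0.5 prior inclusion probability are illustrative assumptions, not values taken from the package.

```r
set.seed(7)
n_draws <- 5000

# Made-up indicator draws: 1 = slab (covariate included), 0 = spike
ind_draws <- rbinom(n_draws, size = 1, prob = 0.2)
inclusion_prob <- mean(ind_draws)   # share of draws that chose the slab

# Prior draws from a spike-and-slab mixture (assumed settings)
spike_sd <- 0.01   # assumption: narrow spike around zero
slab_sd  <- 2      # plays the role of eta
ind_prior   <- rbinom(n_draws, size = 1, prob = 0.5)
gamma_prior <- rnorm(n_draws, mean = 0,
                     sd = ifelse(ind_prior == 1, slab_sd, spike_sd))
```

Draws assigned to the spike are concentrated near zero, which is how the prior effectively removes an interaction from the model.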

Fitting one-stage IPD network meta-analysis

We now demonstrate how to run IPD network meta-analysis using this package.

##load in data
ds <- generate_ipdnma_example(type = "continuous")
ds2 <- generate_ipdnma_example(type = "binary")
head(ds)
#>   studyid treat         z1           z2  y
#> 1       1     1  2.3629286 -0.016977060 12
#> 2       1     1 -0.6077092 -0.109212517 11
#> 3       1     1  1.1667807  1.032182162 11
#> 4       1     1  0.6557452  0.249371112 11
#> 5       1     1  1.3332085 -0.335346796 11
#> 6       1     2 -0.3454660  0.004405287  8

The main function for setting up a one-stage IPD network meta-analysis model is ipdnma.model.onestage. It is very similar to ipdma.model.onestage, except that there are now more than two treatments. Consequently, the “treat” parameter is defined differently: 1 denotes the baseline treatment, and the other treatments should be assigned 2, 3, 4, etc.
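
If the treatments in a dataset are stored as labels rather than integers, one way to obtain this coding in base R is to convert them to a factor with the baseline as the first level. The labels below are hypothetical.

```r
# Hypothetical treatment labels; "placebo" is taken as the baseline
treat_label <- c("drugA", "placebo", "drugB", "placebo", "drugA")

# Put the baseline first so that it is coded as 1
treat_levels <- c("placebo", "drugA", "drugB")
treat <- as.integer(factor(treat_label, levels = treat_levels))
treat
# 2 1 3 1 2
```

Setting the levels explicitly avoids relying on alphabetical ordering, which would otherwise decide which treatment becomes the baseline.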

# continuous outcome
ipd <- with(ds, ipdnma.model.onestage(y = y, study = studyid, treat = treat, X = cbind(z1, z2), response = "normal", shrinkage = "none"))
cat(ipd$code)
#> model {
#> 
#> ########## IPD-NMA model
#> for (i in 1:Np) {
#>  y[i] ~ dnorm(mu[i], sigma)
#>  mu[i] <- alpha[studyid[i]] + inprod(beta[], X[i,]) +
#>      inprod(gamma[treat[i],], X[i,]) + d[studyid[i],treatment.arm[i]]
#> }
#> sigma ~ dgamma(0.001, 0.001)
#> 
#> #####treatment effect
#> for(i in 1:Nstudies){
#>  w[i,1] <- 0
#>  d[i,1] <- 0
#>  for(k in 2:na[i]){
#>      d[i,k] ~ dnorm(mdelta[i,k], taudelta[i,k])
#>      mdelta[i,k] <-  delta[t[i,k]] - delta[t[i,1]] + sw[i,k]
#>      taudelta[i,k] <- tau * 2 * (k-1)/k
#>      w[i,k] <- d[i,k] - delta[t[i,k]] + delta[t[i,1]]
#>      sw[i,k] <- sum(w[i, 1:(k-1)]) / (k-1)
#>  }
#> }
#> sd ~ dnorm(0, 1)T(0,)
#> tau <- pow(sd, -2)
#> 
#> ## prior distribution for the average treatment effect
#> delta[1] <- 0
#> for(k in 2:Ntreat){
#>  delta[k] ~ dnorm(0, 0.001)
#> }
#> 
#> ## prior distribution for the study intercept
#> for (j in 1:Nstudies){
#>  alpha[j] ~ dnorm(0, 0.001)
#> }
#> 
#> ## prior distribution for the main effect of the covariates
#> for(k in 1:Ncovariate){
#>  beta[k] ~ dnorm(0, 0.001)
#> }
#> ## prior distribution for the effect modifiers under no shrinkage
#> for(k in 1:Ncovariate){
#>  gamma[1,k] <- 0
#>  for(m in 2:Ntreat){
#>      gamma[m,k] ~ dnorm(0, 0.001) 
#>  }
#> }
#> }
samples <- ipd.run(ipd,  pars.save = c("beta", "gamma", "delta"), n.chains = 3, n.burnin = 500, n.iter = 5000)
#> Compiling model graph
#>    Resolving undeclared variables
#>    Allocating nodes
#> Graph information:
#>    Observed stochastic nodes: 1000
#>    Unobserved stochastic nodes: 33
#>    Total graph size: 10133
#> 
#> Initializing model
summary(samples)
#> 
#> Iterations = 1501:6500
#> Thinning interval = 1 
#> Number of chains = 3 
#> Sample size per chain = 5000 
#> 
#> 1. Empirical mean and standard deviation for each variable,
#>    plus standard error of the mean:
#> 
#>               Mean      SD  Naive SE Time-series SE
#> beta[1]     0.2186 0.01923 0.0001570      0.0003296
#> beta[2]     0.2982 0.02061 0.0001682      0.0003865
#> delta[1]    0.0000 0.00000 0.0000000      0.0000000
#> delta[2]   -2.9177 0.06032 0.0004926      0.0009149
#> delta[3]   -1.1348 0.06395 0.0005222      0.0009893
#> gamma[1,1]  0.0000 0.00000 0.0000000      0.0000000
#> gamma[2,1] -0.5872 0.02777 0.0002268      0.0004334
#> gamma[3,1] -0.3054 0.02954 0.0002412      0.0004433
#> gamma[1,2]  0.0000 0.00000 0.0000000      0.0000000
#> gamma[2,2]  0.5779 0.02891 0.0002361      0.0004927
#> gamma[3,2]  0.4087 0.02928 0.0002390      0.0004714
#> 
#> 2. Quantiles for each variable:
#> 
#>               2.5%     25%     50%     75%   97.5%
#> beta[1]     0.1807  0.2058  0.2189  0.2316  0.2566
#> beta[2]     0.2574  0.2844  0.2984  0.3123  0.3381
#> delta[1]    0.0000  0.0000  0.0000  0.0000  0.0000
#> delta[2]   -3.0387 -2.9553 -2.9179 -2.8797 -2.7968
#> delta[3]   -1.2603 -1.1742 -1.1355 -1.0950 -1.0063
#> gamma[1,1]  0.0000  0.0000  0.0000  0.0000  0.0000
#> gamma[2,1] -0.6413 -0.6060 -0.5872 -0.5684 -0.5333
#> gamma[3,1] -0.3638 -0.3252 -0.3056 -0.2852 -0.2485
#> gamma[1,2]  0.0000  0.0000  0.0000  0.0000  0.0000
#> gamma[2,2]  0.5213  0.5583  0.5777  0.5975  0.6351
#> gamma[3,2]  0.3521  0.3887  0.4084  0.4281  0.4666
treatment.effect(ipd, samples, newpatient = c(1,0.5))
#> $`treatment 2`
#>     0.025       0.5     0.975 
#> -3.367613 -3.235158 -3.099755 
#> 
#> $`treatment 3`
#>     0.025       0.5     0.975 
#> -1.389420 -1.251320 -1.109692
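
The basic parameters delta[2] and delta[3] are defined relative to the baseline treatment (delta[1] is fixed at 0). Under the consistency relation used in the model code (`delta[t[i,k]] - delta[t[i,1]]`), the average effect of treatment 3 relative to treatment 2 is delta[3] - delta[2], computed draw by draw. The sketch below uses simulated draws with made-up values rather than the fitted object, and covers only the average effects, not the covariate interactions.

```r
set.seed(2024)
n_draws <- 3000

# Simulated stand-ins for posterior draws (made-up values)
delta2 <- rnorm(n_draws, mean = -2.9, sd = 0.06)
delta3 <- rnorm(n_draws, mean = -1.1, sd = 0.06)

# Treatment 3 versus treatment 2, one value per draw
contrast_32 <- delta3 - delta2

# Median and 95% interval of the contrast
contrast_summary <- quantile(contrast_32, probs = c(0.025, 0.5, 0.975))
```

With real output, the subtraction should use the two columns from the same set of draws so that any posterior correlation between delta[2] and delta[3] is preserved. The independent draws here are only for illustration.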

We can apply shrinkage on the effect modifiers (treatment-covariate interactions) as before.

# SSVS
ipd <- with(ds, ipdnma.model.onestage(y = y, study = studyid, treat = treat, X = cbind(z1, z2), response = "normal", shrinkage = "SSVS"))
samples <- ipd.run(ipd,  pars.save = c("beta", "gamma", "delta", "Ind", "eta"), n.chains = 3, n.burnin = 500, n.iter = 5000)
#> Compiling model graph
#>    Resolving undeclared variables
#>    Allocating nodes
#> Graph information:
#>    Observed stochastic nodes: 1000
#>    Unobserved stochastic nodes: 38
#>    Total graph size: 10155
#> 
#> Initializing model
summary(samples)
#> 
#> Iterations = 1501:6500
#> Thinning interval = 1 
#> Number of chains = 3 
#> Sample size per chain = 5000 
#> 
#> 1. Empirical mean and standard deviation for each variable,
#>    plus standard error of the mean:
#> 
#>               Mean      SD  Naive SE Time-series SE
#> Ind[2,1]    0.9995 0.02160 0.0001763      0.0002421
#> Ind[3,1]    0.9733 0.16111 0.0013155      0.0077224
#> Ind[2,2]    0.9995 0.02160 0.0001763      0.0002185
#> Ind[3,2]    0.9897 0.10081 0.0008231      0.0030240
#> beta[1]     0.2177 0.01933 0.0001578      0.0003317
#> beta[2]     0.3003 0.02045 0.0001670      0.0003830
#> delta[1]    0.0000 0.00000 0.0000000      0.0000000
#> delta[2]   -2.9163 0.05901 0.0004818      0.0008865
#> delta[3]   -1.1341 0.06289 0.0005135      0.0009388
#> eta         0.8680 0.76515 0.0062474      0.0382971
#> gamma[1,1]  0.0000 0.00000 0.0000000      0.0000000
#> gamma[2,1] -0.5861 0.02775 0.0002266      0.0004309
#> gamma[3,1] -0.3035 0.02969 0.0002424      0.0004498
#> gamma[1,2]  0.0000 0.00000 0.0000000      0.0000000
#> gamma[2,2]  0.5747 0.02869 0.0002343      0.0004873
#> gamma[3,2]  0.4059 0.02930 0.0002392      0.0004893
#> 
#> 2. Quantiles for each variable:
#> 
#>               2.5%     25%     50%     75%   97.5%
#> Ind[2,1]    1.0000  1.0000  1.0000  1.0000  1.0000
#> Ind[3,1]    0.0000  1.0000  1.0000  1.0000  1.0000
#> Ind[2,2]    1.0000  1.0000  1.0000  1.0000  1.0000
#> Ind[3,2]    1.0000  1.0000  1.0000  1.0000  1.0000
#> beta[1]     0.1795  0.2048  0.2178  0.2306  0.2562
#> beta[2]     0.2604  0.2863  0.3005  0.3141  0.3404
#> delta[1]    0.0000  0.0000  0.0000  0.0000  0.0000
#> delta[2]   -3.0335 -2.9534 -2.9166 -2.8783 -2.8004
#> delta[3]   -1.2568 -1.1736 -1.1348 -1.0944 -1.0076
#> eta         0.3163  0.4823  0.6389  0.9161  3.8998
#> gamma[1,1]  0.0000  0.0000  0.0000  0.0000  0.0000
#> gamma[2,1] -0.6406 -0.6047 -0.5860 -0.5673 -0.5323
#> gamma[3,1] -0.3621 -0.3238 -0.3033 -0.2835 -0.2454
#> gamma[1,2]  0.0000  0.0000  0.0000  0.0000  0.0000
#> gamma[2,2]  0.5186  0.5552  0.5747  0.5943  0.6308
#> gamma[3,2]  0.3483  0.3862  0.4060  0.4257  0.4630
treatment.effect(ipd, samples, newpatient = c(1,0.5))
#> $`treatment 2`
#>     0.025       0.5     0.975 
#> -3.364734 -3.233957 -3.100706 
#> 
#> $`treatment 3`
#>     0.025       0.5     0.975 
#> -1.386936 -1.249602 -1.108464
# Bayesian LASSO  
# ipd <- with(ds, ipdnma.model.onestage(y = y, study = studyid, treat = treat, X = cbind(z1, z2), response = "normal", shrinkage = "laplace", lambda.prior = list("dgamma",2,0.1)))
#samples <- ipd.run(ipd, pars.save = c("beta", "gamma", "delta", "lambda", "tt"), n.chains = 3, n.burnin = 500, n.iter = 5000)

p.s. Note that in the network meta-analysis literature, “d” usually refers to the average treatment effect and “delta” to the study-specific treatment effect. In this R package, the two notations are swapped.