Package 'NCC'

Title: Simulation and Analysis of Platform Trials with Non-Concurrent Controls
Description: Design and analysis of flexible platform trials with non-concurrent controls. Functions for data generation, analysis, visualization and running simulation studies are provided. The implemented analysis methods are described in: Bofill Roig et al. (2022) <doi:10.1186/s12874-022-01683-w>, Saville et al. (2022) <doi:10.1177/17407745221112013> and Schmidli et al. (2014) <doi:10.1111/biom.12242>.
Authors: Pavla Krotka [aut, cre] , Marta Bofill Roig [aut, ths] , Katharina Hees [aut], Peter Jacko [aut], Dominic Magirr [aut], Martin Posch [ctb]
Maintainer: Pavla Krotka <[email protected]>
License: MIT + file LICENSE
Version: 1.0
Built: 2025-01-02 06:52:27 UTC
Source: CRAN

Help Index


Simulate binary data from a platform trial with a shared control arm and a given number of experimental treatment arms entering at given time points

Description

This function simulates data from a platform trial with a given number of experimental treatment arms entering at given time points and a shared control arm. The primary endpoint is a binary endpoint. The user specifies the timing of adding arms in terms of patients recruited to the trial so far and the sample size per experimental treatment arm.

Usage

datasim_bin(
  num_arms,
  n_arm,
  d,
  period_blocks = 2,
  p0,
  OR,
  lambda,
  trend,
  N_peak,
  n_wave,
  full = FALSE,
  check = TRUE
)

Arguments

num_arms

Integer. Number of experimental treatment arms in the trial.

n_arm

Integer. Sample size per experimental treatment arm (assumed equal).

d

Integer vector with timings of adding new arms in terms of number of patients recruited to the trial so far. The first entry must be 0, so that the trial starts with at least one experimental treatment arm, and the entries must be non-decreasing. The vector length equals num_arms.

period_blocks

Integer. Number to define the size of the blocks for the block randomization. The block size in each period equals period_blockstimes the number of active arms in the period (see Details). Default=2.

p0

Double. Response probability in the control arm.

OR

Double vector with treatment effects in terms of odds ratios for each experimental treatment arm compared to control. The elements of the vector (odds ratios) are ordered by the entry time of the experimental treatment arms (e.g., the first entry in the vector corresponds to the odds ratio of the first experimental treatment arm). The vector length equals num_arms.

lambda

Double vector with strength of time trend in each arm ordered by the entry time of the arms (e.g., the first entry in the vector corresponds to the time trend in the control arm, second entry to the time trend in the first experimental treatment arm). The vector length equals num_arms+1, as time trend in the control is also allowed.

trend

String indicating the time trend pattern ("linear", "linear_2, "stepwise", "stepwise_2", "inv_u" or "seasonal"). See Details for more information.

N_peak

Integer. Timepoint at which the inverted-u time trend switches direction in terms of overall sample size (i.e. after how many recruited participants the trend direction switches).

n_wave

Integer. Number of cycles (waves) should the seasonal trend have.

full

Logical. Indicates whether the output should be in form of a data frame with variables needed for the analysis only (FALSE) or in form of a list containing more information (TRUE). Default=FALSE.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

Details

Design assumptions:

  • The simulated platform trial consists of a given number of experimental treatment arms (specified by the argument num_arms) and one control arm that is shared across the whole platform.

  • Participants are indexed by entry order, assuming that at each time unit exactly one participant is recruited and the time of recruitment and observation of the response are equal.

  • All participants are assumed to be eligible for all arms in the trial, i.e. the same inclusion and exclusion criteria apply to all experimental and control arms.

  • Equal sample sizes (given by parameter n_arm) in all experimental treatment arms are assumed.

  • The duration of the trial is divided into so-called periods, defined as time intervals bounded by distinct time points of any treatment arm entering or leaving the platform. Hence, multiple treatment arms entering or leaving at the same time point imply the start of only one additional period.

  • Allocation ratio of 1:1:...:1 in each period. Furthermore, block randomization is used to assign patients to the active arms. Block size in each period = period_blocks* (number of active arms in the period).

  • If the period sample size is not a multiple of the block size, arms for the remaining participants are chosen by sampling without replacement from a vector containing the indices of active arms replicated times ⁠ceiling(remaining sample size/number of active arms)⁠.

Data generation:

The binary response yjy_j for patient jj is generated according to:

g(E(yj))=η0+k=1KI(kj=k)+f(j)g(E(y_j)) = \eta_0 + \sum_{k=1}^K \cdot I(k_j=k) + f(j)

where g()g(\cdot) is the logit link function, and η0\eta_0 (logit function of parameter p0) and θk\theta_k (log of the parameter OR) are the log odds in the control arm and the log odds ratio of treatment kk. KK is the total number of treatment arms in the trial (parameter num_arms) and kjk_j is an indicator of the treatment arm patient jj is allocated to.

The function f(j)f(j) denotes the time trend, whose strength is indicated by λkj\lambda_{k_j} (parameter lambda) and which can have the following patterns (parameter trend):

  • "linear" - trend starts at the beginning of the trial and the log odds increases or decreases linearly with a slope of λ\lambda, according to the function f(j)=λj1N1f(j) = \lambda \cdot \frac{j-1}{N-1}, where NN is the total sample size in the trial

  • "linear_2" - trend starts after the first period (i.e. there is no time trend in the first period) and the log odds increases or decreases linearly with a slope of λ\lambda, according to the function f(j)=λj1N1f(j) = \lambda \cdot \frac{j-1}{N-1}, where NN is the total sample size in the trial

  • "stepwise" - the log odds is constant in each period and increases or decreases by λ\lambda each time any treatment arm enters or leaves the trial (i.e. in each period), according to the function f(j)=λkj(cj1)f(j) = \lambda_{k_j} \cdot (c_j - 1), where cjc_j is an index of the period patient jj was enrolled in

  • "stepwise_2" - the log odds is constant in each period and increases or decreases by λ\lambda each time a new treatment arm is added to the trial, according to the function f(j)=λkj(wj1)f(j) = \lambda_{k_j} \cdot (w_j - 1), where wjw_j is an indicator of how many treatment arms have already entered the ongoing trial, when patient jj was enrolled

  • "inv_u" - the log odds increases up to the point NpN_p (parameter N_peak) and decreases afterwards, linearly with a slope of λ\lambda, according to the function f(j)=λj1N1(I(jNp)I(j>Np))f(j) = \lambda \cdot \frac{j-1}{N-1} (I(j \leq N_p) - I(j > N_p)), where NpN_p indicates the point at which the trend turns from positive to negative in terms of the sample size (note that for negative λ\lambda, the log odds ratio decreases first and increases after)

  • "seasonal" - the log odds increases and decreases periodically with a magnitude of λ\lambda, according to the function f(j)=λsin(ψ2πj1N1)f(j) = \lambda \cdot \mathrm{sin} \big( \psi \cdot 2\pi \cdot \frac{j-1}{N-1} \big), where ψ\psi indicates how many cycles should the time trend have (parameter n_wave)

Trials with no time trend can be simulated too, by setting all elements of the vector lambda to zero and choosing an arbitrary pattern.

Value

Data frame: simulated trial data (if full=FALSE, i.e. default) with the following columns:

  • j - patient recruitment index

  • response - binary response for patient j

  • treatment- index of the treatment patient j was allocated to

  • period - index of the period patient j was recruited in

or List (if full=TRUE) containing the following elements:

  • Data - simulated trial data, including an additional column p with the probability used for simulating the response for patient j

  • n_total - total sample size in the trial

  • n_arm - sample size per arm (assumed equal)

  • num_arms - number of experimental treatment arms in the trial

  • d - timings of adding new arms

  • SS_matrix - matrix with the sample sizes per arm and per period

  • period_blocks - number to multiply the number of active arms with, in order to get the block size per period

  • p0 - response probability in the control arm

  • OR - odds ratios for each experimental treatment arm

  • lambda - strength of time trend in each arm

  • time_dep_effect - time dependent treatment effects for each experimental treatment arm (for computing the bias)

  • trend - time trend pattern

Author(s)

Pavla Krotka, Marta Bofill Roig

Examples

head(datasim_bin(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
p0 = 0.7, OR = rep(1.8, 3), lambda = rep(0.15, 4), trend="stepwise"))

Simulate continuous data from a platform trial with a shared control arm and a given number of experimental treatment arms entering at given time points

Description

This function simulates data from a platform trial with a given number of experimental treatment arms entering at given time points and a shared control arm. The primary endpoint is a continuous endpoint. The user specifies the timing of adding arms in terms of patients recruited to the trial so far and the sample size per arm.

Usage

datasim_cont(
  num_arms,
  n_arm,
  d,
  period_blocks = 2,
  mu0 = 0,
  theta,
  lambda,
  sigma,
  trend,
  N_peak,
  n_wave,
  full = FALSE,
  check = TRUE
)

Arguments

num_arms

Integer. Number of experimental treatment arms in the trial.

n_arm

Integer. Sample size per experimental treatment arm (assumed equal).

d

Integer vector with timings of adding new arms in terms of number of patients recruited to the trial so far. The first entry must be 0, so that the trial starts with at least one experimental treatment arm, and the entries must be non-decreasing. The vector length equals num_arms.

period_blocks

Integer. Number to define the size of the blocks for the block randomization. The block size in each period equals period_blockstimes the number of active arms in the period (see Details). Default=2.

mu0

Double. Response in the control arm. Default=0.

theta

Double vector with treatment effects in terms of difference of means for each experimental treatment arm compared to control. The elements of the vector (treatment effects) are ordered by the entry time of the experimental treatment arms (e.g., the first entry in the vector corresponds to the treatment effect of the first experimental treatment arm). The vector length equals num_arms.

lambda

Double vector with strength of time trend in each arm ordered by the entry time of the arms (e.g., the first entry in the vector corresponds to the time trend in the control arm, second entry to the time trend in the first experimental treatment arm). The vector length equals num_arms+1, as time trend in the control is also allowed.

sigma

Double. Standard deviation of the responses.

trend

String indicating the time trend pattern ("linear", "linear_2, "stepwise", "stepwise_2", "inv_u" or "seasonal"). See Details for more information.

N_peak

Integer. Timepoint at which the inverted-u time trend switches direction in terms of overall sample size (i.e. after how many recruited participants the trend direction switches).

n_wave

Integer. Number of cycles (waves) should the seasonal trend have.

full

Logical. Indicates whether the output should be in form of a data frame with variables needed for the analysis only (FALSE) or in form of a list containing more information (TRUE). Default=FALSE.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

Details

Design assumptions:

  • The simulated platform trial consists of a given number of experimental treatment arms (specified by the argument num_arms) and one control arm that is shared across the whole platform.

  • Participants are indexed by entry order, assuming that at each time unit exactly one participant is recruited and the time of recruitment and observation of the response are equal.

  • All participants are assumed to be eligible for all arms in the trial, i.e. the same inclusion and exclusion criteria apply to all experimental and control arms.

  • Equal sample sizes (given by parameter n_arm) in all experimental treatment arms are assumed.

  • The duration of a platform trial is divided into so-called periods, defined as time intervals bounded by distinct time points of any treatment arm entering or leaving the platform. Hence, multiple treatment arms entering or leaving at the same time point imply the start of only one additional period.

  • Allocation ratio of 1:1:...:1 in each period. Furthermore, block randomization is used to assign patients to the active arms. Block size in each period = period_blocks* (number of active arms in the period).

  • If the period sample size is not a multiple of the block size, arms for the remaining participants are chosen by sampling without replacement from a vector containing the indices of active arms replicated times ⁠ceiling(remaining sample size/number of active arms)⁠.

Data generation:

The continuous response yjy_j for patient jj is generated according to:

E(yj)=η0+k=1KI(kj=k)+f(j)E(y_j) = \eta_0 + \sum_{k=1}^K \cdot I(k_j=k) + f(j)

where η0\eta_0 (parameter mu0) and θk\theta_k (parameter theta) are the response in the control arm and the effect of treatment kk. KK is the total number of treatment arms in the trial (parameter num_arms) and kjk_j is an indicator of the treatment arm patient jj is allocated to.

The function f(j)f(j) denotes the time trend, whose strength is indicated by λkj\lambda_{k_j} (parameter lambda) and which can have the following patterns (parameter trend):

  • "linear" - trend starts at the beginning of the trial and the mean response increases or decreases linearly with a slope of λ\lambda, according to the function f(j)=λj1N1f(j) = \lambda \cdot \frac{j-1}{N-1}, where NN is the total sample size in the trial

  • "linear_2" - trend starts after the first period (i.e. there is no time trend in the first period) and the mean response increases or decreases linearly with a slope of λ\lambda, according to the function f(j)=λj1N1f(j) = \lambda \cdot \frac{j-1}{N-1}, where NN is the total sample size in the trial

  • "stepwise" - the mean response is constant in each period and increases or decreases by λ\lambda each time any treatment arm enters or leaves the trial (i.e. in each period), according to the function f(j)=λkj(cj1)f(j) = \lambda_{k_j} \cdot (c_j - 1), where cjc_j is an index of the period patient jj was enrolled in

  • "stepwise_2" - the mean response is constant in each period and increases or decreases by λ\lambda each time a new treatment arm is added to the trial, according to the function f(j)=λkj(wj1)f(j) = \lambda_{k_j} \cdot (w_j - 1), where wjw_j is an indicator of how many treatment arms have already entered the ongoing trial, when patient jj was enrolled

  • "inv_u" - the mean response increases up to the point NpN_p (parameter N_peak) and decreases afterwards, linearly with a slope of λ\lambda, according to the function f(j)=λj1N1(I(jNp)I(j>Np))f(j) = \lambda \cdot \frac{j-1}{N-1} (I(j \leq N_p) - I(j > N_p)), where NpN_p indicates the point at which the trend turns from positive to negative in terms of the sample size (note that for negative λ\lambda, the mean response decreases first and increases after)

  • "seasonal" - the mean response increases and decreases periodically with a magnitude of λ\lambda, according to the function f(j)=λsin(ψ2πj1N1)f(j) = \lambda \cdot \mathrm{sin} \big( \psi \cdot 2\pi \cdot \frac{j-1}{N-1} \big), where ψ\psi indicates how many cycles should the time trend have (parameter n_wave)

Trials with no time trend can be simulated too, by setting all elements of the vector lambda to zero and choosing an arbitrary pattern.

Value

Data frame: simulated trial data (if full=FALSE, i.e. default) with the following columns:

  • j - patient recruitment index

  • response - continuous response for patient j

  • treatment- index of the treatment patient j was allocated to

  • period - index of the period patient j was recruited in

or List (if full=TRUE) containing the following elements:

  • Data - simulated trial data, including an additional column means with the theoretical means used for the simulation of the response for patient j

  • n_total - total sample size in the trial

  • n_arm - sample size per arm (assumed equal)

  • num_arms - number of experimental treatment arms in the trial

  • d - timings of adding new arms

  • SS_matrix - matrix with the sample sizes per arm and per period

  • period_blocks - number to multiply the number of active arms with, in order to get the block size per period

  • mu0 - response in the control arm

  • theta - treatment effects for each experimental treatment arm

  • lambda - strength of time trend in each arm

  • time_dep_effect - time dependent treatment effects for each experimental treatment arm (for computing the bias)

  • sigma - standard deviation of the responses

  • trend - time trend pattern

Author(s)

Pavla Krotka, Marta Bofill Roig

Examples

head(datasim_cont(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
theta = rep(0.25, 3), lambda = rep(0.15, 4), sigma = 1, trend = "linear"))

Frequentist logistic regression model analysis for binary data adjusting for periods

Description

This function performs logistic regression taking into account all trial data until the arm under study leaves the trial and adjusting for periods as factors.

Usage

fixmodel_bin(data, arm, alpha = 0.025, ncc = TRUE, check = TRUE, ...)

Arguments

data

Data frame with trial data, e.g. result from the datasim_bin() function. Must contain columns named 'treatment', 'response' and 'period'.

arm

Integer. Index of the treatment arm under study to perform inference on (vector of length 1). This arm is compared to the control group.

alpha

Double. Significance level (one-sided). Default=0.025.

ncc

Logical. Indicates whether to include non-concurrent data into the analysis. Default=TRUE.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

...

Further arguments passed by wrapper functions when running simulations.

Details

The model-based analysis adjusts for the time effect by including the factor period (defined as a time interval bounded by any treatment arm entering or leaving the platform). The time is then modelled as a step-function with jumps at the beginning of each period. Denoting by yjy_j the response probability for patient jj, by kjk_j the arm patient jj was allocated to, and by MM the treatment arm under evaluation, the regression model is given by:

g(E(yj))=η0+kKMθkI(kj=k)+s=2SMτsI(tjTSs)g(E(y_j)) = \eta_0 + \sum_{k \in \mathcal{K}_M} \theta_k \cdot I(k_j=k) + \sum_{s=2}^{S_M} \tau_s \cdot I(t_j \in T_{S_s})

where g()g(\cdot) denotes the logit link function and η0\eta_0 is the log odds in the control arm in the first period; θk\theta_k represents the log odds ratio of treatment kk and control for kKMk\in\mathcal{K}_M, where KM\mathcal{K}_M is the set of treatments that were active in the trial during periods prior or up to the time when the investigated treatment arm left the trial; τs\tau_s indicates the stepwise period effect in terms of the log odds ratio between periods 1 and ss (s=2,,SMs = 2, \ldots, S_M), where SMS_M denotes the period, in which the investigated treatment arm left the trial.

If the data consists of only one period (e.g. in case of a multi-arm trial), the period in not used as covariate.

Value

List containing the following elements regarding the results of comparing arm to control:

  • p-val - p-value (one-sided)

  • treat_effect - estimated treatment effect in terms of the log-odds ratio

  • lower_ci - lower limit of the (1-2*alpha)*100% confidence interval

  • upper_ci - upper limit of the (1-2*alpha)*100% confidence interval

  • reject_h0 - indicator of whether the null hypothesis was rejected or not (p_val < alpha)

  • model - fitted model

Author(s)

Pavla Krotka

References

On model-based time trend adjustments in platform trials with non-concurrent controls. Bofill Roig, M., Krotka, P., et al. BMC Medical Research Methodology 22.1 (2022): 1-16.

Examples

trial_data <- datasim_bin(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
p0 = 0.7, OR = rep(1.8, 3), lambda = rep(0.15, 4), trend="stepwise")

fixmodel_bin(data = trial_data, arm = 3)

Frequentist logistic regression model analysis for binary data adjusting for calendar time units

Description

This function performs logistic regression taking into account all trial data until the arm under study leaves the trial and adjusting for calendar time units as factors.

Usage

fixmodel_cal_bin(
  data,
  arm,
  alpha = 0.025,
  unit_size = 25,
  ncc = TRUE,
  check = TRUE,
  ...
)

Arguments

data

Data frame with trial data, e.g. result from the datasim_bin() function. Must contain columns named 'treatment' and 'response'.

arm

Integer. Index of the treatment arm under study to perform inference on (vector of length 1). This arm is compared to the control group.

alpha

Double. Significance level (one-sided). Default=0.025.

unit_size

Integer. Number of patients per calendar time unit. Default=25.

ncc

Logical. Indicates whether to include non-concurrent data into the analysis. Default=TRUE.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

...

Further arguments passed by wrapper functions when running simulations.

Details

The model-based analysis adjusts for the time effect by including the factor calendar time unit (defined as time units of fixed length, defined by ùnit_size). The time is then modelled as a step-function with jumps at the beginning of each calendar time unit. Denoting by yjy_j the response probability for patient jj, by kjk_j the arm patient jj was allocated to, and by MM the treatment arm under evaluation, the regression model is given by:

g(E(yj))=η0+kKMθkI(kj=k)+c=2CMτcI(tjTCc)g(E(y_j)) = \eta_0 + \sum_{k \in \mathcal{K}_M} \theta_k \cdot I(k_j=k) + \sum_{c=2}^{C_M} \tau_c \cdot I(t_j \in T_{C_c})

where g()g(\cdot) denotes the logit link function and η0\eta_0 is the log odds in the control arm in the first calendar time unit; θk\theta_k represents the log odds ratio of treatment kk and control for kKMk\in\mathcal{K}_M, where KM\mathcal{K}_M is the set of treatments that were active in the trial during calendar time units prior or up to the time when the investigated treatment arm left the trial; τc\tau_c indicates the stepwise calendar time effect in terms of the log odds ratio between calendar time units 1 and cc (c=2,,CMc = 2, \ldots, C_M), where CMC_M denotes the calendar time unit, in which the investigated treatment arm left the trial.

If the data consists of only one calendar time unit, the calendar time unit in not used as covariate.

Value

List containing the following elements regarding the results of comparing arm to control:

  • p-val - p-value (one-sided)

  • treat_effect - estimated treatment effect in terms of the log-odds ratio

  • lower_ci - lower limit of the (1-2*alpha)*100% confidence interval

  • upper_ci - upper limit of the (1-2*alpha)*100% confidence interval

  • reject_h0 - indicator of whether the null hypothesis was rejected or not (p_val < alpha)

  • model - fitted model

Author(s)

Pavla Krotka

Examples

trial_data <- datasim_bin(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
p0 = 0.7, OR = rep(1.8, 3), lambda = rep(0.15, 4), trend="stepwise")

fixmodel_cal_bin(data = trial_data, arm = 3)

Frequentist linear regression model analysis for continuous data adjusting for calendar time units

Description

This function performs linear regression taking into account all trial data until the arm under study leaves the trial and adjusting for calendar time units as factors.

Usage

fixmodel_cal_cont(
  data,
  arm,
  alpha = 0.025,
  unit_size = 25,
  ncc = TRUE,
  check = TRUE,
  ...
)

Arguments

data

Data frame with trial data, e.g. result from the datasim_cont() function. Must contain columns named 'treatment' and 'response'.

arm

Integer. Index of the treatment arm under study to perform inference on (vector of length 1). This arm is compared to the control group.

alpha

Double. Significance level (one-sided). Default=0.025.

unit_size

Integer. Number of patients per calendar time unit. Default=25.

ncc

Logical. Indicates whether to include non-concurrent data into the analysis. Default=TRUE.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

...

Further arguments passed by wrapper functions when running simulations.

Details

The model-based analysis adjusts for the time effect by including the factor calendar time unit (defined as time units of fixed length, defined by ùnit_size). The time is then modelled as a step-function with jumps at the beginning of each calendar time unit. Denoting by yjy_j the continuous response for patient jj, by kjk_j the arm patient jj was allocated to, and by MM the treatment arm under evaluation, the regression model is given by:

E(yj)=η0+kKMθkI(kj=k)+c=2CMτcI(tjTCc)E(y_j) = \eta_0 + \sum_{k \in \mathcal{K}_M} \theta_k \cdot I(k_j=k) + \sum_{c=2}^{C_M} \tau_c \cdot I(t_j \in T_{C_c})

where η0\eta_0 is the response in the control arm in the first calendar time unit; θk\theta_k represents the effect of treatment kk compared to control for kKMk\in\mathcal{K}_M, where KM\mathcal{K}_M is the set of treatments that were active in the trial during calendar time units prior or up to the time when the investigated treatment arm left the trial; τc\tau_c indicates the stepwise calendar time effect between calendar time units 1 and cc (c=2,,CMc = 2, \ldots, C_M), where CMC_M denotes the calendar time unit, in which the investigated treatment arm left the trial.

If the data consists of only one calendar time unit, the calendar time unit in not used as covariate.

Value

List containing the following elements regarding the results of comparing arm to control:

  • p-val - p-value (one-sided)

  • treat_effect - estimated treatment effect in terms of the difference in means

  • lower_ci - lower limit of the (1-2*alpha)*100% confidence interval

  • upper_ci - upper limit of the (1-2*alpha)*100% confidence interval

  • reject_h0 - indicator of whether the null hypothesis was rejected or not (p_val < alpha)

  • model - fitted model

Author(s)

Pavla Krotka

Examples

trial_data <- datasim_cont(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
theta = rep(0.25, 3), lambda = rep(0.15, 4), sigma = 1, trend = "linear")

fixmodel_cal_cont(data = trial_data, arm = 3)

Frequentist linear regression model analysis for continuous data adjusting for periods

Description

This function performs linear regression taking into account all trial data until the arm under study leaves the trial and adjusting for periods as factors.

Usage

fixmodel_cont(data, arm, alpha = 0.025, ncc = TRUE, check = TRUE, ...)

Arguments

data

Data frame with trial data, e.g. result from the datasim_cont() function. Must contain columns named 'treatment', 'response' and 'period'.

arm

Integer. Index of the treatment arm under study to perform inference on (vector of length 1). This arm is compared to the control group.

alpha

Double. Significance level (one-sided). Default=0.025.

ncc

Logical. Indicates whether to include non-concurrent data into the analysis. Default=TRUE.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

...

Further arguments passed by wrapper functions when running simulations.

Details

The model-based analysis adjusts for the time effect by including the factor period (defined as a time interval bounded by any treatment arm entering or leaving the platform). The time is then modelled as a step-function with jumps at the beginning of each period. Denoting by yjy_j the continuous response for patient jj, by kjk_j the arm patient jj was allocated to, and by MM the treatment arm under evaluation, the regression model is given by:

E(yj)=η0+kKMθkI(kj=k)+s=2SMτsI(tjTSs)E(y_j) = \eta_0 + \sum_{k \in \mathcal{K}_M} \theta_k \cdot I(k_j=k) + \sum_{s=2}^{S_M} \tau_s \cdot I(t_j \in T_{S_s})

where η0\eta_0 is the response in the control arm in the first period; θk\theta_k represents the effect of treatment kk compared to control for kKMk\in\mathcal{K}_M, where KM\mathcal{K}_M is the set of treatments that were active in the trial during periods prior or up to the time when the investigated treatment arm left the trial; τs\tau_s indicates the stepwise period effect between periods 1 and ss (s=2,,SMs = 2, \ldots, S_M), where SMS_M denotes the period, in which the investigated treatment arm left the trial.

If the data consists of only one period (e.g. in case of a multi-arm trial), the period in not used as covariate.

Value

List containing the following elements regarding the results of comparing arm to control:

  • p-val - p-value (one-sided)

  • treat_effect - estimated treatment effect in terms of the difference in means

  • lower_ci - lower limit of the (1-2*alpha)*100% confidence interval

  • upper_ci - upper limit of the (1-2*alpha)*100% confidence interval

  • reject_h0 - indicator of whether the null hypothesis was rejected or not (p_val < alpha)

  • model - fitted model

Author(s)

Pavla Krotka

References

On model-based time trend adjustments in platform trials with non-concurrent controls. Bofill Roig, M., Krotka, P., et al. BMC Medical Research Methodology 22.1 (2022): 1-16.

Examples

trial_data <- datasim_cont(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
theta = rep(0.25, 3), lambda = rep(0.15, 4), sigma = 1, trend = "linear")

fixmodel_cont(data = trial_data, arm = 3)

Generalized additive model analysis for continuous data

Description

This function performs analysis using a generalized additive model taking into account all trial data until the arm under study leaves the trial and smoothing over the patient entry index.

Usage

gam_cont(
  data,
  arm,
  alpha = 0.025,
  ci = FALSE,
  smoothing_basis = "tp",
  basis_dim = -1,
  gam_method = "GCV.Cp",
  check = TRUE,
  ...
)

Arguments

data

Data frame with trial data, e.g. result from the datasim_cont() function. Must contain columns named 'treatment', 'response', 'period' and 'j'.

arm

Integer. Index of the treatment arm under study to perform inference on (vector of length 1). This arm is compared to the control group.

alpha

Double. Significance level (one-sided). Default=0.025.

ci

Logical. Indicates whether confidence intervals should be computed. Default=FALSE.

smoothing_basis

String indicating the (penalized) smoothing basis to use. Default="tp" for thin plate regression spline. Available strings are 'tp', 'ts', 'ds', 'cr', 'cs', 'cc', 'sos', 'ps', 'cp', 're', 'mrf', 'gp', and 'so'. For more information see https://stat.ethz.ch/R-manual/R-devel/library/mgcv/html/smooth.terms.html.

basis_dim

Integer. The dimension of the basis used to represent the smooth term. The default depends on the number of variables that the smooth is a function of. Default=-1. For more information see the description of the parameter 'k' in https://stat.ethz.ch/R-manual/R-devel/library/mgcv/html/s.html.

gam_method

String indicating the smoothing parameter estimation method. Default="GCV.Cp". Available strings are 'GCV.Cp', 'GACV.Cp', 'REML', 'P-REML', 'ML', and 'P-ML'. For more information see the description of the parameter 'method' in https://stat.ethz.ch/R-manual/R-devel/library/mgcv/html/gam.html.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

...

Further arguments passed by wrapper functions when running simulations.

Value

List containing the following elements regarding the results of comparing arm to control:

  • p-val - p-value (one-sided)

  • treat_effect - estimated treatment effect in terms of the difference in means

  • lower_ci - lower limit of the (1-2*alpha)*100% confidence interval

  • upper_ci - upper limit of the (1-2*alpha)*100% confidence interval

  • reject_h0 - indicator of whether the null hypothesis was rejected or not (p_val < alpha)

  • model - fitted model

Author(s)

Pavla Krotka

Examples

trial_data <- datasim_cont(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
theta = rep(0.25, 3), lambda = rep(0.15, 4), sigma = 1, trend = "linear")

gam_cont(data = trial_data, arm = 3, ci = TRUE)

Sample size matrix for a platform trial with a given number of treatment arms

Description

This function computes the matrix with sample sizes per arm and period. It is used in the functions datasim_bin() and datasim_cont().

Usage

get_ss_matrix(num_arms, n_arm, d)

Arguments

num_arms

Integer. Number of experimental treatment arms in the trial.

n_arm

Integer. Sample size per experimental treatment arm.

d

Integer vector with timings of adding new arms in terms of number of patients recruited to the trial so far. The first entry must be 0, so that the trial starts with at least one experimental treatment arm, and the entries must be non-decreasing. The vector length equals num_arms.

Value

Sample size matrix, consisting of the sample size per arm and per period, where the arms are represented in the rows (with the control arm in the first row and the experimental arms coming after ordered by entry time) and the periods are represented in the columns.

Author(s)

Pavla Krotka

Examples

get_ss_matrix(num_arms = 3, n_arm = 100, d = c(0, 100, 250))

Generation of an inverted-u trend

Description

This function generates a time trend for given time points in the trial according to an inverted-u function.

Usage

inv_u_trend(j, lambda, N_peak, n_total)

Arguments

j

Time points for which the trend should be generated.

lambda

Strength of time trend.

N_peak

Point at which the time trend switches direction in terms of overall sample size.

n_total

Total sample size in the trial.

Details

The time trend is generated according to the function f(j)=λj1N1(I(jNp)I(j>Np))f(j) = \lambda \cdot \frac{j-1}{N-1} (I(j \leq N_p) - I(j > N_p)), where NN is the total sample size (parameter n_total) and NpN_p (parameter N_peak) indicates the point at which the trend switches direction.

Value

Time trend for time points j.

Author(s)

Marta Bofill Roig, Pavla Krotka


Generation of a linear trend that starts in a given period

Description

This function generates a time trend for given time points in the trial according to a linear function.

Usage

linear_trend(j, lambda, sample_size)

Arguments

j

Time points for which the trend should be generated.

lambda

Strength of time trend.

sample_size

Vector of dimension 2, indicating sample size in the trial period until the time trend starts and the remaining sample size.

Details

The time trend is generated according to the function f(j)=λj1N1f(j) = \lambda \cdot \frac{j-1}{N-1}, where NN is the total sample size.

Value

Time trend for time points j.

Author(s)

Marta Bofill Roig, Pavla Krotka


Analysis for binary data using the MAP Prior approach

Description

This function performs analysis of binary data using the Meta-Analytic-Predictive (MAP) Prior approach. The method borrows data from non-concurrent controls to obtain the prior distribution for the control response in the concurrent periods.

Usage

MAPprior_bin(
  data,
  arm,
  alpha = 0.025,
  opt = 2,
  prior_prec_tau = 4,
  prior_prec_eta = 0.001,
  n_samples = 1000,
  n_chains = 4,
  n_iter = 4000,
  n_adapt = 1000,
  robustify = TRUE,
  weight = 0.1,
  check = TRUE,
  ...
)

Arguments

data

Data frame with trial data, e.g. result from the datasim_bin() function. Must contain columns named 'treatment', 'response' and 'period'.

arm

Integer. Index of the treatment arm under study to perform inference on (vector of length 1). This arm is compared to the control group.

alpha

Double. Decision boundary (one-sided). Default=0.025

opt

Integer (1 or 2). If opt==1, all former periods are used as one source; if opt==2, periods get separately included into the final analysis. Default=2.

prior_prec_tau

Double. Precision parameter (1/στ21/\sigma^2_{\tau}) of the half normal hyperprior, the prior for the between study heterogeneity. Default=4.

prior_prec_eta

Double. Precision parameter (1/ση21/\sigma^2_{\eta}) of the normal hyperprior, the prior for the hyperparameter mean of the control log-odds. Default=0.001.

n_samples

Integer. Number of how many random samples will get drawn for the calculation of the posterior mean, the p-value and the CI's. Default=1000.

n_chains

Integer. Number of parallel chains for the rjags model. Default=4.

n_iter

Integer. Number of iterations to monitor of the jags.model. Needed for coda.samples. Default=4000.

n_adapt

Integer. Number of iterations for adaptation, an initial sampling phase during which the samplers adapt their behavior to maximize their efficiency. Needed for jags.model. Default=1000.

robustify

Logical. Indicates whether a robust prior is to be used. If TRUE, a mixture prior is considered combining a MAP prior and a weakly non-informative component prior. Default=TRUE.

weight

Double. Weight given to the non-informative component (0 < weight < 1) for the robustification of the MAP prior according to Schmidli (2014). Default=0.1.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

...

Further arguments passed by wrapper functions when running simulations.

Details

The MAP approach derives the prior distribution for the control response in the concurrent periods by combining the control information from the non-concurrent periods with a non-informative prior.

The model for the binary response yjsy_{js} for the control patient jj in the non-concurrent period ss is defined as follows:

g(E(yjs))=ηsg(E(y_{js})) = \eta_s

where g()g(\cdot) denotes the logit link function and ηs\eta_s represents the control log odds in the non-concurrent period ss.

The log odds for the non-concurrent controls in period ss are assumed to have a normal prior distribution with mean μη\mu_{\eta} and variance τ2\tau^2:

ηsN(μη,τ2)\eta_s \sim \mathcal{N}(\mu_{\eta}, \tau^2)

For the hyperparameters μη\mu_{\eta} and τ\tau, normal and half-normal hyperprior distributions are assumed, with mean 0 and variances ση2\sigma^2_{\eta} and στ2\sigma^2_{\tau}, respectively:

μηN(0,ση2)\mu_{\eta} \sim \mathcal{N}(0, \sigma^2_{\eta})

τHalfNormal(0,στ2)\tau \sim HalfNormal(0, \sigma^2_{\tau})

The MAP prior distribution pMAP(ηCC)p_{MAP}(\eta_{CC}) for the control response in the concurrent periods is then obtained as the posterior distribution of the parameters ηs\eta_s from the above specified model.

If robustify=TRUE, the MAP prior is robustified by adding a weakly-informative mixture component pnoninfp_{\mathrm{non-inf}}, leading to a robustified MAP prior distribution:

prMAP(ηCC)=(1w)pMAP(ηCC)+wpnoninf(ηCC)p_{rMAP}(\eta_{CC}) = (1-w) \cdot p_{MAP}(\eta_{CC}) + w \cdot p_{\mathrm{non-inf}}(\eta_{CC})

where ww (parameter weight) may be interpreted as the degree of skepticism towards borrowing strength.

In this function, the argument alpha corresponds to 1γ1-\gamma, where γ\gamma is the decision boundary. Specifically, the posterior probability of the difference distribution under the null hypothesis is such that: P(ptreatmentpcontrol>0)1P(p_{treatment}-p_{control}>0) \ge 1-alpha. In case of a non-informative prior this coincides with the frequentist type I error.

Value

List containing the following elements regarding the results of comparing arm to control:

  • p-val - posterior probability that the log-odds ratio is less than zero

  • treat_effect - posterior mean of log-odds ratio

  • lower_ci - lower limit of the (1-2*alpha)*100% credible interval for log-odds ratio

  • upper_ci - upper limit of the (1-2*alpha)*100% credible interval for log-odds ratio

  • reject_h0 - indicator of whether the null hypothesis was rejected or not (p_val < alpha)

Author(s)

Katharina Hees

References

Robust meta-analytic-predictive priors in clinical trials with historical control information. Schmidli, H., et al. Biometrics 70.4 (2014): 1023-1032.

Applying Meta-Analytic-Predictive Priors with the R Bayesian Evidence Synthesis Tools. Weber, S., et al. Journal of Statistical Software 100.19 (2021): 1548-7660.

Examples

trial_data <- datasim_bin(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
p0 = 0.7, OR = rep(1.8, 3), lambda = rep(0.15, 4), trend="stepwise")

MAPprior_bin(data = trial_data, arm = 3)

Analysis for continuous data using the MAP Prior approach

Description

This function performs analysis of continuous data using the Meta-Analytic-Predictive (MAP) Prior approach. The method borrows data from non-concurrent controls to obtain the prior distribution for the control response in the concurrent periods.

Usage

MAPprior_cont(
  data,
  arm,
  alpha = 0.025,
  opt = 2,
  prior_prec_tau = 4,
  prior_prec_eta = 0.001,
  n_samples = 1000,
  n_chains = 4,
  n_iter = 4000,
  n_adapt = 1000,
  robustify = TRUE,
  weight = 0.1,
  check = TRUE,
  ...
)

Arguments

data

Data frame with trial data, e.g. result from the datasim_bin() function. Must contain columns named 'treatment', 'response' and 'period'.

arm

Integer. Index of the treatment arm under study to perform inference on (vector of length 1). This arm is compared to the control group.

alpha

Double. Decision boundary (one-sided). Default=0.025

opt

Integer (1 or 2). If opt==1, all former periods are used as one source; if opt==2, periods get separately included into the final analysis. Default=2.

prior_prec_tau

Double. Precision parameter (1/στ21/\sigma^2_{\tau}) of the half normal hyperprior, the prior for the between study heterogeneity. Default=4.

prior_prec_eta

Double. Precision parameter (1/ση21/\sigma^2_{\eta}) of the normal hyperprior, the prior for the hyperparameter mean of the control mean. Default=0.001.

n_samples

Integer. Number of how many random samples will get drawn for the calculation of the posterior mean, the p-value and the CI's. Default=1000.

n_chains

Integer. Number of parallel chains for the rjags model. Default=4.

n_iter

Integer. Number of iterations to monitor of the jags.model. Needed for coda.samples. Default=4000.

n_adapt

Integer. Number of iterations for adaptation, an initial sampling phase during which the samplers adapt their behavior to maximize their efficiency. Needed for jags.model. Default=1000.

robustify

Logical. Indicates whether a robust prior is to be used. If TRUE, a mixture prior is considered combining a MAP prior and a weakly non-informative component prior. Default=TRUE.

weight

Double. Weight given to the non-informative component (0 < weight < 1) for the robustification of the MAP prior according to Schmidli (2014). Default=0.1.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

...

Further arguments passed by wrapper functions when running simulations.

Details

The MAP approach derives the prior distribution for the control response in the concurrent periods by combining the control information from the non-concurrent periods with a non-informative prior.

The model for the continuous response yjsy_{js} for the control patient jj in the non-concurrent period ss is defined as follows:

E(yjs)=ηsE(y_{js}) = \eta_s

where ηs\eta_s represents the control mean in the non-concurrent period ss.

The means for the non-concurrent controls in period ss are assumed to have a normal prior distribution with mean μη\mu_{\eta} and variance τ2\tau^2:

ηsN(μη,τ2)\eta_s \sim \mathcal{N}(\mu_{\eta}, \tau^2)

For the hyperparameters μη\mu_{\eta} and τ\tau, normal and half-normal hyperprior distributions are assumed, with mean 0 and variances ση2\sigma^2_{\eta} and στ2\sigma^2_{\tau}, respectively:

μηN(0,ση2)\mu_{\eta} \sim \mathcal{N}(0, \sigma^2_{\eta})

τHalfNormal(0,στ2)\tau \sim HalfNormal(0, \sigma^2_{\tau})

The MAP prior distribution pMAP(ηCC)p_{MAP}(\eta_{CC}) for the control response in the concurrent periods is then obtained as the posterior distribution of the parameters ηs\eta_s from the above specified model.

If robustify=TRUE, the MAP prior is robustified by adding a weakly-informative mixture component pnoninfp_{\mathrm{non-inf}}, leading to a robustified MAP prior distribution:

prMAP(ηCC)=(1w)pMAP(ηCC)+wpnoninf(ηCC)p_{rMAP}(\eta_{CC}) = (1-w) \cdot p_{MAP}(\eta_{CC}) + w \cdot p_{\mathrm{non-inf}}(\eta_{CC})

where ww (parameter weight) may be interpreted as the degree of skepticism towards borrowing strength.

In this function, the argument alpha corresponds to 1γ1-\gamma, where γ\gamma is the decision boundary. Specifically, the posterior probability of the difference distribution under the null hypothesis is such that: P(μtreatmentμcontrol>0)1P(\mu_{treatment}-\mu_{control}>0) \ge 1-alpha. In case of a non-informative prior this coincides with the frequentist type I error.

Value

List containing the following elements regarding the results of comparing arm to control:

  • p-val - posterior probability that the difference in means is less than zero

  • treat_effect - posterior mean of difference in means

  • lower_ci - lower limit of the (1-2*alpha)*100% credible interval for difference in means

  • upper_ci - upper limit of the (1-2*alpha)*100% credible interval for difference in means

  • reject_h0 - indicator of whether the null hypothesis was rejected or not (p_val < alpha)

Author(s)

Katharina Hees

References

Robust meta-analytic-predictive priors in clinical trials with historical control information. Schmidli, H., et al. Biometrics 70.4 (2014): 1023-1032.

Applying Meta-Analytic-Predictive Priors with the R Bayesian Evidence Synthesis Tools. Weber, S., et al. Journal of Statistical Software 100.19 (2021): 1548-7660.

Examples

trial_data <- datasim_cont(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
theta = rep(0.25, 3), lambda = rep(0.15, 4), sigma = 1, trend = "stepwise")

MAPprior_cont(data = trial_data, arm = 3)

Mixed regression model analysis for continuous data adjusting for calendar time units as a random factor with AR1 correlation structure

Description

This function performs linear mixed model regression taking into account all trial data until the arm under study leaves the trial and adjusting for calendar time units as random factors with AR1 correlation structure.

Usage

mixmodel_AR1_cal_cont(
  data,
  arm,
  alpha = 0.025,
  ci = FALSE,
  unit_size = 25,
  ncc = TRUE,
  check = TRUE,
  ...
)

Arguments

data

Data frame with trial data, e.g. result from the datasim_cont() function. Must contain columns named 'treatment' and 'response'.

arm

Integer. Index of the treatment arm under study to perform inference on (vector of length 1). This arm is compared to the control group.

alpha

Double. Significance level (one-sided). Default=0.025.

ci

Logical. Indicates whether confidence intervals should be computed. Default=FALSE.

unit_size

Integer. Number of patients per calendar time unit. Default=25.

ncc

Logical. Indicates whether to include non-concurrent data into the analysis. Default=TRUE.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

...

Further arguments passed by wrapper functions when running simulations.

Value

List containing the following elements regarding the results of comparing arm to control:

  • p-val - p-value (one-sided)

  • treat_effect - estimated treatment effect in terms of the difference in means

  • lower_ci - lower limit of the (1-2*alpha)*100% confidence interval

  • upper_ci - upper limit of the (1-2*alpha)*100% confidence interval

  • reject_h0 - indicator of whether the null hypothesis was rejected or not (p_val < alpha)

  • model - fitted model

Author(s)

Pavla Krotka

Examples

trial_data <- datasim_cont(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
theta = rep(0.25, 3), lambda = rep(0.15, 4), sigma = 1, trend = "linear")

mixmodel_AR1_cal_cont(data = trial_data, arm = 3, ci = TRUE)

Mixed regression model analysis for continuous data adjusting for periods as a random factor with AR1 correlation structure

Description

This function performs linear mixed model regression taking into account all trial data until the arm under study leaves the trial and adjusting for periods as random factors with AR1 correlation structure.

Usage

mixmodel_AR1_cont(
  data,
  arm,
  alpha = 0.025,
  ci = FALSE,
  ncc = TRUE,
  check = TRUE,
  ...
)

Arguments

data

Data frame with trial data, e.g. result from the datasim_cont() function. Must contain columns named 'treatment', 'response' and 'period'.

arm

Integer. Index of the treatment arm under study to perform inference on (vector of length 1). This arm is compared to the control group.

alpha

Double. Significance level (one-sided). Default=0.025.

ci

Logical. Indicates whether confidence intervals should be computed. Default=FALSE.

ncc

Logical. Indicates whether to include non-concurrent data into the analysis. Default=TRUE.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

...

Further arguments passed by wrapper functions when running simulations.

Value

List containing the following elements regarding the results of comparing arm to control:

  • p-val - p-value (one-sided)

  • treat_effect - estimated treatment effect in terms of the difference in means

  • lower_ci - lower limit of the (1-2*alpha)*100% confidence interval

  • upper_ci - upper limit of the (1-2*alpha)*100% confidence interval

  • reject_h0 - indicator of whether the null hypothesis was rejected or not (p_val < alpha)

  • model - fitted model

Author(s)

Pavla Krotka

Examples

trial_data <- datasim_cont(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
theta = rep(0.25, 3), lambda = rep(0.15, 4), sigma = 1, trend = "linear")

mixmodel_AR1_cont(data = trial_data, arm = 3, ci = TRUE)

Mixed regression model analysis for continuous data adjusting for calendar time units as a random factor

Description

This function performs linear mixed model regression taking into account all trial data until the arm under study leaves the trial and adjusting for calendar time units as random factors.

Usage

mixmodel_cal_cont(
  data,
  arm,
  alpha = 0.025,
  ci = FALSE,
  unit_size = 25,
  ncc = TRUE,
  check = TRUE,
  ...
)

Arguments

data

Data frame with trial data, e.g. result from the datasim_cont() function. Must contain columns named 'treatment' and 'response'.

arm

Integer. Index of the treatment arm under study to perform inference on (vector of length 1). This arm is compared to the control group.

alpha

Double. Significance level (one-sided). Default=0.025.

ci

Logical. Indicates whether confidence intervals should be computed. Default=FALSE.

unit_size

Integer. Number of patients per calendar time unit. Default=25.

ncc

Logical. Indicates whether to include non-concurrent data into the analysis. Default=TRUE.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

...

Further arguments passed by wrapper functions when running simulations.

Value

List containing the following elements regarding the results of comparing arm to control:

  • p-val - p-value (one-sided)

  • treat_effect - estimated treatment effect in terms of the difference in means

  • lower_ci - lower limit of the (1-2*alpha)*100% confidence interval

  • upper_ci - upper limit of the (1-2*alpha)*100% confidence interval

  • reject_h0 - indicator of whether the null hypothesis was rejected or not (p_val < alpha)

  • model - fitted model

Author(s)

Pavla Krotka

Examples

trial_data <- datasim_cont(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
theta = rep(0.25, 3), lambda = rep(0.15, 4), sigma = 1, trend = "linear")

mixmodel_cal_cont(data = trial_data, arm = 3, ci = TRUE)

Mixed regression model analysis for continuous data adjusting for periods as a random factor

Description

This function performs linear mixed model regression taking into account all trial data until the arm under study leaves the trial and adjusting for periods as random factors.

Usage

mixmodel_cont(
  data,
  arm,
  alpha = 0.025,
  ci = FALSE,
  ncc = TRUE,
  check = TRUE,
  ...
)

Arguments

data

Data frame with trial data, e.g. result from the datasim_cont() function. Must contain columns named 'treatment', 'response' and 'period'.

arm

Integer. Index of the treatment arm under study to perform inference on (vector of length 1). This arm is compared to the control group.

alpha

Double. Significance level (one-sided). Default=0.025.

ci

Logical. Indicates whether confidence intervals should be computed. Default=FALSE.

ncc

Logical. Indicates whether to include non-concurrent data into the analysis. Default=TRUE.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

...

Further arguments passed by wrapper functions when running simulations.

Value

List containing the following elements regarding the results of comparing arm to control:

  • p-val - p-value (one-sided)

  • treat_effect - estimated treatment effect in terms of the difference in means

  • lower_ci - lower limit of the (1-2*alpha)*100% confidence interval

  • upper_ci - upper limit of the (1-2*alpha)*100% confidence interval

  • reject_h0 - indicator of whether the null hypothesis was rejected or not (p_val < alpha)

  • model - fitted model

Author(s)

Pavla Krotka

Examples

trial_data <- datasim_cont(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
theta = rep(0.25, 3), lambda = rep(0.15, 4), sigma = 1, trend = "linear")

mixmodel_cont(data = trial_data, arm = 3, ci = TRUE)

Model-based analysis for continuous data using discontinuous piecewise polynomials per calendar time unit

Description

This function performs linear regression taking into account all trial data until the arm under study leaves the trial and adjusting for time using discontinuous piecewise polynomials in each calendar time unit.

Usage

piecewise_cal_cont(
  data,
  arm,
  alpha = 0.025,
  unit_size = 25,
  ncc = TRUE,
  poly_degree = 3,
  check = TRUE,
  ...
)

Arguments

data

Data frame with trial data, e.g. result from the datasim_cont() function. Must contain columns named 'treatment', 'response' and 'j'.

arm

Integer. Index of the treatment arm under study to perform inference on (vector of length 1). This arm is compared to the control group.

alpha

Double. Significance level (one-sided). Default=0.025.

unit_size

Integer. Number of patients per calendar time unit. Default=25.

ncc

Logical. Indicates whether to include non-concurrent data into the analysis. Default=TRUE.

poly_degree

Integer. Degree of the piecewise polynomial. Default=3.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

...

Further arguments passed by wrapper functions when running simulations.

Value

List containing the following elements regarding the results of comparing arm to control:

  • p-val - p-value (one-sided)

  • treat_effect - estimated treatment effect in terms of the difference in means

  • lower_ci - lower limit of the (1-2*alpha)*100% confidence interval

  • upper_ci - upper limit of the (1-2*alpha)*100% confidence interval

  • reject_h0 - indicator of whether the null hypothesis was rejected or not (p_val < alpha)

  • model - fitted model

Author(s)

Pavla Krotka

Examples

trial_data <- datasim_cont(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
theta = rep(0.25, 3), lambda = rep(0.15, 4), sigma = 1, trend = "linear")

piecewise_cal_cont(data = trial_data, arm = 3)

Model-based analysis for continuous data using discontinuous piecewise polynomials per period

Description

This function performs linear regression taking into account all trial data until the arm under study leaves the trial and adjusting for time using discontinuous piecewise polynomials in each period.

Usage

piecewise_cont(
  data,
  arm,
  alpha = 0.025,
  ncc = TRUE,
  poly_degree = 3,
  check = TRUE,
  ...
)

Arguments

data

Data frame with trial data, e.g. result from the datasim_cont() function. Must contain columns named 'treatment', 'response', 'period' and 'j'.

arm

Integer. Index of the treatment arm under study to perform inference on (vector of length 1). This arm is compared to the control group.

alpha

Double. Significance level (one-sided). Default=0.025.

ncc

Logical. Indicates whether to include non-concurrent data into the analysis. Default=TRUE.

poly_degree

Integer. Degree of the piecewise polynomial. Default=3.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

...

Further arguments passed by wrapper functions when running simulations.

Value

List containing the following elements regarding the results of comparing arm to control:

  • p-val - p-value (one-sided)

  • treat_effect - estimated treatment effect in terms of the difference in means

  • lower_ci - lower limit of the (1-2*alpha)*100% confidence interval

  • upper_ci - upper limit of the (1-2*alpha)*100% confidence interval

  • reject_h0 - indicator of whether the null hypothesis was rejected or not (p_val < alpha)

  • model - fitted model

Author(s)

Pavla Krotka

Examples

trial_data <- datasim_cont(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
theta = rep(0.25, 3), lambda = rep(0.15, 4), sigma = 1, trend = "linear")

piecewise_cont(data = trial_data, arm = 3)

Function for visualizing the simulated trial

Description

This function creates a plot visualizing the trial progress over time.

Usage

plot_trial(treatments)

Arguments

treatments

Vector with indices of assigned arms for each participant, ordered by time, e.g. column treatment from the dataframe resulting from the datasim_bin() or datasim_cont() function.

Value

ggplot showing trial progress over time.

Author(s)

Pavla Krotka

Examples

trial_data <- datasim_bin(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
p0 = 0.7, OR = rep(1.8, 3), lambda = rep(0.15, 4), trend="stepwise")

plot_trial(treatments = trial_data$treatment)

Pooled analysis for binary data

Description

This function performs pooled analysis (naively pooling concurrent and non-concurrent controls without adjustment) using a logistic model.

Usage

poolmodel_bin(data, arm, alpha = 0.025, check = TRUE, ...)

Arguments

data

Data frame with trial data, e.g. result from the datasim_bin() function. Must contain columns named 'treatment', 'response' and 'period'.

arm

Integer. Index of the treatment arm under study to perform inference on (vector of length 1). This arm is compared to the control group.

alpha

Double. Significance level (one-sided). Default=0.025.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

...

Further arguments passed by wrapper functions when running simulations.

Details

The pooled analysis takes into account only the data from the evaluated experimental treatment arm and the whole control arm and uses a logistic regression model to evaluate the given treatment arm. Denoting by yjy_j the response probability for patient jj, by kjk_j the arm patient jj was allocated to, and by MM the treatment arm under evaluation, the regression model is given by:

g(E(yj))=η0+θMI(kj=M)g(E(y_j)) = \eta_0 + \theta_M \cdot I(k_j=M)

where g()g(\cdot) denotes the logit link function and η0\eta_0 is the log odds in the control arm; θM\theta_M represents the log odds ratio of treatment MM and control.

Value

List containing the following elements regarding the results of comparing arm to control:

  • p-val - p-value (one-sided)

  • treat_effect - estimated treatment effect in terms of the log-odds ratio

  • lower_ci - lower limit of the (1-2*alpha)*100% confidence interval

  • upper_ci - upper limit of the (1-2*alpha)*100% confidence interval

  • reject_h0 - indicator of whether the null hypothesis was rejected or not (p_val < alpha)

  • model - fitted model

Author(s)

Pavla Krotka

Examples

trial_data <- datasim_bin(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
p0 = 0.7, OR = rep(1.8, 3), lambda = rep(0.15, 4), trend="stepwise")

poolmodel_bin(data = trial_data, arm = 3)

Pooled analysis for continuous data

Description

This function performs pooled analysis (naively pooling concurrent and non-concurrent controls without adjustment) using a linear model.

Usage

poolmodel_cont(data, arm, alpha = 0.025, check = TRUE, ...)

Arguments

data

Data frame with trial data, e.g. result from the datasim_cont() function. Must contain columns named 'treatment', 'response' and 'period'.

arm

Integer. Index of the treatment arm under study to perform inference on (vector of length 1). This arm is compared to the control group.

alpha

Double. Significance level (one-sided). Default=0.025.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

...

Further arguments passed by wrapper functions when running simulations.

Details

The pooled analysis takes into account only the data from the evaluated experimental treatment arm and the whole control arm and uses a linear regression model to evaluate the given treatment arm. Denoting by yjy_j the continuous response for patient jj, by kjk_j the arm patient jj was allocated to, and by MM the treatment arm under evaluation, the regression model is given by:

E(yj)=η0+θMI(kj=M)E(y_j) = \eta_0 + \theta_M \cdot I(k_j=M)

where η0\eta_0 is the response in the control arm; θM\theta_M represents the treatment effect of treatment MM as compared to control.

Value

List containing the following elements regarding the results of comparing arm to control:

  • p-val - p-value (one-sided)

  • treat_effect - estimated treatment effect in terms of the difference in means

  • lower_ci - lower limit of the (1-2*alpha)*100% confidence interval

  • upper_ci - upper limit of the (1-2*alpha)*100% confidence interval

  • reject_h0 - indicator of whether the null hypothesis was rejected or not (p_val < alpha)

  • model - fitted model

Author(s)

Pavla Krotka

Examples

trial_data <- datasim_cont(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
theta = rep(0.25, 3), lambda = rep(0.15, 4), sigma = 1, trend = "linear")

poolmodel_cont(data = trial_data, arm = 3)

Generation of a seasonal trend

Description

This function generates a time trend for given time points in the trial according to a periodic function.

Usage

seasonal_trend(j, lambda, n_wave, n_total)

Arguments

j

Time points for which the trend should be generated.

lambda

Strength of time trend.

n_wave

How many cycles (waves) should the time trend have (ψ\psi).

n_total

Total sample size in the trial.

Details

The time trend is generated according to the function f(j)=λsin(ψ2πj1N1)f(j) = \lambda \cdot \mathrm{sin} \big( \psi \cdot 2\pi \cdot \frac{j-1}{N-1} \big), where NN is the total sample size (parameter n_total) and the parameter ψ\psi corresponds to the input parameter n_wave.

Value

Time trend for time points j.

Author(s)

Marta Bofill Roig, Pavla Krotka


Separate analysis for binary data adjusted for periods

Description

This function performs separate analysis (only taking into account concurrent controls) using a logistic model and adjusting for periods, if the treatment arm stays in the trial for more than one period.

Usage

sepmodel_adj_bin(data, arm, alpha = 0.025, check = TRUE, ...)

Arguments

data

Data frame with trial data, e.g. result from the datasim_bin() function. Must contain columns named 'treatment', 'response' and 'period'.

arm

Integer. Index of the treatment arm under study to perform inference on (vector of length 1). This arm is compared to the control group.

alpha

Double. Significance level (one-sided). Default=0.025.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

...

Further arguments passed by wrapper functions when running simulations.

Details

The adjusted separate analysis takes into account only the data from the evaluated experimental treatment arm and its concurrent controls and adjusts for the time effect by including the factor period (defined as a time interval bounded by any treatment arm entering or leaving the platform). The time is then modelled as a step-function with jumps at the beginning of each period. Denoting by yjy_j the response probability for patient jj, by kjk_j the arm patient jj was allocated to, and by MM the treatment arm under evaluation, the regression model is given by:

g(E(yj))=η0+θMI(kj=M)+s=SM1+1SM2τsI(tjTSs)g(E(y_j)) = \eta_0 + \theta_M \cdot I(k_j=M) + \sum_{s=S_{M_1}+1}^{S_{M_2}} \tau_s \cdot I(t_j \in T_{S_s})

where g()g(\cdot) denotes the logit link function and η0\eta_0 is the log odds in the concurrent controls; θM\theta_M represents the log odds ratio of treatment MM and control; τs\tau_s indicates the stepwise period effect in terms of the log odds ratio between periods SM1S_{M_1} and ss (s=SM1+1,,SM2s = S_{M_1}+1, \ldots, S_{M_2}), where SM1S_{M_1} and SM2S_{M_2} denote the periods, in which the investigated treatment arm joined and left the trial, respectively.

If the data consists of only one period, the period in not used as covariate.

Value

List containing the following elements regarding the results of comparing arm to control:

  • p-val - p-value (one-sided)

  • treat_effect - estimated treatment effect in terms of the log-odds ratio

  • lower_ci - lower limit of the (1-2*alpha)*100% confidence interval

  • upper_ci - upper limit of the (1-2*alpha)*100% confidence interval

  • reject_h0 - indicator of whether the null hypothesis was rejected or not (p_val < alpha)

  • model - fitted model

Author(s)

Pavla Krotka

Examples

trial_data <- datasim_bin(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
p0 = 0.7, OR = rep(1.8, 3), lambda = rep(0.15, 4), trend="stepwise")

sepmodel_adj_bin(data = trial_data, arm = 3)

Separate analysis for continuous data adjusted for periods

Description

This function performs separate analysis (only taking into account concurrent controls) using a linear model and adjusting for periods, if the treatment arm stays in the trial for more than one period.

Usage

sepmodel_adj_cont(data, arm, alpha = 0.025, check = TRUE, ...)

Arguments

data

Data frame with trial data, e.g. result from the datasim_cont() function. Must contain columns named 'treatment', 'response' and 'period'.

arm

Integer. Index of the treatment arm under study to perform inference on (vector of length 1). This arm is compared to the control group.

alpha

Double. Significance level (one-sided). Default=0.025.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

...

Further arguments passed by wrapper functions when running simulations.

Details

The adjusted separate analysis takes into account only the data from the evaluated experimental treatment arm and its concurrent controls and adjusts for the time effect by including the factor period (defined as a time interval bounded by any treatment arm entering or leaving the platform). The time is then modelled as a step-function with jumps at the beginning of each period. Denoting by yjy_j the response probability for patient jj, by kjk_j the arm patient jj was allocated to, and by MM the treatment arm under evaluation, the regression model is given by:

E(yj)=η0+θMI(kj=M)+s=SM1+1SM2τsI(tjTSs)E(y_j) = \eta_0 + \theta_M \cdot I(k_j=M) + \sum_{s=S_{M_1}+1}^{S_{M_2}} \tau_s \cdot I(t_j \in T_{S_s})

where η0\eta_0 is the response in the concurrent controls; θM\theta_M represents the treatment effect of treatment MM as compared to control; τs\tau_s indicates the stepwise period effect between periods SM1S_{M_1} and ss (s=SM1+1,,SM2s = S_{M_1}+1, \ldots, S_{M_2}), where SM1S_{M_1} and SM2S_{M_2} denote the periods, in which the investigated treatment arm joined and left the trial, respectively.

If the data consists of only one period, the period in not used as covariate.

Value

List containing the following elements regarding the results of comparing arm to control:

  • p-val - p-value (one-sided)

  • treat_effect - estimated treatment effect in terms of the difference in means

  • lower_ci - lower limit of the (1-2*alpha)*100% confidence interval

  • upper_ci - upper limit of the (1-2*alpha)*100% confidence interval

  • reject_h0 - indicator of whether the null hypothesis was rejected or not (p_val < alpha)

  • model - fitted model

Author(s)

Pavla Krotka

Examples

trial_data <- datasim_cont(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
theta = rep(0.25, 3), lambda = rep(0.15, 4), sigma = 1, trend = "linear")

sepmodel_adj_cont(data = trial_data, arm = 3)

Separate analysis for binary data

Description

This function performs separate analysis (only taking into account concurrent controls) using a logistic model.

Usage

sepmodel_bin(data, arm, alpha = 0.025, check = TRUE, ...)

Arguments

data

Data frame with trial data, e.g. result from the datasim_bin() function. Must contain columns named 'treatment', 'response' and 'period'.

arm

Integer. Index of the treatment arm under study to perform inference on (vector of length 1). This arm is compared to the control group.

alpha

Double. Significance level (one-sided). Default=0.025.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

...

Further arguments passed by wrapper functions when running simulations.

Details

The separate analysis takes into account only the data from the evaluated experimental treatment arm and its concurrent controls and uses a logistic regression model to evaluate the given treatment arm. Denoting by yjy_j the response probability for patient jj, by kjk_j the arm patient jj was allocated to, and by MM the treatment arm under evaluation, the regression model is given by:

g(E(yj))=η0+θMI(kj=M)g(E(y_j)) = \eta_0 + \theta_M \cdot I(k_j=M)

where g()g(\cdot) denotes the logit link function and η0\eta_0 is the log odds in the concurrent controls; θM\theta_M represents the log odds ratio of treatment MM and control.

Value

List containing the following elements regarding the results of comparing arm to control:

  • p-val - p-value (one-sided)

  • treat_effect - estimated treatment effect in terms of the log-odds ratio

  • lower_ci - lower limit of the (1-2*alpha)*100% confidence interval

  • upper_ci - upper limit of the (1-2*alpha)*100% confidence interval

  • reject_h0 - indicator of whether the null hypothesis was rejected or not (p_val < alpha)

  • model - fitted model

Author(s)

Pavla Krotka

Examples

trial_data <- datasim_bin(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
p0 = 0.7, OR = rep(1.8, 3), lambda = rep(0.15, 4), trend="stepwise")

sepmodel_bin(data = trial_data, arm = 3)

Separate analysis for continuous data

Description

This function performs separate analysis (only taking into account concurrent controls) using a linear model.

Usage

sepmodel_cont(data, arm, alpha = 0.025, check = TRUE, ...)

Arguments

data

Data frame trial data, e.g. result from the datasim_cont() function. Must contain columns named 'treatment', 'response' and 'period'.

arm

Integer. Index of the treatment arm under study to perform inference on (vector of length 1). This arm is compared to the control group.

alpha

Double. Significance level (one-sided). Default=0.025.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

...

Further arguments passed by wrapper functions when running simulations.

Details

The separate analysis takes into account only the data from the evaluated experimental treatment arm and its concurrent controls and uses a linear regression model to evaluate the given treatment arm. Denoting by yjy_j the continuous response for patient jj, by kjk_j the arm patient jj was allocated to, and by MM the treatment arm under evaluation, the regression model is given by:

E(yj)=η0+θMI(kj=M)E(y_j) = \eta_0 + \theta_M \cdot I(k_j=M)

where η0\eta_0 is the response in the concurrent controls; θM\theta_M represents the treatment effect of treatment MM as compared to control.

Value

List containing the following elements regarding the results of comparing arm to control:

  • p-val - p-value (one-sided)

  • treat_effect - estimated treatment effect in terms of the difference in means

  • lower_ci - lower limit of the (1-2*alpha)*100% confidence interval

  • upper_ci - upper limit of the (1-2*alpha)*100% confidence interval

  • reject_h0 - indicator of whether the null hypothesis was rejected or not (p_val < alpha)

  • model - fitted model

Author(s)

Pavla Krotka

Examples

trial_data <- datasim_cont(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
theta = rep(0.25, 3), lambda = rep(0.15, 4), sigma = 1, trend = "linear")

sepmodel_cont(data = trial_data, arm = 3)

Wrapper function performing simulation studies for a given set of scenarios (not parallelized)

Description

This function performs a simulation study for a given set of scenarios, analyzing simulated data using different models as indicated by the user. Performs inference for indicated experimental treatment arms. Simulates the probability to reject H0H_0 based on a given number of replications.

Usage

sim_study(
  nsim,
  scenarios,
  arms,
  models = c("fixmodel", "sepmodel", "poolmodel"),
  endpoint,
  verbose = TRUE
)

Arguments

nsim

Integer. Number of replications. Must be larger than 1.

scenarios

Data frame containing all parameters for scenarios that should be simulated.

arms

Integer vector with treatment arms to perform inference on. These arms are compared to the control group. Default - all arms except the first one.

models

Character vector with models that should be used for the analysis. Default=c("fixmodel", "sepmodel", "poolmodel"). Available models for continuous endpoints are: 'fixmodel', 'fixmodel_cal', 'gam', 'MAPprior', 'mixmodel', 'mixmodel_cal', 'mixmodel_AR1', 'mixmodel_AR1_cal', 'piecewise', 'piecewise_cal', 'poolmodel', 'sepmodel', 'sepmodel_adj', 'splines', 'splines_cal', 'timemachine'. Available models for binary endpoints are: 'fixmodel', 'fixmodel_cal', 'MAPprior', 'poolmodel', 'sepmodel', 'sepmodel_adj', 'timemachine'.

endpoint

Endpoint indicator. "cont" for continuous endpoints, "bin" for binary endpoints.

verbose

Logical. Indicates whether to print a message (system time and number of finished scenarios) after simulating each scenario in order to track the progress of the simulations. Default=TRUE.

Value

Data frame with all considered scenarios and corresponding results - the probability to reject H0H_0.

Author(s)

Pavla Krotka

Examples

# Create data frame with all parameters:
sim_scenarios <- data.frame(num_arms = 4,
n_arm = 250,
d1 = 250*0,
d2 = 250*1,
d3 = 250*2,
d4 = 250*3,
period_blocks = 2,
mu0 = 0,
sigma = 1,
theta1 = 0,
theta2 = 0,
theta3 = 0,
theta4 = 0,
lambda0 = rep(seq(-0.15, 0.15, length.out = 9), 2),
lambda1 = rep(seq(-0.15, 0.15, length.out = 9), 2),
lambda2 = rep(seq(-0.15, 0.15, length.out = 9), 2),
lambda3 = rep(seq(-0.15, 0.15, length.out = 9), 2),
lambda4 = rep(seq(-0.15, 0.15, length.out = 9), 2),
trend = c(rep("linear", 9), rep("stepwise_2", 9)),
alpha = 0.025,
ncc = TRUE)

# Run simulation study:
sim_results <- sim_study(nsim = 100, scenarios = sim_scenarios, arms = c(3, 4),
models = c("fixmodel", "sepmodel", "poolmodel"), endpoint = "cont")

Wrapper function performing simulation studies for a given set of scenarios (parallelized on replication level)

Description

This function performs a simulation study for a given set of scenarios, analyzing simulated data using different models as indicated by the user. Performs inference for indicated experimental treatment arms. Simulates the probability to reject H0H_0, and the bias, as well as the mean squared error (MSE) of the treatment effect estimates based on a given number of replications.

Usage

sim_study_par(
  nsim,
  scenarios,
  arms,
  models = c("fixmodel", "sepmodel", "poolmodel"),
  endpoint,
  perc_cores = 0.9,
  verbose = TRUE
)

Arguments

nsim

Integer. Number of replications. Must be larger than 1.

scenarios

Data frame containing all parameters for scenarios that should be simulated.

arms

Integer vector with treatment arms to perform inference on. These arms are compared to the control group. Default - all arms except the first one.

models

Character vector with models that should be used for the analysis. Default=c("fixmodel", "sepmodel", "poolmodel"). Available models for continuous endpoints are: 'fixmodel', 'fixmodel_cal', 'gam', 'MAPprior', 'mixmodel', 'mixmodel_cal', 'mixmodel_AR1', 'mixmodel_AR1_cal', 'piecewise', 'piecewise_cal', 'poolmodel', 'sepmodel', 'sepmodel_adj', 'splines', 'splines_cal', 'timemachine'. Available models for binary endpoints are: 'fixmodel', 'fixmodel_cal', 'MAPprior', 'poolmodel', 'sepmodel', 'sepmodel_adj', 'timemachine'.

endpoint

Endpoint indicator. "cont" for continuous endpoints, "bin" for binary endpoints.

perc_cores

Double. What percentage of available cores should be used for the simulations. Default=0.9.

verbose

Logical. Indicates whether to print a message (system time and number of finished scenarios) after simulating each scenario in order to track the progress of the simulations. Default=TRUE.

Value

Data frame with all considered scenarios and corresponding results - the probability to reject H0H_0, and the bias, as well as the mean squared error (MSE) of the treatment effect estimates.

Author(s)

Pavla Krotka

Examples

# Create data frame with all parameters:
sim_scenarios <- data.frame(num_arms = 4,
n_arm = 250,
d1 = 250*0,
d2 = 250*1,
d3 = 250*2,
d4 = 250*3,
period_blocks = 2,
mu0 = 0,
sigma = 1,
theta1 = 0,
theta2 = 0,
theta3 = 0,
theta4 = 0,
lambda0 = rep(seq(-0.15, 0.15, length.out = 9), 2),
lambda1 = rep(seq(-0.15, 0.15, length.out = 9), 2),
lambda2 = rep(seq(-0.15, 0.15, length.out = 9), 2),
lambda3 = rep(seq(-0.15, 0.15, length.out = 9), 2),
lambda4 = rep(seq(-0.15, 0.15, length.out = 9), 2),
trend = c(rep("linear", 9), rep("stepwise_2", 9)),
alpha = 0.025,
ncc = TRUE)

# Run simulation study:
sim_results <- sim_study_par(nsim = 100, scenarios = sim_scenarios, arms = c(3, 4),
models = c("fixmodel", "sepmodel", "poolmodel"), endpoint = "cont")

Spline regression analysis for continuous data with knots placed according to calendar time units

Description

This function performs linear regression taking into account all trial data until the arm under study leaves the trial and adjusting for time using regression splines with knots placed according to calendar time units.

Usage

splines_cal_cont(
  data,
  arm,
  alpha = 0.025,
  unit_size = 25,
  ncc = TRUE,
  bs_degree = 3,
  check = TRUE,
  ...
)

Arguments

data

Data frame with trial data, e.g. result from the datasim_cont() function. Must contain columns named 'treatment', 'response' and 'j'.

arm

Integer. Index of the treatment arm under study to perform inference on (vector of length 1). This arm is compared to the control group.

alpha

Double. Significance level (one-sided). Default=0.025.

unit_size

Integer. Number of patients per calendar time unit. Default=25.

ncc

Logical. Indicates whether to include non-concurrent data into the analysis. Default=TRUE.

bs_degree

Integer. Degree of the polynomial spline. Default=3 for cubic spline.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

...

Further arguments passed by wrapper functions when running simulations.

Value

List containing the following elements regarding the results of comparing arm to control:

  • p-val - p-value (one-sided)

  • treat_effect - estimated treatment effect in terms of the difference in means

  • lower_ci - lower limit of the (1-2*alpha)*100% confidence interval

  • upper_ci - upper limit of the (1-2*alpha)*100% confidence interval

  • reject_h0 - indicator of whether the null hypothesis was rejected or not (p_val < alpha)

  • knots - positions of the knots in terms of patient index

  • model - fitted model

Author(s)

Pavla Krotka

Examples

trial_data <- datasim_cont(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
theta = rep(0.25, 3), lambda = rep(0.15, 4), sigma = 1, trend = "linear")

splines_cal_cont(data = trial_data, arm = 3)

Spline regression analysis for continuous data with knots placed according to periods

Description

This function performs linear regression taking into account all trial data until the arm under study leaves the trial and adjusting for time using regression splines with knots placed according to periods.

Usage

splines_cont(
  data,
  arm,
  alpha = 0.025,
  ncc = TRUE,
  bs_degree = 3,
  check = TRUE,
  ...
)

Arguments

data

Data frame with trial data, e.g. result from the datasim_cont() function. Must contain columns named 'treatment', 'response', 'period' and 'j'.

arm

Integer. Index of the treatment arm under study to perform inference on (vector of length 1). This arm is compared to the control group.

alpha

Double. Significance level (one-sided). Default=0.025.

ncc

Logical. Indicates whether to include non-concurrent data into the analysis. Default=TRUE.

bs_degree

Integer. Degree of the polynomial spline. Default=3 for cubic spline.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

...

Further arguments passed by wrapper functions when running simulations.

Value

List containing the following elements regarding the results of comparing arm to control:

  • p-val - p-value (one-sided)

  • treat_effect - estimated treatment effect in terms of the difference in means

  • lower_ci - lower limit of the (1-2*alpha)*100% confidence interval

  • upper_ci - upper limit of the (1-2*alpha)*100% confidence interval

  • reject_h0 - indicator of whether the null hypothesis was rejected or not (p_val < alpha)

  • knots - positions of the knots in terms of patient index

  • model - fitted model

Author(s)

Pavla Krotka

Examples

trial_data <- datasim_cont(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
theta = rep(0.25, 3), lambda = rep(0.15, 4), sigma = 1, trend = "linear")

splines_cont(data = trial_data, arm = 3)

Generation of stepwise trend with equal jumps between periods

Description

This function generates a stepwise trend for a given period. No time trend is assumed in the first period.

Usage

sw_trend(cj, lambda)

Arguments

cj

Period indicator.

lambda

Strength of time trend.

Details

The time trend is generated according to the function f(j)=λ(cj1)f(j) = \lambda \cdot (c_j - 1), where cjc_j is an index of the period patient jj was enrolled in.

Value

Time trend in period cjc_j.

Author(s)

Marta Bofill Roig, Pavla Krotka


Time machine analysis for binary data

Description

This function performs analysis of binary data using the Time Machine approach. It takes into account all data until the investigated arm leaves the trial. It is based on logistic regression with treatment as a categorical variable and covariate adjustment for time via a second-order Bayesian normal dynamic linear model (separating the trial into buckets of pre-defined size).

Usage

timemachine_bin(
  data,
  arm,
  alpha = 0.025,
  prec_theta = 0.001,
  prec_eta = 0.001,
  tau_a = 0.1,
  tau_b = 0.01,
  bucket_size = 25,
  check = TRUE,
  ...
)

Arguments

data

Data frame with trial data, e.g. result from the datasim_bin() function. Must contain columns named 'treatment', 'response' and 'period'.

arm

Integer. Index of the treatment arm under study to perform inference on (vector of length 1). This arm is compared to the control group.

alpha

Double. Decision boundary (one-sided). Default=0.025.

prec_theta

Double. Precision (1/σθ21/\sigma^2_{\theta}) of the prior regarding the treatment effect θ\theta. I.e. θN(0,σθ2)\theta \sim N(0, \sigma^2_{\theta}) . Default=0.001.

prec_eta

Double. Precision (1/ση021/\sigma^2_{\eta_0}) of the prior regarding the control log-odds η0\eta_0. I.e. η0N(0,ση02)\eta_0 \sim N(0, \sigma^2_{\eta_0}). Default=0.001.

tau_a

Double. Parameter aτa_{\tau} of the Gamma distribution for the precision parameter τ\tau in the model for the time trend. I.e., τGamma(aτ,bτ)\tau \sim Gamma(a_{\tau},b_{\tau}). Default=0.1.

tau_b

Double. Parameter bτb_{\tau} of the Gamma distribution for the precision parameter τ\tau in the model for the time trend. I.e., τGamma(aτ,bτ)\tau \sim Gamma(a_{\tau},b_{\tau}). Default=0.01.

bucket_size

Integer. Number of patients per time bucket. Default=25.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

...

Further arguments passed by wrapper functions when running simulations.

Details

The Time Machine divides the trial duration into CC calendar time intervals of equal length ("buckets"), which are indexed backwards in time. That is to say, the most recent time interval is denoted by c=1c=1 and the time interval corresponding to the beginning of the trial by c=Cc=C. The analysis is performed as soon as the analyzed treatment arm finishes in the trial.

The model is defined as follows:

g(E(yj))=η0+θkj+αcjg(E(y_j)) = \eta_0 + \theta_{k_j} + \alpha_{c_j}

where yjy_j is the binary response for patient jj and g()g(\cdot) is the logit link function, which maps the expected value of the patient response to the linear predictors in the model. The model intercept η0\eta_0 denotes the response of the control group at time of the analysis, θkj\theta_{k_j} is the effect of the treatment arm kk that patient jj was enrolled in, relative to control in terms of the log odds ratio. For the parameters η0\eta_0 and θkj\theta_{k_j}, normal prior distributions are assumed, with mean 0 and variances ση02\sigma^2_{\eta_0} and σθ2\sigma^2_{\theta}, respectively:

η0N(0,ση02)\eta_0 \sim \mathcal{N}(0, \sigma^2_{\eta_0})

θkjN(0,σθ2)\theta_{k_j} \sim \mathcal{N}(0, \sigma^2_{\theta})

In the Time Machine, time effect is represented by αcj\alpha_{c_j}, which is the change in the response in time bucket cjc_j (which denotes the time bucket in which patient jj is enrolled) compared to the most recent time bucket c=1c=1 and is modeled using a Bayesian second-order normal dynamic linear model. This creates a smoothing over the control response, such that closer time buckets are modeled with more similar response rates:

α1=0\alpha_1 = 0

α2N(0,1/τ)\alpha_2 \sim \mathcal{N}(0, 1/\tau)

αcN(2αc1αc2,1/τ),3cC\alpha_c \sim \mathcal{N}(2 \alpha_{c-1} - \alpha_{c-2}, 1/\tau), 3 \le c \le C

where τ\tau denotes the drift parameter that controls the degree of smoothing over the time buckets and is assumed to have a Gamma hyperprior distribution:

τGamma(aτ,bτ)\tau \sim Gamma(a_{\tau}, b_{\tau})

Value

List containing the following elements regarding the results of comparing arm to control:

  • p-val - posterior probability that the log-odds ratio is less than zero

  • treat_effect - posterior mean of log-odds ratio

  • lower_ci - lower limit of the (1-2*alpha)*100% credible interval for log-odds ratio

  • upper_ci - upper limit of the (1-2*alpha)*100% credible interval for log-odds ratio

  • reject_h0 - indicator of whether the null hypothesis was rejected or not (p_val < alpha)

Author(s)

Dominic Magirr, Peter Jacko

References

The Bayesian Time Machine: Accounting for Temporal Drift in Multi-arm Platform Trials. Saville, B. R., Berry, D. A., et al. Clinical Trials 19.5 (2022): 490-501.

Examples

trial_data <- datasim_bin(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
p0 = 0.7, OR = rep(1.8, 3), lambda = rep(0.15, 4), trend="stepwise")

timemachine_bin(data = trial_data, arm = 3)

Time machine analysis for continuous data

Description

This function performs analysis of continuous data using the Time Machine approach. It takes into account all data until the investigated arm leaves the trial. It is based on linear regression with treatment as a categorical variable and covariate adjustment for time via a second-order Bayesian normal dynamic linear model (separating the trial into buckets of pre-defined size).

Usage

timemachine_cont(
  data,
  arm,
  alpha = 0.025,
  prec_theta = 0.001,
  prec_eta = 0.001,
  tau_a = 0.1,
  tau_b = 0.01,
  prec_a = 0.001,
  prec_b = 0.001,
  bucket_size = 25,
  check = TRUE,
  ...
)

Arguments

data

Data frame with trial data, e.g. result from the datasim_cont() function. Must contain columns named 'treatment', 'response' and 'period'.

arm

Integer. Index of the treatment arm under study to perform inference on (vector of length 1). This arm is compared to the control group.

alpha

Double. Decision boundary (one-sided). Default=0.025.

prec_theta

Double. Precision (1/σθ21/\sigma^2_{\theta}) of the prior regarding the treatment effect θ\theta. I.e. θN(0,σθ2)\theta \sim N(0, \sigma^2_{\theta}). Default=0.001.

prec_eta

Double. Precision (1/ση021/\sigma^2_{\eta_0}) of the prior regarding the control mean η0\eta_0. I.e. η0N(0,ση02)\eta_0 \sim N(0, \sigma^2_{\eta_0}). Default=0.001.

tau_a

Double. Parameter aτa_{\tau} of the Gamma distribution for the precision parameter τ\tau in the model for the time trend. I.e., τGamma(aτ,bτ)\tau \sim Gamma(a_{\tau},b_{\tau}). Default=0.1.

tau_b

Double. Parameter bτb_{\tau} of the Gamma distribution for the precision parameter τ\tau in the model for the time trend. I.e., τGamma(aτ,bτ)\tau \sim Gamma(a_{\tau},b_{\tau}). Default=0.01.

prec_a

Double. Parameter aσ2a_{\sigma^2} of the Gamma distribution regarding the precision of the responses. I.e., 1/σ2Gamma(aσ2,bσ2)1/\sigma^2 \sim Gamma(a_{\sigma^2},b_{\sigma^2}). Default=0.001.

prec_b

Double. Parameter bσ2b_{\sigma^2} of the Gamma distribution regarding the precision of the responses. I.e., 1/σ2Gamma(aσ2,bσ2)1/\sigma^2 \sim Gamma(a_{\sigma^2},b_{\sigma^2}). Default=0.001.

bucket_size

Integer. Number of patients per time bucket. Default=25.

check

Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.

...

Further arguments passed by wrapper functions when running simulations.

Details

The Time Machine divides the trial duration into CC calendar time intervals of equal length ("buckets"), which are indexed backwards in time. That is to say, the most recent time interval is denoted by c=1c=1 and the time interval corresponding to the beginning of the trial by c=Cc=C. The analysis is performed as soon as the analyzed treatment arm finishes in the trial.

The model is defined as follows:

E(yj)=η0+θkj+αcjE(y_j) = \eta_0 + \theta_{k_j} + \alpha_{c_j}

where yjy_j is the continuous response for patient jj. The model intercept η0\eta_0 denotes the response of the control group at time of the analysis, θkj\theta_{k_j} is the effect of the treatment arm kk that patient jj was enrolled in, relative to control. For the parameters η0\eta_0 and θkj\theta_{k_j}, normal prior distributions are assumed, with mean 0 and variances ση02\sigma^2_{\eta_0} and σθ2\sigma^2_{\theta}, respectively:

η0N(0,ση02)\eta_0 \sim \mathcal{N}(0, \sigma^2_{\eta_0})

θkjN(0,σθ2)\theta_{k_j} \sim \mathcal{N}(0, \sigma^2_{\theta})

In the Time Machine, time effect is represented by αcj\alpha_{c_j}, which is the change in the response in time bucket cjc_j (which denotes the time bucket in which patient jj is enrolled) compared to the most recent time bucket c=1c=1 and is modeled using a Bayesian second-order normal dynamic linear model. This creates a smoothing over the control response, such that closer time buckets are modeled with more similar response rates:

α1=0\alpha_1 = 0

α2N(0,1/τ)\alpha_2 \sim \mathcal{N}(0, 1/\tau)

αcN(2αc1αc2,1/τ),3cC\alpha_c \sim \mathcal{N}(2 \alpha_{c-1} - \alpha_{c-2}, 1/\tau), 3 \le c \le C

where τ\tau denotes the drift parameter that controls the degree of smoothing over the time buckets and is assumed to have a Gamma hyperprior distribution:

τGamma(aτ,bτ)\tau \sim Gamma(a_{\tau}, b_{\tau})

The precision of the individual patient responses (1/σ21/\sigma^2) is also assumed to have a Gamma hyperprior distribution:

1/σ2Gamma(aσ2,bσ2)1/\sigma^2 \sim Gamma(a_{\sigma^2}, b_{\sigma^2})

Value

List containing the following elements regarding the results of comparing arm to control:

  • p-val - posterior probability that the difference in means is less than zero

  • treat_effect - posterior mean of difference in means

  • lower_ci - lower limit of the (1-2*alpha)*100% credible interval for difference in means

  • upper_ci - upper limit of the (1-2*alpha)*100% credible interval for difference in means

  • reject_h0 - indicator of whether the null hypothesis was rejected or not (p_val < alpha)

Author(s)

Dominic Magirr, Peter Jacko

Examples

trial_data <- datasim_cont(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
theta = rep(0.25, 3), lambda = rep(0.15, 4), sigma = 1, trend = "linear")

timemachine_cont(data = trial_data, arm = 3)