Package: SimInf 9.8.1

Stefan Widgren

SimInf: A Framework for Data-Driven Stochastic Disease Spread Simulations

Provides an efficient and very flexible framework to conduct data-driven epidemiological modeling in realistic large scale disease spread simulations. The framework integrates infection dynamics in subpopulations as continuous-time Markov chains using the Gillespie stochastic simulation algorithm and incorporates available data such as births, deaths and movements as scheduled events at predefined time-points. Using C code for the numerical solvers and 'OpenMP' (if available) to divide work over multiple processors ensures high performance when simulating a sample outcome. One of our design goals was to make the package extendable and enable usage of the numerical solvers from other R extension packages in order to facilitate complex epidemiological research. The package contains template models and can be extended with user-defined models. For more details see the paper by Widgren, Bauer, Eriksson and Engblom (2019) <doi:10.18637/jss.v091.i12>. The package also provides functionality to fit models to time series data using the Approximate Bayesian Computation Sequential Monte Carlo ('ABC-SMC') algorithm of Toni and others (2009) <doi:10.1098/rsif.2008.0172>.

Authors:Stefan Widgren [aut, cre], Robin Eriksson [aut], Stefan Engblom [aut], Pavol Bauer [aut], Thomas Rosendal [ctb], Ivana Rodriguez Ewerlöf [ctb], Attractive Chaos [cph]

SimInf_9.8.1.tar.gz
SimInf_9.8.1.tar.gz(r-4.5-noble)SimInf_9.8.1.tar.gz(r-4.4-noble)
SimInf_9.8.1.tgz(r-4.4-emscripten)SimInf_9.8.1.tgz(r-4.3-emscripten)
SimInf.pdf |SimInf.html
SimInf/json (API)
NEWS

# Install 'SimInf' in R:
install.packages('SimInf', repos = 'https://cloud.r-project.org')

Bug tracker:https://github.com/stewid/siminf/issues1 issues

Uses libs:
  • gsl– GNU Scientific Library (GSL)
  • openmp– GCC OpenMP (GOMP) support library
Datasets:
  • events_SISe3 - Example data to initialize events for the 'SISe3' model
  • nodes - Example data with spatial distribution of nodes
  • u0_SISe3 - Example data to initialize the 'SISe3' model

On CRAN:

Conda:

gslopenmp

4.78 score 2 stars 563 downloads 10 mentions 54 exports 4 dependencies

Last updated 10 months agofrom:94b2d7b70c. Checks:3 OK. Indexed: no.

TargetResultLatest binary
Doc / VignettesOKMar 21 2025
R-4.5-linux-x86_64OKMar 21 2025
R-4.4-linux-x86_64OKMar 21 2025

Exports:abcboxplotC_codecontinuedistance_matrixedge_properties_to_matrixeventsevents_SEIRevents_SIRevents_SISevents_SISegdatagdata<-get_individualsindegreeindividual_eventsldatalogLikmparsen_generationsn_nodesnode_eventsoutdegreepackage_skeletonpairspfilterplotprevalencepunchcard<-runSEIRselect_matrixselect_matrix<-set_num_threadsshift_matrixshift_matrix<-showSimInf_eventsSimInf_modelSIRSISSISeSISe_spSISe3SISe3_spsummarytrajectoryu0u0_SEIRu0_SIRu0_SISu0_SISeu0<-v0<-

Dependencies:digestlatticeMASSMatrix

Post-process data in a trajectory

Rendered frompost-process-data.Rmdusingknitr::rmarkdownon Mar 21 2025.

Last update: 2024-04-24
Started: 2024-04-24

Scheduled events

Rendered fromscheduled-events.Rmdusingknitr::rmarkdownon Mar 21 2025.

Last update: 2024-04-24
Started: 2021-06-25

SimInf: An R Package for Data-Driven Stochastic Disease Spread Simulations

Rendered fromSimInf.Rnwusingutils::Sweaveon Mar 21 2025.

Last update: 2022-09-03
Started: 2017-06-13

Citation

To cite SimInf in publications use:

Widgren S, Bauer P, Eriksson R, Engblom S (2019). “SimInf: An R Package for Data-Driven Stochastic Disease Spread Simulations.” Journal of Statistical Software, 91(12), 1–42. doi:10.18637/jss.v091.i12.

Corresponding BibTeX entry:

  @Article{,
    title = {{SimInf}: An {R} Package for Data-Driven Stochastic
      Disease Spread Simulations},
    author = {Stefan Widgren and Pavol Bauer and Robin Eriksson and
      Stefan Engblom},
    journal = {Journal of Statistical Software},
    year = {2019},
    volume = {91},
    number = {12},
    pages = {1--42},
    doi = {10.18637/jss.v091.i12},
  }

Readme and manuals

A flexible and efficient framework for data-driven stochastic disease spread simulations

Build Status CRAN status CRAN RStudio mirror downloads Code coverage

SimInf

The package provides an efficient and very flexible framework to conduct data-driven epidemiological modeling in realistic large scale disease spread simulations. The framework integrates infection dynamics in subpopulations as continuous-time Markov chains using the Gillespie stochastic simulation algorithm and incorporates available data such as births, deaths and movements as scheduled events at predefined time-points. Using C code for the numerical solvers and 'OpenMP' (if available) to divide work over multiple processors ensures high performance when simulating a sample outcome. One of our design goals was to make the package extendable and enable usage of the numerical solvers from other R extension packages in order to facilitate complex epidemiological research. The package contains template models and can be extended with user-defined models.

Getting started

You can use one of the predefined compartment models in SimInf, for example, SEIR. But you can also define a custom model 'on the fly' using the model parser method mparse. The method takes a character vector of transitions in the form of X -> propensity -> Y and automatically generates the C and R code for the model. The left hand side of the first arrow (->) is the initial state, the right hand side of the last arrow (->) is the final state, and the propensity is written between the two arrows. The flexibility of the mparse approach allows for quick prototyping of new models or features. To illustrate the mparse functionality, let us consider the SIR model in a closed population i.e., no births or deaths. Let beta denote the transmission rate of spread between a susceptible individual and an infectious individual and gamma the recovery rate from infection (gamma = 1 / average duration of infection). It is also possible to define variables which can then be used in calculations of propensities or in calculations of other variables. A variable is defined by the operator <-. Using a variable for the size of the population, the SIR model can be described as:

library(SimInf)

transitions <- c("S -> beta*S*I/N -> I",
                 "I -> gamma*I -> R",
                 "N <- S+I+R")
compartments <- c("S", "I", "R")

The transitions and compartments variables together with the constants beta and gamma can now be used to generate a model with mparse. The model also needs to be initialised with the initial condition u0 and tspan, a vector of time points where the state of the system is to be returned. Let us create a model that consists of 1000 replicates of a population, denoted a node in SimInf, that each starts with 99 susceptibles, 5 infected and 0 recovered individuals.

n <- 1000
u0 <- data.frame(S = rep(99, n), I = rep(5, n), R = rep(0, n))

model <- mparse(transitions = transitions,
                compartments = compartments,
                gdata = c(beta = 0.16, gamma = 0.077),
                u0 = u0,
                tspan = 1:150)

To generate data from the model and then print some basic information about the outcome, run the following commands:

result <- run(model)
result
#> Model: SimInf_model
#> Number of nodes: 1000
#> Number of transitions: 2
#> Number of scheduled events: 0
#> 
#> Global data
#> -----------
#>  Parameter Value
#>  beta      0.160
#>  gamma     0.077
#> 
#> Compartments
#> ------------
#>      Min. 1st Qu. Median   Mean 3rd Qu.   Max.
#>  S   1.00   19.00  30.00  40.74   60.00  99.00
#>  I   0.00    0.00   4.00   6.87   11.00  47.00
#>  R   0.00   28.00  67.00  56.39   83.00 103.00

There are several functions in SimInf to facilitate analysis and post-processing of simulated data, for example, trajectory, prevalence and plot. The default plot will display the median count in each compartment across nodes as a colored line together with the inter-quartile range using the same color, but with transparency.

plot(result)

plot of chunk mparse-SIR

Most modeling and simulation studies require custom data analysis once the simulation data has been generated. To support this, SimInf provides the trajectory method to obtain a data.frame with the number of individuals in each compartment at the time points specified in tspan. Below is the first 10 lines of the data.frame with simulated data.

trajectory(result)
#>    node time  S I R
#> 1     1    1 98 6 0
#> 2     2    1 98 6 0
#> 3     3    1 98 6 0
#> 4     4    1 99 5 0
#> 5     5    1 97 7 0
#> 6     6    1 98 5 1
#> 7     7    1 99 5 0
#> 8     8    1 99 5 0
#> 9     9    1 97 7 0
#> 10   10    1 97 6 1
...

Finally, let us use the prevalence method to explore the proportion of infected individuals across all nodes. It takes a model object and a formula specification, where the left hand side of the formula specifies the compartments representing cases i.e., have an attribute or a disease and the right hand side of the formula specifies the compartments at risk. Below is the first 10 lines of the data.frame.

prevalence(result, I ~ S + I + R)
#>    time prevalence
#> 1     1 0.05196154
#> 2     2 0.05605769
#> 3     3 0.06059615
#> 4     4 0.06516346
#> 5     5 0.06977885
#> 6     6 0.07390385
#> 7     7 0.07856731
#> 8     8 0.08311538
#> 9     9 0.08794231
#> 10   10 0.09321154
...

Learn more

See the vignette to learn more about special features that the SimInf R package provides, for example, how to:

  • use continuous state variables

  • use the SimInf framework from another R package

  • incorporate available data such as births, deaths and movements as scheduled events at predefined time-points.

Installation

You can install the released version of SimInf from CRAN

install.packages("SimInf")

or use the remotes package to install the development version from GitHub

library(remotes)
install_github("stewid/SimInf")

We refer to section 3.1 in the vignette for detailed installation instructions.

Authors

In alphabetical order: Pavol Bauer ORCID iD, Robin Eriksson ORCID iD, Stefan Engblom ORCID iD, and Stefan Widgren ORCID iD (Maintainer)

Any suggestions, bug reports, forks and pull requests are appreciated. Get in touch.

Citation

SimInf is research software. To cite SimInf in publications, please use:

  • Widgren S, Bauer P, Eriksson R, Engblom S (2019) SimInf: An R Package for Data-Driven Stochastic Disease Spread Simulations. Journal of Statistical Software, 91(12), 1--42. doi: 10.18637/jss.v091.i12

  • Bauer P, Engblom S, Widgren S (2016) Fast event-based epidemiological simulations on national scales. International Journal of High Performance Computing Applications, 30(4), 438--453. doi: 10.1177/1094342016635723

Acknowledgments

This software has been made possible by support from the Swedish Research Council within the UPMARC Linnaeus center of Excellence (Pavol Bauer, Robin Eriksson, and Stefan Engblom), the Swedish Research Council Formas (Stefan Engblom and Stefan Widgren), the Swedish Board of Agriculture (Stefan Widgren), the Swedish strategic research program eSSENCE (Stefan Widgren), and in the framework of the Full Force project, supported by funding from the European Union’s Horizon 2020 Research and Innovation programme under grant agreement No 773830: One Health European Joint Programme (Stefan Widgren).

Versioning

The SimInf package uses semantic versioning.

License

The SimInf package is licensed under the GPLv3.

Help Manual

Help pageTopics
Approximate Bayesian computationabc abc,SimInf_model-method
Coerce to data frameas.data.frame.SimInf_abc
Coerce events to a data frameas.data.frame.SimInf_events
Coerce to data frameas.data.frame.SimInf_indiv_events
Box plot of number of individuals in each compartmentboxplot,SimInf_model-method
Extract the C code from a 'SimInf_model' objectC_code
Run more generations of ABC SMCcontinue continue,SimInf_abc-method
Create a distance matrix between nodes for spatial modelsdistance_matrix
Convert an edge list with properties to a matrixedge_properties_to_matrix
Extract the events from a 'SimInf_model' objectevents events,SimInf_model-method
Example data to initialize events for the 'SEIR' modelevents_SEIR
Example data to initialize events for the 'SIR' modelevents_SIR
Example data to initialize events for the 'SIS' modelevents_SIS
Example data to initialize events for the 'SISe' modelevents_SISe
Example data to initialize events for the 'SISe3' modelevents_SISe3
Extract global data from a 'SimInf_model' objectgdata gdata,SimInf_model-method
Set a global data parameter for a 'SimInf_model' objectgdata<- gdata<-,SimInf_model-method
Extract individuals from 'SimInf_indiv_events'get_individuals get_individuals,SimInf_indiv_events-method
Determine in-degree for each node in a modelindegree
Individual eventsindividual_events
Extract local data from a nodeldata ldata,SimInf_model-method
Log likelihoodlogLik,SimInf_pfilter-method
Model parser to define new models to run in 'SimInf'mparse
Determine the number of generationsn_generations n_generations,SimInf_abc-method
Determine the number of nodes in a modeln_nodes n_nodes,SimInf_model-method
Transform individual events to node events for a modelnode_events node_events,SimInf_indiv_events-method
Example data with spatial distribution of nodesnodes
Determine out-degree for each node in a modeloutdegree
Create a package skeleton from a 'SimInf_model'package_skeleton
Scatterplot of number of individuals in each compartmentpairs,SimInf_model-method
Bootstrap particle filterpfilter pfilter,SimInf_model-method
Display the ABC posterior distributionplot,SimInf_abc-method
Display the distribution of scheduled events over timeplot,SimInf_events-method
Display the distribution of individual events over timeplot,SimInf_indiv_events-method
Display the outcome from a simulated trajectoryplot,SimInf_model-method
Diagnostic plot of a particle filter objectplot,SimInf_pfilter-method
Generic function to calculate prevalence from trajectory dataprevalence
Calculate prevalence from a model object with trajectory dataprevalence,SimInf_model-method
Extract prevalence from running a particle filterprevalence,SimInf_pfilter-method
Set a template for where to record result during a simulationpunchcard<- punchcard<-,SimInf_model-method
Run the SimInf stochastic simulation algorithmrun run,SEIR-method run,SimInf_abc-method run,SimInf_model-method run,SIR-method run,SIS-method run,SISe-method run,SISe3-method run,SISe3_sp-method run,SISe_sp-method
Create an SEIR modelSEIR
Definition of the 'SEIR' modelSEIR-class
Extract the select matrix from a 'SimInf_model' objectselect_matrix select_matrix,SimInf_model-method
Set the select matrix for a 'SimInf_model' objectselect_matrix<- select_matrix<-,SimInf_model-method
Specify the number of threads that SimInf should useset_num_threads
Extract the shift matrix from a 'SimInf_model' objectshift_matrix shift_matrix,SimInf_model-method
Set the shift matrix for a 'SimInf_model' objectshift_matrix<- shift_matrix<-,SimInf_model-method
Print summary of a 'SimInf_abc' objectshow,SimInf_abc-method
Brief summary of 'SimInf_events'show,SimInf_events-method
Print summary of a 'SimInf_indiv_events' objectshow,SimInf_indiv_events-method
Brief summary of 'SimInf_model'show,SimInf_model-method
Brief summary of a 'SimInf_pfilter' objectshow,SimInf_pfilter-method
A Framework for Data-Driven Stochastic Disease Spread SimulationsSimInf-package SimInf
Class '"SimInf_abc"'SimInf_abc-class
Create a 'SimInf_events' objectSimInf_events
Class '"SimInf_events"'SimInf_events-class
Class '"SimInf_indiv_events"'SimInf_indiv_events-class
Create a 'SimInf_model'SimInf_model
Class '"SimInf_model"'SimInf_model-class
Class '"SimInf_pfilter"'SimInf_pfilter-class
Create an SIR modelSIR
Definition of the SIR modelSIR-class
Create an SIS modelSIS
Definition of the SIS modelSIS-class
Create a SISe modelSISe
Create a 'SISe_sp' modelSISe_sp
Definition of the 'SISe_sp' modelSISe_sp-class
Definition of the 'SISe' modelSISe-class
Create a 'SISe3' modelSISe3
Create an 'SISe3_sp' modelSISe3_sp
Definition of the 'SISe3_sp' modelSISe3_sp-class
Definition of the 'SISe3' modelSISe3-class
Detailed summary of a 'SimInf_abc' objectsummary,SimInf_abc-method
Detailed summary of a 'SimInf_events' objectsummary,SimInf_events-method
Detailed summary of a 'SimInf_indiv_events' objectsummary,SimInf_indiv_events-method
Detailed summary of a 'SimInf_model' objectsummary,SimInf_model-method
Detailed summary of a 'SimInf_pfilter' objectsummary,SimInf_pfilter-method
Generic function to extract data from a simulated trajectorytrajectory
Extract data from a simulated trajectorytrajectory,SimInf_model-method
Extract filtered trajectory from running a particle filtertrajectory,SimInf_pfilter-method
Get the initial compartment stateu0 u0,SimInf_indiv_events-method u0,SimInf_model-method
Example data to initialize the 'SEIR' modelu0_SEIR
Example data to initialize the 'SIR' modelu0_SIR
Example data to initialize the 'SIS' modelu0_SIS
Example data to initialize the 'SISe' modelu0_SISe
Example data to initialize the 'SISe3' modelu0_SISe3
Update the initial compartment state u0 in each nodeu0<- u0<-,SimInf_model-method
Update the initial continuous state v0 in each nodev0<- v0<-,SimInf_model-method