Introduction

This vignette describes and demonstrates how SIMplyBee implements quantitative genetics principles for honeybees. Specifically, it describes three different examples where we simulate:

Honey yield - a single colony trait,
Honey yield and Calmness - two colony traits, and
Colony strength and Honey yield - two colony traits where one trait impacts the other one via the number of workers.

We start by loading SIMplyBee and quickly simulating genomes for some founder honeybees. Specifically, we will simulate genomes for 20 individuals with 16 chromosomes and 1000 segregating sites per chromosome.

library(package = "SIMplyBee")
library(package = "ggplot2")
founderGenomes <- quickHaplo(nInd = 20, nChr = 16, segSites = 1000)

Honey yield

This section shows how to simulate one colony trait, honey yield, that is influenced by the queen and workers as well as the environment. We will achieve this by:

setting base population quantitative genetic parameters,
inspecting individual values in the base population,
inspecting individual values in a colony,
calculating colony value,
calculating multi-colony values, and
selecting on colony values.

Base population quantitative genetic parameters

AlphaSimR, and hence SIMplyBee, simulates each individual with its corresponding genome, and quantitative genetic and phenotypic values. To enable this simulation, we must set base population quantitative genetic parameters for the traits of interest in the global simulation parameters via SimParamBee. We must set:

the number of traits,
the number of quantitative trait loci (QTL) that affect the traits,
the distribution of QTL effects,
trait means, and
trait genetic and environmental variances - if we simulate multiple traits, we must also specify genetic and environmental covariances between the traits.

In honeybees, the majority of traits are influenced by the queen and workers. There are many biological mechanisms for these queen and workers effects. Depending on which caste is the main driver of the trait (the queen or workers), we also talk about direct and indirect effects. For example, for honey yield, workers directly affect honey yield by foraging, while the queen indirectly affects honey yield by stimulating workers via pheromone production. The queen and workers effects for a trait can be genetically and environmentally independent or correlated (usually negatively).

Here, we will simulate two traits to represent the queen and workers effects on honey yield. From this point onward we will use the terms queen effect and queen trait interchangeably, and likewise workers effect and workers trait. These two effects (= traits) will give rise to the honey yield trait. We will assume that colony honey yield is approximately normally distributed with a mean of 20 kg and a variance of 4 kg², which implies that almost all colonies will have honey yield between 14 kg and 26 kg, that is, within three standard deviations of the mean (see hist(rnorm(n = 1000, mean = 20, sd = sqrt(4)))). Traits like honey yield have a complex polygenic genetic architecture, so we will assume that this trait is influenced by 100 QTL per chromosome (with 16 chromosomes, this gives us 1600 QTL in total).
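The two numeric claims in this paragraph can be checked with base R alone. The sketch below computes the share of a normal distribution with mean 20 and variance 4 that falls between 14 kg and 26 kg, and the total QTL count; the variable names are ours and are not part of SIMplyBee.

```r
# Share of a Normal(mean = 20, variance = 4) distribution between 14 and 26 kg
colonyMean <- 20
colonySd <- sqrt(4)
pInRange <- pnorm(26, mean = colonyMean, sd = colonySd) -
  pnorm(14, mean = colonyMean, sd = colonySd)
pInRange  # probability mass within three standard deviations of the mean

# Total number of QTL implied by 100 QTL on each of 16 chromosomes
nQtlTotal <- 100 * 16
nQtlTotal
```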

We will first initiate global simulation parameters and set the mean of queen effects to 10 kg with a genetic variance of 1 kg². For workers effects we also start from a mean of 10 kg and a genetic variance of 1 kg², but we divide both by the expected number of workers in a colony (SP$nWorkers). The mean and variance for the queen effect are therefore larger than those for a single worker’s effect, because there is one queen and many workers in a colony and we assume that workers effects “accumulate”. Deciding how to split the colony mean between queen and workers effects will depend on the individual to colony mapping function, which we will describe in the Colony value sub-section.

# Global simulation parameters
SP <- SimParamBee$new(founderGenomes)

nQtlPerChr <- 100

# Genetic parameters for queen and workers effects - each represented by a trait
mean <- c(10, 10 / SP$nWorkers)
varA <- c(1, 1 / SP$nWorkers)

We next set the genetic correlation between the queen and workers effects to -0.5 to reflect the commonly observed antagonistic relationship between these effects. With all the quantitative genetic parameters defined, we now add two additive traits to global simulation parameters and name them queenTrait and workersTrait. These parameters drive the simulation of QTL effects. Read about all the other trait simulation options in AlphaSimR via: vignette(topic = "traits", package="AlphaSimR").

corA <- matrix(data = c( 1.0, -0.5, 
                        -0.5,  1.0), nrow = 2, byrow = TRUE)
SP$addTraitA(nQtlPerChr = nQtlPerChr, mean = mean, var = varA, corA = corA,
             name = c("queenTrait", "workersTrait"))

Finally, we set the environmental variance of the queen and workers effects to 3 kg² and we again scale the worker variance by the expected number of workers. Contrary to the negative genetic correlation, we here assume that environmental correlation between the queen and workers effects is slightly positive, 0.3. This is just an example! These parameters should be based on literature or simulation scenarios of interest.

varE <- c(3, 3 / SP$nWorkers)
corE <- matrix(data = c(1.0, 0.3, 
                        0.3, 1.0), nrow = 2, byrow = TRUE)
SP$setVarE(varE = varE, corE = corE)
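To see what these variances and correlations imply together, the base R sketch below turns them into covariance matrices and checks that both are positive definite. It assumes 100 workers per colony as a stand-in for SP$nWorkers (this matches the full colony size printed later in this vignette); corToCov() is a hypothetical helper written for this illustration, not a SIMplyBee or AlphaSimR function.

```r
nWorkersAssumed <- 100  # assumption: stand-in for SP$nWorkers

varA <- c(queenTrait = 1, workersTrait = 1 / nWorkersAssumed)
corA <- matrix(c(1.0, -0.5, -0.5, 1.0), nrow = 2, byrow = TRUE)
varE <- c(queenTrait = 3, workersTrait = 3 / nWorkersAssumed)
corE <- matrix(c(1.0, 0.3, 0.3, 1.0), nrow = 2, byrow = TRUE)

# Covariance = D %*% correlation %*% D, where D holds standard deviations
corToCov <- function(v, r) {
  d <- diag(sqrt(v))
  out <- d %*% r %*% d
  dimnames(out) <- list(names(v), names(v))
  out
}
covA <- corToCov(varA, corA)
covE <- corToCov(varE, corE)
covA
covE

# Both matrices must be positive definite to be valid covariance matrices
all(eigen(covA, symmetric = TRUE)$values > 0)
all(eigen(covE, symmetric = TRUE)$values > 0)
```

Because the workers variances are divided by the number of workers, the covariances between the queen and workers effects are small in absolute terms even though the correlations are moderate.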

Individual values in the base population

Now we create a base population of virgin queens. Since we defined two traits, all honeybees in the simulation will have genetic and phenotypic values for both traits. The genetic values are stored in the gv slot of each Pop object, while phenotypic values are stored in the pheno slot.
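The call that creates basePop and prints its values is not shown in this text. A minimal sketch, assuming the same createVirginQueens() call that appears later in this vignette; it continues the session above and relies on founderGenomes and the global SP object.

```r
# Continues the session above: uses founderGenomes and the global SP object
basePop <- createVirginQueens(founderGenomes)

head(basePop@gv)     # genetic values, one column per trait
head(basePop@pheno)  # phenotypic values, one column per trait
```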

#>      queenTrait workersTrait
#> [1,]   9.965567  -0.03497046
#> [2,]   9.041226   0.09604636
#> [3,]   9.117323   0.23244625
#> [4,]  12.129616   0.01456447
#> [5,]  10.360236   0.14484792
#> [6,]  10.183191   0.21748808
#>      queenTrait workersTrait
#> [1,]   7.530149  -0.08457027
#> [2,]  11.339914   0.29476512
#> [3,]  11.174156   0.34576149
#> [4,]  12.008241   0.21538705
#> [5,]   9.047974   0.32493533
#> [6,]   9.878122   0.48272616

Note that these are virgin queens, yet we obtained queen and workers effect values for them! Is this wrong? No! Virgin queens carry DNA with genes that are differentially expressed in different castes, and these genes only show in the phenotype of the caste that expresses them. Hence, virgin queens have genetic values for the queen and workers effects, but they might never actually express these effects. In this simulation virgin queens also obtained phenotypic values for both effects. This is technically incorrect, because virgin queens don’t express genes for the workers effect at all, and they do not express the queen effect until they become the queen of a colony. We can treat these phenotypic values as the values we would see if these virgin queens expressed these traits. We will show later in the Colony value sub-section how we use these traits from different castes. If the existence of these phenotypic values for certain castes is a hindrance, we can always remove them from population or colony objects by modifying the corresponding slots as required.

As with the virgin queens, drones also carry DNA with genes that are expressed in different castes. Therefore, drones will also have genetic (and phenotypic) values for the queen and workers effects on honey yield, even though they do not contribute to this trait in a colony.
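The call that creates the drones is also not shown in this text. The sketch below is an assumption: the father IDs 21 to 35 in the colony created in the next sub-section suggest 15 drones, which is consistent with three drones from each of the first five virgin queens. It continues the session above.

```r
# Continues the session above: three drones from each of the first five queens
# (nInd = 3 is inferred from the 15 father IDs shown later, not from the source)
drones <- createDrones(x = basePop[1:5], nInd = 3)
head(drones@gv)
```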

#>      queenTrait workersTrait
#> [1,]  10.929188 -0.340134465
#> [2,]   8.625386 -0.016881785
#> [3,]  11.602196 -0.211572757
#> [4,]  10.788908 -0.078811563
#> [5,]  10.550898 -0.008195961
#> [6,]  10.854125  0.105663439

Individual values in a colony

We continue by creating a colony from one base population virgin queen, crossing it, and adding some workers.

colony <- createColony(x = basePop[6])
colony <- cross(x = colony, drones = drones, checkCross = "warning")
colony <- addWorkers(x = colony, nInd = 50)
colony
#> An object of class "Colony" 
#> Id: 1 
#> Location: 0 0 
#> Queen: 6 
#> Number of fathers: 15 
#> Number of workers: 50 
#> Number of drones: 0 
#> Number of virgin queens: 0 
#> Has split: FALSE 
#> Has swarmed: FALSE 
#> Has superseded: FALSE 
#> Has collapsed: FALSE 
#> Is productive: FALSE

We can access the genetic and phenotypic values of colony members with functions getGv() and getPheno(), both of which have the caste argument (see more via help(getGv)).

getGv(colony, caste = "queen")
#>   queenTrait workersTrait
#> 6   10.18319    0.2174881
getGv(colony, caste = "workers") |> head(n = 4)
#>    queenTrait workersTrait
#> 36  10.372806  -0.04262137
#> 37   8.177911   0.18171505
#> 38  11.738447   0.17042119
#> 39  10.070799   0.16660793

getPheno(colony, caste = "queen")
#>   queenTrait workersTrait
#> 6   9.878122    0.4827262
getPheno(colony, caste = "workers") |> head(n = 4)
#>    queenTrait workersTrait
#> 36  11.463704   0.09109294
#> 37   8.460375   0.54924103
#> 38  12.439422  -0.09148424
#> 39   9.624532   0.32441182

For convenience, there are also alias functions for accessing the genetic and phenotypic values of each caste directly.

getQueenGv(colony)
#>   queenTrait workersTrait
#> 6   10.18319    0.2174881
getWorkersGv(colony) |> head(n = 4)
#>    queenTrait workersTrait
#> 36  10.372806  -0.04262137
#> 37   8.177911   0.18171505
#> 38  11.738447   0.17042119
#> 39  10.070799   0.16660793

getQueenPheno(colony)
#>   queenTrait workersTrait
#> 6   9.878122    0.4827262
getWorkersPheno(colony) |> head(n = 4)
#>    queenTrait workersTrait
#> 36  11.463704   0.09109294
#> 37   8.460375   0.54924103
#> 38  12.439422  -0.09148424
#> 39   9.624532   0.32441182

Some phenotypes, such as honey yield, are only expressed if the colony is at full size. This is achieved by the buildUp() colony event function, which adds workers and drones and hence turns on the production status of the colony (to TRUE). SIMplyBee includes a function isProductive() to check the production status of a colony.

# Check if colony is productive
isProductive(colony)
#> [1] FALSE

# Build-up the colony and check the production status again
colony <- buildUp(colony)
colony
#> An object of class "Colony" 
#> Id: 1 
#> Location: 0 0 
#> Queen: 6 
#> Number of fathers: 15 
#> Number of workers: 100 
#> Number of drones: 100 
#> Number of virgin queens: 0 
#> Has split: FALSE 
#> Has swarmed: FALSE 
#> Has superseded: FALSE 
#> Has collapsed: FALSE 
#> Is productive: TRUE
isProductive(colony)
#> [1] TRUE

For the ease of further demonstration, we now combine workers’ values into a single data.frame.

# Collate genetic and phenotypic values of workers
df <- data.frame(id = colony@workers@id,
                 mother = colony@workers@mother,
                 father = colony@workers@father,
                 gvQueenTrait = colony@workers@gv[, "queenTrait"],
                 gvWorkersTrait = colony@workers@gv[, "workersTrait"],
                 pvQueenTrait =  colony@workers@pheno[, "queenTrait"],
                 pvWorkersTrait = colony@workers@pheno[, "workersTrait"])
head(df)
#>   id mother father gvQueenTrait gvWorkersTrait pvQueenTrait pvWorkersTrait
#> 1 86      6     26    10.045872     0.24009914     8.405167     0.13298782
#> 2 87      6     29     9.185191     0.23421770     9.138066     0.29873963
#> 3 88      6     22     8.696884     0.21524027    11.368281     0.35498569
#> 4 89      6     25    10.910608     0.09229883    12.149449    -0.03726173
#> 5 90      6     31    11.165804     0.09628423    13.313813     0.02256039
#> 6 91      6     21    10.981807     0.06640661    12.804729     0.28110716

To visualise correlation between queen and workers effects in workers, we plot these effect values against each other.

# Covariation between queen and workers effect genetic values in workers
p <- ggplot(data = df, aes(x = gvQueenTrait, y = gvWorkersTrait)) +
  xlab("Genetic value for the queen effect") +
  ylab("Genetic value for the workers effect") +
  geom_point() +
  theme_classic()
print(p)

In SIMplyBee, we know genetic values of all individuals, including drones that the queen mated with (=fathers in a colony)!

# Variation in patriline genetic values
getFathersGv(colony)
#>    queenTrait workersTrait
#> 21  10.929188 -0.340134465
#> 22   8.625386 -0.016881785
#> 23  11.602196 -0.211572757
#> 24  10.788908 -0.078811563
#> 25  10.550898 -0.008195961
#> 26  10.854125  0.105663439
#> 27   9.094691  0.247657454
#> 28   7.981797  0.384870618
#> 29   8.451045  0.290055753
#> 30  10.964592  0.015801886
#> 31  12.160759  0.195359464
#> 32   8.387179  0.181076934
#> 33  11.991553  0.200568972
#> 34   9.590299  0.454633741
#> 35   9.317407  0.286412551

Knowing the father of each worker, we inspect variation in the distribution of genetic values of workers by patriline (workers from a single father drone) for the workers effect.
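The plotting code for this step is not shown in this text. The base R sketch below illustrates one way to summarise and plot values by patriline. It uses a small simulated toy data frame (toy, with hypothetical fathers and values) rather than the colony above; with the real data, df and its father and gvWorkersTrait columns would take the place of toy.

```r
set.seed(1)  # toy data only; not the values from the colony above
toy <- data.frame(
  father = factor(rep(c("21", "22", "23"), each = 10)),
  gvWorkersTrait = rnorm(n = 30, mean = 0.1, sd = 0.1)
)

# Mean and standard deviation of the workers effect within each patriline
patrilineSummary <- aggregate(gvWorkersTrait ~ father, data = toy,
                              FUN = function(x) c(mean = mean(x), sd = sd(x)))
patrilineSummary

# Boxplot of the same values by patriline
boxplot(gvWorkersTrait ~ father, data = toy,
        xlab = "Father (patriline)",
        ylab = "Genetic value for the workers effect")
```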

Colony value

However, in honeybees we usually don’t observe values on individuals, but on a colony. SIMplyBee provides functions for mapping individual values to a colony value. The general function for this is calcColonyValue(), which can combine any value and trait from any caste. There are also aliases calcColonyGv() and calcColonyPheno(). These functions require users to specify the so-called mapping function (via the FUN argument). The mapping function specifies queen and workers traits (potentially also drone traits) and what function we want to apply to each of them before mapping them to the colony value(s). We can also specify whether the colony value(s) depend on the production status. For example, if a colony is not productive, its honey yield would be 0 or unobserved. SIMplyBee provides a general mapping function mapCasteToColonyValue() and aliases mapCasteToColonyGv() and mapCasteToColonyPheno(). These functions have arguments to cater for various situations. By default, they first calculate caste values: leave the queen’s value as it is, sum workers’ values, potentially sum drones’ values, and lastly sum all these caste values together into a colony value. Users can provide their own mapping function(s) too!
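The default mapping described above (queen value as is, plus the sum of workers’ values) can be sketched in a few lines of base R. This is an illustration with made-up numbers and a hypothetical helper toyColonyValue(); it is not the SIMplyBee implementation and it ignores drones and the production status.

```r
# Toy phenotypic values (hypothetical numbers, not simulation output)
queenValue <- 10
workersValues <- rep(0.1, times = 100)

# Sketch of the default mapping: queen value as is + sum of workers' values
toyColonyValue <- function(queenValue, workersValues) {
  queenValue + sum(workersValues)
}

toyColonyValue(queenValue, workersValues)        # full colony
toyColonyValue(queenValue, workersValues[1:50])  # half of the workers
toyColonyValue(queenValue, numeric(0))           # no workers: queen part remains
```

With these numbers the queen and the workers each contribute about half of the full colony value, so the colony value shrinks towards the queen’s value as workers are removed.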

We now calculate honey yield for our colony - a single value for the colony.

# Colony phenotype value
calcColonyPheno(colony, queenTrait = "queenTrait", workersTrait = "workersTrait")
#>          [,1]
#> [1,] 26.70137
help(calcColonyPheno)
help(mapCasteToColonyPheno)

These colony values are not stored in a colony, because they change as the colony changes due to various events. For example, reducing the number of workers will reduce the colony honey yield.

# Colony phenotype value after removing 50% of workers
removeWorkers(colony, p = 0.5) |>
  calcColonyPheno(queenTrait = "queenTrait", workersTrait = "workersTrait")
#>          [,1]
#> [1,] 17.30785

Please note that we assumed that the queen contributes half to colony honey yield and workers contribute the other half. This means that removing workers will still give a non-zero honey yield! This shows that we have to design the mapping between individual, caste, and colony values with care!

# Colony phenotype value after removing 99% of workers
removeWorkers(colony, p = 0.99) |>
  calcColonyPheno(queenTrait = "queenTrait", workersTrait = "workersTrait")
#>          [,1]
#> [1,] 10.07485

Finally, note that SIMplyBee currently does not provide functionality for breeding values, dominance deviations, and epistatic deviations at caste and colony levels, despite the availability of AlphaSimR bv(), dd(), and aa() functions. This is because we have to check or develop theory on how to calculate these values across active colonies. Hence we currently advise against the use of AlphaSimR bv(), dd(), and aa() functions with SIMplyBee, as the output of these functions could be easily misinterpreted.

MultiColony values

The same functions can be used on a MultiColony class object. Let’s create an apiary.

apiary <- createMultiColony(basePop[7:20])
drones <- createDrones(basePop[1:5], nInd = 100)
droneGroups <- pullDroneGroupsFromDCA(drones, n = nColonies(apiary), nDrones = 15)
apiary <- cross(x = apiary, drones = droneGroups, checkCross = "warning")
apiary <- buildUp(apiary)

We can extract the genetic and phenotypic values from multiple colonies in the same manner as from a single colony, by using get*Gv() and get*Pheno() functions. The output of these functions is a named list with values for each colony or a single matrix if we set the collapse argument to TRUE.

getQueenGv(apiary) |> head(n = 4)
#> $`2`
#>   queenTrait workersTrait
#> 7   9.717679   0.03281628
#> 
#> $`3`
#>   queenTrait workersTrait
#> 8   8.871149  -0.07106226
#> 
#> $`4`
#>   queenTrait workersTrait
#> 9   11.33881  0.002700673
#> 
#> $`5`
#>    queenTrait workersTrait
#> 10   10.84765   0.07451731
getQueenGv(apiary, collapse = TRUE) |> head(n = 4)
#>    queenTrait workersTrait
#> 7    9.717679  0.032816282
#> 8    8.871149 -0.071062258
#> 9   11.338810  0.002700673
#> 10  10.847647  0.074517312

In a similar manner, we can calculate colony value for all the colonies in our apiary, where the row names of the output represent colony IDs.

colonyGv <- calcColonyGv(apiary)
colonyPheno <- calcColonyPheno(apiary)
data.frame(colonyGv, colonyPheno)
#>     colonyGv colonyPheno
#> 2  11.821739   13.533935
#> 3   8.945553    6.735730
#> 4  16.714493   11.145926
#> 5  19.397271   21.282199
#> 6  18.876614   19.034997
#> 7  29.391127   30.468066
#> 8  24.433618   18.785369
#> 9  15.584057   13.854266
#> 10 22.025930   25.849902
#> 11 25.262975   26.124538
#> 12  9.232886    7.130924
#> 13 13.458898   12.134866
#> 14 22.614498   24.491005
#> 15 23.861256   22.038497

Selection on colony values

Since the aim of selection is to select the best individuals or colonies for reproduction, we could select the best colony in our apiary based on either genetic or phenotypic value for grafting the new generation of virgin queens. We can use the function selectColonies(), which takes a matrix of colony values (the output of the calcColonyValue() function). The default behavior is to select the colonies with the highest values (argument selectTop set to TRUE), but you can also select the colonies with the lowest values (argument selectTop set to FALSE).

# Select the best colony based on gv
selectColonies(apiary, n = 1, by = colonyGv)
#> An object of class "MultiColony" 
#> Number of colonies: 1 
#> Are empty: 0 
#> Are NULL: 0 
#> Have split: 0 
#> Have swarmed: 0 
#> Have superseded: 0 
#> Have collapsed: 0 
#> Are productive: 0
# Select the best colony based on phenotype
selectColonies(apiary, n = 1, by = colonyPheno)
#> An object of class "MultiColony" 
#> Number of colonies: 1 
#> Are empty: 0 
#> Are NULL: 0 
#> Have split: 0 
#> Have swarmed: 0 
#> Have superseded: 0 
#> Have collapsed: 0 
#> Are productive: 0

The same functionality is implemented in pullColonies() and removeColonies().
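The ranking idea behind selecting the top or bottom colonies can be illustrated in base R. The sketch below uses the first five colony genetic values from the printout above and plain sorting; it is an illustration of the selection logic, not the internals of selectColonies().

```r
# Colony genetic values copied from the printout above (first five colonies)
colonyGv <- c("2" = 11.821739, "3" = 8.945553, "4" = 16.714493,
              "5" = 19.397271, "6" = 18.876614)

# Highest value first (the selectTop = TRUE idea); reverse for the lowest
rankedTop <- names(sort(colonyGv, decreasing = TRUE))
rankedBottom <- names(sort(colonyGv, decreasing = FALSE))

rankedTop[1]     # colony that would be picked when selecting the highest value
rankedBottom[1]  # colony that would be picked when selecting the lowest value
```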

Honey yield and Calmness

In this section we expand simulation to two uncorrelated colony traits with queen and workers effects, honey yield and calmness. We follow the same recipe as in the previous section where we simulated only one colony trait.

We first reinitialize the global simulation parameters because we will define new traits. For honey yield we will use the same parameters as before, while for calmness trait we will assume that the trait is scored continuously in such a way that negative values are undesirable and positive values are desirable with zero being population mean. We will further assume the same variances for calmness as for honey yield, and a genetic (and environmental) correlation between the queen and workers effects of -0.4 (and 0.2) for calmness. We assume no genetic or environmental correlation between honey yield and calmness. Beware, this is just an example to show you how to simulate multiple colony traits - we have made up these parameters - please use literature estimates in your simulations!

# Global simulation parameters
SP <- SimParamBee$new(founderGenomes)

nQtlPerChr <- 100

# Quantitative genetic parameters - for two traits, each with the queen and workers effects
meanP <- c(10, 10 / SP$nWorkers, 0, 0)
varA <- c(1, 1 / SP$nWorkers, 1, 1 / SP$nWorkers)
corA <- matrix(data = c( 1.0, -0.5,  0.0,  0.0, 
                        -0.5,  1.0,  0.0,  0.0,
                         0.0,  0.0,  1.0, -0.4, 
                         0.0,  0.0, -0.4,  1.0), nrow = 4, byrow = TRUE)
SP$addTraitA(nQtlPerChr = nQtlPerChr, mean = meanP, var = varA, corA = corA,
             name = c("yieldQueenTrait", "yieldWorkersTrait",
                      "calmQueenTrait", "calmWorkersTrait"))

varE <- c(3, 3 / SP$nWorkers, 3, 3 / SP$nWorkers)
corE <- matrix(data = c(1.0, 0.3, 0.0, 0.0,
                        0.3, 1.0, 0.0, 0.0,
                        0.0, 0.0, 1.0, 0.2,
                        0.0, 0.0, 0.2, 1.0), nrow = 4, byrow = TRUE)
SP$setVarE(varE = varE, corE = corE)

We continue by creating a base population of virgin queens and from them an apiary with 15 full-sized colonies.

basePop <- createVirginQueens(founderGenomes)
drones <- createDrones(x = basePop[1:5], nInd = 100)
apiary <- createMultiColony(basePop[6:20])
droneGroups <- pullDroneGroupsFromDCA(drones, nColonies(apiary), nDrones = 15)
apiary <- cross(x = apiary, drones = droneGroups, checkCross = "warning")
apiary <- buildUp(apiary)
apiary
#> An object of class "MultiColony" 
#> Number of colonies: 15 
#> Are empty: 0 
#> Are NULL: 0 
#> Have split: 0 
#> Have swarmed: 0 
#> Have superseded: 0 
#> Have collapsed: 0 
#> Are productive: 15

We can again inspect the genetic (and phenotypic) values of all individuals in each colony and whole apiary with get*Gv() and get*Pheno() functions. Now, the output contains four traits representing the queen and workers effect for honey yield and calmness. These functions also take an nInd argument to sample a number of individuals along with their values.

getQueenGv(apiary) |> head(n = 4)
#> $`1`
#>   yieldQueenTrait yieldWorkersTrait calmQueenTrait calmWorkersTrait
#> 6         11.5581         0.1111487     -0.1203044       0.03072213
#> 
#> $`2`
#>   yieldQueenTrait yieldWorkersTrait calmQueenTrait calmWorkersTrait
#> 7        9.662136         0.0567672      -1.402382       0.05300574
#> 
#> $`3`
#>   yieldQueenTrait yieldWorkersTrait calmQueenTrait calmWorkersTrait
#> 8        9.768168         0.1286841       1.261854       -0.2348537
#> 
#> $`4`
#>   yieldQueenTrait yieldWorkersTrait calmQueenTrait calmWorkersTrait
#> 9        11.62177       -0.01690015      -1.649686        0.1331017
getWorkersPheno(apiary, nInd = 3) |> head(n = 4)
#> $`1`
#>     yieldQueenTrait yieldWorkersTrait calmQueenTrait calmWorkersTrait
#> 521        8.737670         0.2257020      0.3400217       0.07866688
#> 522       12.461293         0.2660533     -0.7316516       0.12001798
#> 523        9.135413         0.2181086     -1.8576530      -0.04229327
#> 
#> $`2`
#>     yieldQueenTrait yieldWorkersTrait calmQueenTrait calmWorkersTrait
#> 721       11.543074        0.44142131      -1.150784      -0.17528367
#> 722       11.331776        0.35839383      -3.264752      -0.09931115
#> 723        9.105517        0.03644309       1.051117      -0.10553236
#> 
#> $`3`
#>     yieldQueenTrait yieldWorkersTrait calmQueenTrait calmWorkersTrait
#> 921       10.484049        0.08295194     -2.7294520       -0.4367808
#> 922       13.248816        0.13739981      2.1973218       -0.5241773
#> 923        7.076762        0.08266278      0.5863629       -0.2096949
#> 
#> $`4`
#>      yieldQueenTrait yieldWorkersTrait calmQueenTrait calmWorkersTrait
#> 1121        8.858929         0.0588574      -0.587965      -0.14115407
#> 1122       11.356753         0.1201757      -1.170234      -0.32376203
#> 1123        9.239399         0.2293877      -1.071277      -0.08661219

Now, we calculate colony genetic and phenotypic values for all colonies in the apiary. Since we are simulating two traits, honey yield and calmness, we have two ways to calculate corresponding colony values. The first way is to use the default mapCasteToColony*() function in calcColony*() and only define additional arguments as shown here:

colonyValues <- calcColonyPheno(apiary,
                                queenTrait = c("yieldQueenTrait", "calmQueenTrait"),
                                workersTrait = c("yieldWorkersTrait", "calmWorkersTrait"),
                                traitName = c("yield", "calmness"),
                                checkProduction = c(TRUE, FALSE)) |> as.data.frame()
colonyValues
#>       yield    calmness
#> 1  16.92243  -2.4040443
#> 2  14.58087  -2.4613780
#> 3  19.48068 -14.3319728
#> 4  10.92329   1.9963639
#> 5  20.80701  -1.0643402
#> 6  20.92485  -4.2732469
#> 7  20.84005 -10.9421929
#> 8  30.31972  -7.3953653
#> 9  17.41895   4.8079257
#> 10 15.93591   1.2212312
#> 11 15.71014   1.7466962
#> 12 19.64624  -0.3152329
#> 13 20.70472  -4.4114528
#> 14 11.26517  -3.1649065
#> 15 17.65836  -6.0928918

The second way is to create our own mapping function. An equivalent outcome to the above is shown below just to demonstrate the use of your own function, although here we simply reuse mapCasteToColonyPheno() twice ;)

myMapCasteToColonyPheno <- function(colony) {
  yield <- mapCasteToColonyPheno(colony,
                                 queenTrait = "yieldQueenTrait",
                                 workersTrait = "yieldWorkersTrait",
                                 traitName = "yield",
                                 checkProduction = TRUE)
  calmness <- mapCasteToColonyPheno(colony,
                                    queenTrait = "calmQueenTrait",
                                    workersTrait = "calmWorkersTrait",
                                    traitName = "calmness",
                                    checkProduction = FALSE)
  return(cbind(yield, calmness))
}
colonyValues <- calcColonyPheno(apiary, FUN = myMapCasteToColonyPheno) |> as.data.frame()
colonyValues
#>       yield    calmness
#> 1  16.92243  -2.4040443
#> 2  14.58087  -2.4613780
#> 3  19.48068 -14.3319728
#> 4  10.92329   1.9963639
#> 5  20.80701  -1.0643402
#> 6  20.92485  -4.2732469
#> 7  20.84005 -10.9421929
#> 8  30.31972  -7.3953653
#> 9  17.41895   4.8079257
#> 10 15.93591   1.2212312
#> 11 15.71014   1.7466962
#> 12 19.64624  -0.3152329
#> 13 20.70472  -4.4114528
#> 14 11.26517  -3.1649065
#> 15 17.65836  -6.0928918

Again, we can now select the best colony based on the best phenotypic value for either yield, calmness, or an index of both. Let’s say that both traits are equally important, so we select on a weighted sum of both of them - we will use the AlphaSimR selIndex() function that enables this calculation along with scaling of the traits. We then multiply the index by 10 and add 100, so that it has a mean of 100 and a spread on the order of 10 units (the standard deviation is exactly 10 only if the index itself, rather than each trait, is standardised).

colonyValues$Index <- selIndex(Y = colonyValues, b = c(0.5, 0.5), scale = TRUE) * 10 + 100
bestColony <- selectColonies(apiary, n = 1, by = colonyValues$Index)
getId(bestColony)
#> [1] 8

We see that we selected the colony with ID “8”, but we could be selecting a different colony under different selection criteria (yield, calmness, or index).
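The index calculation can be reproduced in base R from the colony values printed above. The sketch below standardises each trait and takes an equally weighted sum, which we assume mirrors what selIndex() does with scale = TRUE; it then compares the best colony under each criterion and shows how to rescale the index itself if a standard deviation of exactly 10 is wanted.

```r
# Colony phenotypic values copied from the printout above
colonyValues <- data.frame(
  yield = c(16.92243, 14.58087, 19.48068, 10.92329, 20.80701, 20.92485,
            20.84005, 30.31972, 17.41895, 15.93591, 15.71014, 19.64624,
            20.70472, 11.26517, 17.65836),
  calmness = c(-2.4040443, -2.4613780, -14.3319728, 1.9963639, -1.0643402,
               -4.2732469, -10.9421929, -7.3953653, 4.8079257, 1.2212312,
               1.7466962, -0.3152329, -4.4114528, -3.1649065, -6.0928918),
  row.names = as.character(1:15)
)

# Standardise each trait, then take an equally weighted sum
weights <- c(0.5, 0.5)
index <- drop(scale(as.matrix(colonyValues)) %*% weights) * 10 + 100

# Best colony under each criterion
names(index)[which.max(index)]
rownames(colonyValues)[which.max(colonyValues$yield)]
rownames(colonyValues)[which.max(colonyValues$calmness)]

mean(index)  # 100 by construction, because each scaled trait has mean 0
sd(index)    # below 10 unless the two traits are perfectly correlated

# Rescale the index itself if a standard deviation of exactly 10 is wanted
indexRescaled <- drop(scale(index)) * 10 + 100
sd(indexRescaled)
```

In this apiary the index and yield point to the same colony, while selecting on calmness alone would pick a different one.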

Strength and honey yield

In this section we change the simulation to two traits where the phenotype realisation of the first trait affects the phenotype realisation of the second trait. Specifically, we will assume that the queen’s fecundity, and hence the number of workers, is under the genetic effect of the queen and her environment. Furthermore, we will assume as before that colony honey yield is due to the queen effect and workers effect. Since the value of the workers effect depends on the number of workers, we obtain a correlation between fecundity and honey yield, even if these traits are uncorrelated at the queen level. We emphasise that this is just an example and the biology of these traits might be different.

We follow the same logic as before and simulate three traits that will contribute to two colony traits: the queen’s fecundity, that is colony strength, and honey yield. We assume that fecundity is only due to the queen (and not the workers), hence we simulate only the queen effect for this trait. For honey yield we again assume that both the queen and workers contribute to the colony value. For speed of simulation we only simulate 100 workers per colony on average and split the honey yield mean between the queen and workers. We measure fecundity with the number of workers, which is a count variable, and for such variables the Poisson distribution is a good model. This distribution has just one parameter (lambda) that represents both the mean and variance of the variable. To this end we set the phenotypic variance to 100 and split it into 25 for genetic and 75 for environmental variance. As before we warn that these are just example values to demonstrate the code functionality and do not necessarily reflect published values!
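The variance split can be checked with a short base R sketch. It confirms that the genetic and environmental variances add up to the Poisson variance of 100, computes the implied heritability, and compares a toy normal phenotype with a Poisson variable with the same lambda. The independent normal deviations are an assumption of this illustration, not a statement about how SIMplyBee samples the values.

```r
lambda <- 100   # Poisson mean and variance for the number of workers
varG <- 25      # genetic variance of fecundity
varE <- 75      # environmental variance of fecundity
stopifnot(varG + varE == lambda)
h2 <- varG / (varG + varE)  # implied heritability on this scale
h2

# Toy check: sum of independent normal genetic and environmental deviations
set.seed(42)
n <- 1e5
pheno <- lambda + rnorm(n, sd = sqrt(varG)) + rnorm(n, sd = sqrt(varE))
c(mean = mean(pheno), var = var(pheno))

# A Poisson variable with the same lambda has matching mean and variance
counts <- rpois(n, lambda = lambda)
c(mean = mean(counts), var = var(counts))
```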

# Global simulation parameters
SP <- SimParamBee$new(founderGenomes)

# Quantitative genetic parameters
# - the first trait has only the queen effect
# - the second trait has both the queen and workers effects
nWorkers <- 100
mean <- c(nWorkers, 10, 10 / nWorkers)
varA <- c(25, 1, 1 / nWorkers)
corA <- matrix(data = c(1.0,  0.0,  0.0,
                        0.0,  1.0, -0.5, 
                        0.0, -0.5,  1.0), nrow = 3, byrow = TRUE)
SP$addTraitA(nQtlPerChr = 100, mean = mean, var = varA, corA = corA,
             name = c("fecundityQueenTrait", "yieldQueenTrait", "yieldWorkersTrait"))

varE <- c(75, 3, 3 / nWorkers)
corE <- matrix(data = c(1.0, 0.0, 0.0,
                        0.0, 1.0, 0.3,
                        0.0, 0.3, 1.0), nrow = 3, byrow = TRUE)
SP$setVarE(varE = varE, corE = corE)

We continue by creating an apiary with 15 colonies.

basePop <- createVirginQueens(founderGenomes)
drones <- createDrones(x = basePop[1:5], nInd = 100)
apiary <- createMultiColony(basePop[6:20])
droneGroups <- pullDroneGroupsFromDCA(drones, nColonies(apiary), nDrones = 15)
apiary <- cross(x = apiary, drones = droneGroups, checkCross = "warning")

Let’s explore the queens’ genetic and phenotypic values for fecundity and honey yield. The printouts below show quite some variation in fecundity between queens at the genetic, but particularly at the phenotypic level. This is a small example, so we should not read too much into the correlations between these three variables. However, if you restart this simulation many times, you will notice that genetic values show zero correlation on average between fecundityQueenTrait and the other two traits and a negative correlation on average between yieldQueenTrait and yieldWorkersTrait, just as we defined in the global simulation parameters.
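The code behind these printouts is not shown in this text, and the queenPheno object is used again further down. A minimal sketch that continues the session above, assuming the getQueenGv() and getQueenPheno() functions with collapse = TRUE as used earlier in this vignette; the exact calls in the original may differ.

```r
# Continues the session above: one row per queen, one column per trait
queenGv <- getQueenGv(apiary, collapse = TRUE)
queenPheno <- getQueenPheno(apiary, collapse = TRUE)

queenGv          # genetic values
queenPheno       # phenotypic values
cor(queenPheno)  # correlations between the three traits across queens
```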

#>    fecundityQueenTrait yieldQueenTrait yieldWorkersTrait
#> 6            106.44670        9.718913       0.121950923
#> 7             92.83568       11.172911      -0.157863672
#> 8             88.83579       10.066414       0.162354930
#> 9            101.45331       10.465125       0.081443449
#> 10           103.97129        7.404775       0.125305869
#> 11            99.87067       10.623552       0.008347997
#> 12           102.79253        9.511186       0.169191475
#> 13            93.67424       10.517192       0.087603080
#> 14           106.93940        9.270423       0.011224837
#> 15           103.80760        7.913200       0.163729165
#> 16            95.62826        9.675453       0.228409282
#> 17            98.94928        9.665202      -0.011010834
#> 18           107.50373        9.947371       0.097280755
#> 19            98.29021       11.241486      -0.060456299
#> 20           100.75228       10.499306       0.214588751
#>                     fecundityQueenTrait yieldQueenTrait yieldWorkersTrait
#> fecundityQueenTrait           1.0000000      -0.2415538        -0.1265798
#> yieldQueenTrait              -0.2415538       1.0000000         0.3472355
#> yieldWorkersTrait            -0.1265798       0.3472355         1.0000000

We next build up the colonies in the apiary. But instead of building them all up to the same fixed number of workers, we build them up according to the queen’s fecundity. For that we use the sampling function nWorkersColonyPhenotype(), which samples the number of workers based on phenotypes of colony members, in our case fecundityQueenTrait in queens. Correspondingly, each colony will have a different number of workers. Read more about this function in its help page.

apiary <- buildUp(apiary, nWorkers = nWorkersColonyPhenotype,
                  queenTrait = "fecundityQueenTrait")
cbind(nWorkers = nWorkers(apiary), queenPheno)
#>    nWorkers fecundityQueenTrait yieldQueenTrait yieldWorkersTrait
#> 1       104           104.12047       11.205212        0.24588101
#> 2        97            96.88693       10.993692       -0.44138116
#> 3        92            92.38997        8.210623       -0.32292294
#> 4       113           113.43725       10.806634       -0.15490722
#> 5       104           103.74159        4.102949       -0.08470612
#> 6        93            99.02537        8.066388       -0.25211134
#> 7        87            87.03946        9.134779        0.18181087
#> 8        72            72.16656       12.150234        0.11499222
#> 9        88            95.49936        6.428601       -0.28523080
#> 10      108           107.76333        9.896180        0.40703719
#> 11       88            87.75657       10.109118        0.20408130
#> 12       93            92.82338        9.022220       -0.11749679
#> 13      106           106.47725        9.594374       -0.08670210
#> 14       97            96.81830        9.688132       -0.01711390
#> 15       87            87.10358       10.939345        0.17324316
help(nWorkersColonyPhenotype)

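To compute the colony value for honey yield, we again employ the calcColonyPheno() function. When correlating the queen and colony values, we expect a positive correlation between the number of workers and colony yield, because our individual to colony mapping function sums the workers effect across all workers: the more workers there are, the larger the sum. With only 15 colonies this estimate is noisy, and in the printout below the correlation between nWorkers and yield is weak (0.06).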
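The code behind the correlation matrix is not shown in this text. A minimal sketch that continues the session above; the object name colonyYield is ours, and the exact calls in the original may differ.

```r
# Continues the session above
queenPheno <- getQueenPheno(apiary, collapse = TRUE)
colonyYield <- calcColonyPheno(apiary,
                               queenTrait = "yieldQueenTrait",
                               workersTrait = "yieldWorkersTrait",
                               traitName = "yield")

# Correlations between colony size, queen phenotypes, and colony yield
cor(cbind(nWorkers = nWorkers(apiary), queenPheno, colonyYield))
```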

#>                        nWorkers fecundityQueenTrait yieldQueenTrait
#> nWorkers             1.00000000          0.97390308      -0.1461282
#> fecundityQueenTrait  0.97390308          1.00000000      -0.2415538
#> yieldQueenTrait     -0.14612819         -0.24155376       1.0000000
#> yieldWorkersTrait   -0.02814610         -0.12657979       0.3472355
#> yield                0.06152033         -0.02152753       0.3929570
#>                     yieldWorkersTrait       yield
#> nWorkers                   -0.0281461  0.06152033
#> fecundityQueenTrait        -0.1265798 -0.02152753
#> yieldQueenTrait             0.3472355  0.39295702
#> yieldWorkersTrait           1.0000000  0.67898556
#> yield                       0.6789856  1.00000000
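To see why summing workers effects links colony size to yield, and why 15 colonies are too few to estimate that link reliably, the base R sketch below simulates many toy colonies. It assumes independent normal effects with the phenotypic variances used in this section (4 for the queen effect, 0.04 per worker) and ignores relatedness among workers; all object names are ours.

```r
set.seed(123)
nToyColonies <- 10000  # many more colonies than the 15 in the apiary above

# Assumed toy model: independent normal effects, ignoring relatedness
toyNWorkers <- pmax(round(rnorm(nToyColonies, mean = 100, sd = 10)), 1)
queenEffect <- rnorm(nToyColonies, mean = 10, sd = 2)
workersSum <- vapply(toyNWorkers,
                     function(n) sum(rnorm(n, mean = 0.1, sd = 0.2)),
                     FUN.VALUE = numeric(1))
toyYield <- queenEffect + workersSum

# Positive: more workers add more to the summed workers effect
cor(toyNWorkers, toyYield)

# With only 15 colonies the estimate is noisy and can be close to zero
small <- sample(nToyColonies, size = 15)
cor(toyNWorkers[small], toyYield[small])
```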
