Gregor Gorjanc (gg): Overload of the term "additive" in quantitative genetics

The following discussion came up on SLiM (https://messerlab.org/slim/) mailing list (https://groups.google.com/g/slim-discuss), which I think is highly indicative of confusion among many of us about additive and non-additive gene effects, additive and non-additive genetic values, and corresponding additive and non-additive genetic variances. Doh!

It started by a common comment that there is not much non-additive genetic variation, so maybe we can ignore non-additive gene action in simulations. While this is often done, it's important to be careful about how the term "additive" and "non-additive" are used, which I think is leading to lots of confusion among many (including me).

Let's see how the term "additive" is used in multiple ways in quantitative genetics!

1) The quantitative genetics model

phenotypic_value = intercept + genetic_value + environmental_value

is additive by construction to being with, but nobody mentions this - it's likely that biology is not so linear! But, this gives an "additive" model that we can work with well and it seems to be giving us good predictions for many quantities.

2) The decomposition of genetic value

genetic_value = additive_genetic_value + dominance_deviation + epistasis_deviation,

is again additive by construction (we are summing up things), but let’s move to the additive_genetic_value (=breeding value), which is allele substitution effect (alpha) multiplied by allele dosage (if dosages are, say, 0, 1, and 2, then we have 0alpha, 1alpha, 2alpha) for each locus and then summed over all causal loci. There are two "additivities" here (in addition to the additive decomposition of the phenotypic value and the genetic value!) - adding up allele substitution effects within a locus, and then across the loci.

3) The allele substitution effects and gene action

Allele substitution effects are obtained by a linear (=additive) regression of phenotypic values onto allele dosages, which (in a randomly mating population and without epistasis and GxE, but with dominance) turns out to be:

alpha = a + d(q-p),

where -a and +a are values for the two homozygotes (with "origin" in the middle) and d is a value of the heterozygote relative to the "origin" - this is the standard quantitative genetics parameterization (see Falconer & MacKay green book page 109 - in the 1996 version). These -a, d, and +a values are the values of genotypes (genetic values) in the first (phenotype) model shown at the top.

The a value above is sometimes referred to as an additive gene action at a locus, and d as a dominant gene action at a locus.

So, the above case shows ~5 uses of "additivity", but the show goes on! We typically see that variance of dominance deviations, and likely also epistatic deviations, is small, so we can conclude that non-additive genetic variance can be ignored? It depends! I think we need to distinguish between:

A) what is happening in reality - we don't really know, but clearly biology is highly non-linear (=non-additive), BUT 1st order approximations (=additive) will capture the majority of variation

B) what we simulate - relevant discussion in the mailing list, but we want simulations to mimic A, but we can only set parameters based on C (see next)

C) what we can estimate from the data - indeed many studies find that the variance of breeding values seems to explain most of the variance in genetic values, leading to the usual statement that most of "genetic variance is additive", BUT there is a caveat that breeding values are a 1st order approximation and as such capture additive and some of the non-additive gene effects. Studies that report dominance variance, technically variance between dominance deviations (the part not captured by breeding values - and note that breeding values capture some dominance variation!), often report small values, again indicating that most variation is "additive". BUT, some of these studies are underpowered to get accurate estimates of variance between dominance deviations. On the other hand, there is quite a lot of studies of inbreeding depression and heterosis, indicating that there must/should be dominant gene effects (there are different hypothesis about this too that I will not go into!). I know that there is a substantial inbreeding depression in maize inbred lines, which then generates very large heterosis in their hybrids. Then, I guess sometimes there are real dominant gene effects, but selection is keeping allele frequency of unfavorable mutations low (so we only rarely see unfavorable/unfavorable genotype!), meaning that observed variance between dominance deviations at that locus will be low ...

So, all this "additivity" is convoluted.

These two papers touch on some of these points (there is lots more literature about this topic!):

Data and Theory Point to Mainly Additive Genetic Variance for Complex Traits
https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1000008

The Genetic Architecture of Quantitative Traits Cannot Be Inferred from Variance Component Analysis
https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1006421

Gregor Gorjanc (gg)

2023-10-01

Overload of the term "additive" in quantitative genetics

No comments: