“one gene in mice to seven genes in primates”
Source: Perez-Caballero, Soll, and Bieniasz
The repeat side of the story is only half the arms race. The host defense side evolves too. APOBEC gene copy number varies dramatically across mammals, and that variation shapes how we interpret editing signatures in repeats.
APOBEC3 genes are a famous example. Some mammals have a compact APOBEC3 repertoire, while primates carry multiple APOBEC3 paralogs. This expansion is often interpreted as evidence of long-term pressure from viruses and retroelements. More copies create more biochemical possibilities: different subcellular localization, expression timing, target preference, motif specificity, and antagonist resistance.
For repeat-editing studies, gene copy number matters in several ways.
First, it affects enzyme attribution. A GG-context footprint in one species and a GA-context footprint in another may reflect different APOBEC paralogs, not simply different retroelement properties. In primates, APOBEC3G, APOBEC3F, APOBEC3A, APOBEC3B, and others have overlapping but distinct target profiles and restriction mechanisms. In non-placental vertebrates, APOBEC3 may be absent, so APOBEC1-like or APOBEC5-like enzymes may be candidates.
Second, copy number affects evolutionary timing. If a repeat family appears to have been heavily edited in a lineage after APOBEC duplication, the duplication and repeat burst may be related. But the causal arrow can be hard to establish. Did retroelement activity drive APOBEC expansion? Did APOBEC expansion permit stronger suppression of active repeats? Or are both responding to a broader viral ecology?
Third, copy number affects redundancy. A lineage with many APOBEC paralogs may preserve a restriction function while allowing one paralog to diversify toward new targets. McLaughlin and colleagues provide a useful framing for this problem in APOBEC3A: LINE-1 restriction can remain conserved while antiviral specificity changes. That means rapid evolution of an APOBEC protein does not automatically identify the mobile element that drove selection.
Fourth, copy number affects toxicity. APOBEC activity is dangerous. These enzymes mutate nucleic acids. Extra copies may improve defense, but they may also increase the risk of host-genome damage or dysregulated editing. This tension may shape which duplicates survive.
Yang and colleagues add another fascinating twist: APOBEC genes themselves can be copied by retrotransposition. They describe A3 retrocopies in primates, including New World monkey APOBEC3G-derived retrocopies, some of which are expressed and functional. This turns the story into a loop: retroelements can duplicate host restriction genes, and those new host-gene copies can then restrict viruses or retroelements.
This matters for methodology because gene copy number should not be treated as static background annotation. A comparative study of APOBEC footprints should ideally reconstruct the APOBEC repertoire in each species analyzed. That includes intact genes, pseudogenes, retrocopies, copy-number variants, and lineage-specific losses. It should also consider expression in germline, early embryo, placenta, immune tissues, and other contexts where retroelement activity or viral endogenization could occur.
A practical comparative framework could look like this. For each species, annotate APOBEC genes and retrocopies. Infer orthology and paralogy. Identify intact catalytic motifs and expression evidence. Estimate repeat-family activity and age distribution. Detect repeat editing signatures. Then test whether editing abundance, motif class, or repeat-family targeting correlates with APOBEC repertoire size or specific paralog presence.
The strongest claims will not simply say “more APOBEC genes, more editing.” Copy number, expression, enzyme activity, and target ecology all matter. A species with few APOBEC genes may still show strong editing if the relevant enzyme is highly expressed in the right cells. A species with many copies may show weak detectable footprints if repeats are old, assemblies are poor, or restriction is deaminase-independent.
Key technical takeaway: APOBEC gene copy number is a crucial covariate. Repeat-editing signatures should be interpreted alongside lineage-specific APOBEC repertoires, paralog function, retrocopies, expression, and toxicity constraints.
No comments:
Post a Comment