Zum Hauptinhalt springen

Identification of a viral gene essential for the genome replication of a domesticated endogenous virus in ichneumonid parasitoid wasps.

Lorenzi, Ange ; Legeai, Fabrice ; et al.
In: PLoS Pathogens, Jg. 20 (2024-04-01), Heft 4, S. e1011980
Online academicJournal

Identification of a viral gene essential for the genome replication of a domesticated endogenous virus in ichneumonid parasitoid wasps  Introduction

Thousands of endoparasitoid wasp species in the families Braconidae and Ichneumonidae harbor "domesticated endogenous viruses" (DEVs) in their genomes. This study focuses on ichneumonid DEVs, named ichnoviruses (IVs). Large quantities of DNA-containing IV virions are produced in ovary calyx cells during the pupal and adult stages of female wasps. Females parasitize host insects by injecting eggs and virions into the body cavity. After injection, virions rapidly infect host cells which is followed by expression of IV genes that promote the successful development of wasp offspring. IV genomes consist of two components: proviral segment loci that serve as templates for circular dsDNAs that are packaged into capsids, and genes from an ancestral virus that produce virions. In this study, we generated a chromosome-scale genome assembly for Hyposotor didymator that harbors H. didymator ichnovirus (HdIV). We identified a total of 67 HdIV loci that are amplified in calyx cells during the wasp pupal stage. We then focused on an HdIV gene, U16, which is transcribed in calyx cells during the initial stages of replication. Sequence analysis indicated that U16 contains a conserved domain in primases from select other viruses. Knockdown of U16 by RNA interference inhibited virion morphogenesis in calyx cells. Genome-wide analysis indicated U16 knockdown also inhibited amplification of HdIV loci in calyx cells. Altogether, our results identified several previously unknown HdIV loci, demonstrated that all HdIV loci are amplified in calyx cells during the pupal stage, and showed that U16 is required for amplification and virion morphogenesis.

Author summary: Parasitoid "domesticated endogenous viruses" (DEVs) provide a fascinating example of eukaryotes acquiring new functions through integration of a viral genome. DEVs consist of multiple loci in the genomes of wasps. Upon activation, these elements collectively orchestrate the production of virions or virus-like particles that are crucial for successful parasitism of host insects. Despite the significance of DEVs for parasitoid biology, the mechanisms regulating key steps in virion morphogenesis are largely unknown. In this study, we focused on the ichneumonid parasitoid Hyposoter didymator, which harbors an ichnovirus consisting of 67 proviral loci. Our findings reveal that all proviral loci are simultaneously amplified in ovary calyx cells of female wasps during the pupal stage suggesting the hijacking of cellular replication complexes by viral proteins. We functionally studied the ichnovirus gene U16, which encodes a protein with a weakly conserved primase C-terminal domain. Silencing U16 inhibited viral DNA amplification and virion production, underscoring the key role of this gene for ichnovirus replication. Our study provides evidence that many genes involved in viral DNA replication have been conserved during the domestication of ichnoviruses.

Endogenous viral elements (EVEs) refer to viral sequences in eukaryotic genomes that originate from complete or partial integration of a viral genome into the germline [[1]]. While retroviruses are the best-known sources of EVEs, bioinformatic studies have also identified non-retroviral EVEs across a diverse range of organisms [[2]]. Although many EVEs become non-functional and decay through neutral evolution [[3]], some have been preserved and repurposed by their hosts for new functions, often as short regulatory sequences or individual genes [[4]]. A notable exception to this pattern is observed in domesticated endogenous viruses (DEVs) that have been identified in four lineages of endoparasitoid wasps—insects that lay eggs and develop within the bodies of other insects [[6]]. Parasitoid DEVs consist of numerous genes conserved within the wasp genome that originate from the integration of complete viral genomes. Unlike other EVEs, these genes remain functional and actively interact to produce viral particles in calyx cells, which are located in the apical part of the oviducts of female wasps [[7]]. Viral particles are produced in the pupal and adult stages and accumulate in the oviducts of the wasp. Adult female wasps inject these particles along with eggs into insect hosts where they have essential functions in the successful development of wasp offspring [[8]].

DEVs are most prevalent among species in two wasp families named the Braconidae and Ichneumonidae. The DEVs identified in these two wasp families have evolved from different viral ancestors but through convergence have been similarly repurposed to produce either virions containing circular double-stranded (ds) DNAs or virus-like particles (VLPs) lacking nucleic acid. The hyperdiverse Microgastroid complex in the family Braconidae harbors DEVs named bracoviruses (BVs) that evolved from a common virus ancestor in the family Nudiviridae [[9]]. Other braconids in the subfamily Opiinae and ichneumonids in the subfamily Campopleginae independently acquired two other distinct nudiviruses that wasps have coopted to produce VLPs [[10]]. The fourth type of DEV that has been identified are named ichnoviruses (IVs). IVs are present in two ichneumonid subfamilies (Campopleginae and Banchinae) which produce virions containing circular dsDNAs. Unlike the other three DEVs, IVs evolved from a still unknown ancestor. This ancestor was likely though a large DNA virus in the phylum Nucleocytoviricota, which was formerly referred to as nucleocytoplasmic large DNA viruses (NCLDVs) [[12], [14]]. Together, BVs and IVs were also earlier referred to as polydnaviruses, but this term is largely no longer used because of strong evidence indicating BVs and IVs derive from different ancestors [[6]].

BVs have been more studied than IVs but the latter are intriguing because of their uncertain origins. Despite differences in ancestry and gene content, BV and IV genomes are similarly organized into two components that have distinct functions [[15]]. Insights into the genome components of IVs primarily derive from sequencing two campoplegine wasps named Hyposoter didymator and Campoletis sonorensis [[16]], along with calyx transcriptome studies [[12], [17]] and proteomic analyses of purified virions [[12]]. The first genome component of IVs are domains in the wasp genome that show evidence of deriving from the virus ancestor and having essential functions in virion formation. These domains, named "Ichnovirus Structural Protein Encoding Regions" (IVSPERs), contain intronless genes that are specifically transcribed in calyx cells [[12], [18]]. Most IVSPER genes are transcribed at the onset of pupation in hyaline stage 1 pupae [[17]], and some genes in IVSPERs encode proteins associated with IV virions [[12]]. Six genes have been knocked down by RNA interference (RNAi) in H. didymator which demonstrated that they have functions in virion assembly or cell trafficking [[17]]. Five IVSPERs have been identified in the H. didymator and C. sonorensis genomes [[16]], while three have been identified in the genome of the more distantly related banchine Glypta fumiferanae [[13]]. The content of IVSPER genes is notably similar between ichneumonid wasp species [[12], [18]], and their gene order is well-conserved among campoplegine species [[15]]. Additionally, one intronless gene (U37) was identified in the H. didymator and C. sonorensis genomes outside of any IVSPER with features suggesting it also derives from the IV viral ancestor [[16]]. Together, these genes, whether found within or outside IVSPERs, represent the fingerprints of the ancestral viral machinery essential for virion production and are designated as IV core replication genes. Notably, none of these genes are packaged in virions, indicating that IV core genes can only be transmitted vertically through the germline of associated parasitoids.

The second component of IV genomes are domains referred to as "proviral segments," which are amplified in calyx cells and produce circular, double-stranded (ds) DNAs that are packaged into capsids [[19]]. The number of proviral segments, typically exceeding 50, are widely dispersed in wasp genomes and exhibit considerable variability between wasp species [[16]]. Each proviral segment is characterized by flanking direct repeats (DRs) of variable length (<100 bp to >1 kb) and homology that identifies where homologous recombination processes occur to produce circularized, dsDNAs [[19]]. Some IV proviral segments also contain internal repeats that facilitate additional homologous recombination events and produce multiple overlapping or nested circularized DNAs per proviral segment [[16], [19]]. The circularized, dsDNAs produced from proviral segments encode genes with and without introns that are expressed in the hosts of wasps after virion infection [[21], [23]]. These genes also have functions that are required for the successful development of parasitoid progeny. Thus, while IV core replication genes represent the conserved viral machinery that produces virions in calyx cells, proviral segments contain the IV genome components virions transfer to parasitized hosts.

The replication of IVs, encompassing the processes leading to the production of virions containing IV segments, occurs within the nuclei of calyx cells during pupal and adult developmental stages [[7], [25]]. Electron microscopy studies of H. didymator ichnovirus (HdIV) show that fusiform-shaped capsids are individually enveloped in the nuclei of calyx cells during the late pupal stage (pigmented pupae, stage 3) [[17]]. These enveloped "subvirions" exit the nucleus, traverse the cytoplasm, and exit calyx cells by budding, resulting in mature virions with two envelopes that accumulate in the calyx lumen of the ovaries [[7], [25]]. Earlier findings indicated that IVSPERs and proviral segments undergo amplification in newly emerged adult wasps [[12]]. However, these data focused on only a subset of IVSPER genes and one proviral segment, leaving our knowledge of whether all IV genome components are amplified in calyx cells incomplete. Similarly, the initiation time of amplification during pupal development and IV virion production remains unknown. The specific role of IV core genes in virion production is also poorly documented when compared to BVs [[26]]. The limited sequence homology of IVSPER genes with genes in other viruses provides minimal insights into potential functions. To date, only the six genes mentioned above that are involved in subvirion assembly or cell trafficking have been studied [[17]].

In this work, we explored IV DNA replication using the campoplegine wasp H. didymator. We first generated a chromosome-level assembly for the H. didymator genome. Through this assembly, we determined that all genome components undergo local amplification in calyx cells which initiates between pupal stages 1 and 2. Notably, IVSPERs, isolated IV core genes, and proviral segments were amplified in large regions with non-discrete boundaries. Next, we studied the function of U16 which is located on H. didymator IVSPER-3. U16 is one of the most transcribed IVSPER genes during the initial pupal stage and contains a weakly conserved domain found in the C-terminus of primases. RNAi knockdown of U16 inhibited virion formation. Knockdown also significantly reduced DNA amplification of all HdIV genome components, which decreased transcript abundance of IV core genes and the abundance of circular dsDNA viral molecules. We conclude that U16 is an essential gene for amplification of the HdIV genome and virion production, demonstrating that genes from the IV ancestor regulating IV replication have been conserved during virus domestication. Additionally, our results show that viral DNA amplification is essential for IV virion production.

Results

Genomic localization of Hyposoter didymator IV components in a novel chromosome-level assembl...

The genome assembly for H. didymator we previously generated [[16]] consisted of 2,591 scaffolds with an N50 of 4 Mbp. We concluded that this assembly was overly fragmented to evaluate DNA amplification in calyx cells during virion morphogenesis. We therefore used proximity ligation technology to produce a new chromosome-level assembly consisting of twelve large scaffolds that correspond with the haploid karyotype for H. didymator [[28]]. The sizes of these scaffolds ranged from 6.7 Mbp to 29.3 Mbp (Fig A and Table A in S1 Dataset).

The twelve scaffolds contained all HdIV loci previously identified: the replication genes organized in clusters (IVSPER-1 to IVSPER-5), the predicted IV replication gene (U37) located outside of an IVSPER, and 53 of the 54 previously identified proviral segments (Hd1 to Hd54) [[16]] (Fig 1). Only the scaffold containing Hd51 was missing from the 12 main scaffolds, possibly due to low-quality sequencing data in this region (Table A in S1 Dataset). Our chromosome-level assembly revealed that each scaffold contained at least one HdIV locus, but notably, all IVSPERs and 40% of the proviral segment loci resided on two scaffolds (7 and 11) (Fig 1). The high density of HdIV loci observed on scaffold 11 might indicate that this genomic region served as the integration site for the ancestral virus.

DIAGRAM: Fig 1 Genome of H. didymator and position of HdIV loci.Position of the HdIV loci previously identified in [[16]] within the twelve large scaffolds obtained using Hi-C technology. The HdIV replication genes, organized in Ichnovirus Structural Protein Encoding Regions (IVSPERs) or that are isolated in other regions of the H. didymator genome, are highlighted in red while viral segments are indicated in black. The scaffold diagram was generated using the Mapchart software [[29]]. At the bottom of the figure, diagrams illustrate the difference between the two types of viral loci: the IVSPER which are clusters of HdIV replication genes inherited from the IV viral ancestor, and the viral segments flanked by direct repeats located on the left (DRL) and right (DRR) sides of segments, housing virulence genes expressed in the parasitized insect.

While three IVSPERs and the majority of proviral segments were distantly located from each other in the H. didymator genome, there were exceptions to this pattern including certain pairs of proviral segments separated by less than 20 kb (e.g., Hd36 and Hd38; Hd46 and Hd43; Hd44.1 and Hd44.2; Hd12 and Hd16 in scaffolds -1, -2 and -11 respectively). In all of these cases, the paired segments exhibited significant homology which suggested they derived from recent duplication events (Fig B in S1 Dataset). Additionally, several proviral segments were in proximity to IVSPERs or IV replication genes that resided outside of IVSPERs (e.g., Hd46 near U37; Hd29 and Hd24 on each side of IVSPER-2; Hd15 near IVSPER-1 in scaffolds -2 and -11 respectively; Fig 1).

Amplification of Hyposoter didymator IV genome components in calyx cells during wasp pupal de...

Prior results suggested IV proviral segment loci are amplified before processing and excision of individual segments that are packaged into capsids [[19]]. An earlier study also reported that select IVSPER genes and one proviral segment located near an IVSPER are amplified in H. didymator adults [[12]]. However, these findings did not address whether all IV genome components are amplified in calyx cells or when amplification initiates in association with virion formation. To more precisely investigate the temporal dynamics of amplification, we first conducted relative quantitative (q) PCR assays that measured copy number of genes in IVSPER-1, -2, and -3 in calyx DNA samples that were collected from stage 1–4 pupae. We compared these treatments to DNA samples from hind legs of stage 1 pupae where no HdIV replication occurs. We also included a wasp gene (XRCC1) located in close proximity to IVSPER-1. Results showed that the copy number of each tested gene was similar in calyx and hind legs in stage 1 pupae (statistical significances in Table B in S2 Dataset, lower panel), indicating none were amplified during the initial pupal stage. Subsequently, the copy number of each gene increased progressively with each pupal stage (Fig 2A and Table B in S2 Dataset). While exhibiting lower amplification levels than the IVSPER genes we analyzed, a similar trend was observed for the wasp gene XRCC1 (Fig 2A and Table B in S2 Dataset). These findings indicated IVSPER amplification in calyx cells begins between pupal stage 1 (one day old, hyaline) and pupal stage 2, and further increases in pupal stage 3 (five days old, pigmented abdomen) and pupal stage 4.

Graph: Fig 2 DNA amplification of HdIV loci.(A) qPCR analysis of selected IVSPER genes in calyx cells during wasp pupal development. Top panel. A schematic representation of H. didymator IVSPERs-1, -2, and -3 (GenBank GQ923581.1, GQ923582.1, and GQ923583.1); genes selected for qPCR assays are highlighted in white. U1-U24 are unknown protein-encoding genes, while IVSPs are members of a gene family encoding ichnovirus structural proteins. Bottom panel. Genomic (g) DNA amplification levels of IVSPER genes and wasp XRCC1 in calyx cells from pupal stage 1–4. The XRCC1 (X-Ray Repair Cross Complementing 1) encoding gene is located 1,200 bp from U1 (position 3,270,470 to 3,272,519 in Scaffold-11). Data correspond to gDNA amplification relative to amplification of the housekeeping gene elongation factor 1 (ELF1). The Y-axis was transformed using the square root function for better data visualization. Statistical test results are available in Table B in S2 Dataset. (B) Read depth of HdIV loci compared to the rest of the wasp genome. Read depth values per analyzed region (see Materials and methods) are presented for each locus type (proviral segments and IVSPERs) at pupal stage 1 (hyaline pupa) and pupal stage 3 (pigmented pupa). The read depths per HdIV locus (HdIV) are compared to the read depth per random genome regions outside of HdIV loci (wasp). Note that the read depth value for random wasp regions is lower for DNA samples collected from stage 3 versus stage 1 pupae. This difference is attributed to the higher proportion of reads mapping to HdIV regions among the total number of reads in stage 3 compared to stage 1. The significance levels are indicated as follows: ns = non-significant, **p<0.01, and ***p<0.001. Statistical test results are available at https://github.com/flegeai/EVE_amplification/blob/main/notebooks/Count_tables.ipynb.

We next investigated whether all or only some HdIV genome components are amplified by isolating DNA from calyx cells from stage 1 pupae, which the preceding qPCR assays detected no amplification, and stage 3 pupae, which showed high levels of amplication. We then generated paired-end libraries, which were sequenced using the Illumina platform, followed by read alignment to the new chromosome-level genome assembly. When analyzing the reads from stage 1 pupae, read depth per HdIV locus did not differ significantly from read depth of randomly selected regions of the same size from the rest of the wasp genome (Fig 2B). In contrast, for stage 3 pupae, read depth of the HdIV loci was significantly higher than that of randomly selected regions from the rest of the wasp genome (Fig 2A, read depth values in S1 Table), suggesting most or all HdIV loci are amplified in pupal stage 3.

Differential levels of amplification across all components of the HdIV genome

While the preceding results suggested all HdIV genome components were amplified in calyx cells, the qPCR results presented in Fig 2A also indicated amplification levels varied with genes in IVSPER-3 exhibiting higher levels of amplification than genes in IVSPER-1 and -2. We therefore also analyzed read depth per position and the ratio of read depths between stage 3 and stage 1 in our Illumina data set (Figs 3 and S1). This analysis indicated that amplification levels of IVSPER loci, determined at the summit of the read depth curve, ranged from 10X for IVSPER-5 in Scaffold-7 to over 200X for IVSPER-3 in Scaffold-3 (S1 Table). Consistent with the findings from our qPCR analysis, IVSPER-3 was more highly amplified than IVSPER-1 and -2. Results from the read depth per position curve further indicated that the peak of amplification occurs toward the middle of each IVSPER (S1 Fig), consistent with our qPCR data genes closer to the boundary of an IVSPER that exhibit lower levels of amplification than genes in the middle of the cluster (Figs 2A, 3 and S1).

Graph: Fig 3 HdIV DNA amplification.DNA amplification in pupal stage 3 was assessed by mapping genomic DNA Illumina reads against the 12 large H. didymator genome scaffolds. In each scaffold, red bars indicate amplified loci, with the intensity of red corresponding to increased values of the counts per million (CPM) ratio between pupal stage 3 and pupal stage 1. The positions of IVSPERs and isolated IV replication genes are indicated by purple squares, while proviral segments are indicated by green circles. For selected HdIV loci, amplification curves (representing the ratio of the CPM values calculated for 10 bp intervals between pupal stage 3 and pupal stage 1) are shown in boxes. Amplification curves for all of the annotated HdIV loci are shown in S1 Fig. Each HdIV locus is indicated in red while 10,000 bp of flanking sequence on each side of the locus is also shown. For proviral segments, loci are defined as the sequence delimited by two direct repeats; IVSPERs are defined as the region between the start and stop codon of the first and last coding sequences in the cluster; isolated IV replication genes are defined by their coding sequence.

Proviral segment loci were relatively more amplified than IV replication gene loci, but were also variable (Figs 3 and S1). For example, the ratio between read depth between stage 3 and stage 1 ranged from 30X for proviral locus Hd40 in Scaffold-6 to over 1,100X for Hd27 in Scaffold-7 (S1 Table) at the summit of the read depth curves. Variability in sequencing depth among proviral loci was consistent with earlier studies indicating that the circularized DNAs packaged into IV capsids are non-equimolar in abundance [[8], [30]].

All proviral segments consistently exhibited a substantial increase in amplification that peaked between the two DRs (as exemplified by Hd14 or Hd12 in S2 Fig). For numerous proviral loci, the reads mapping between the flanking DRs displayed uniform read depth. However, in other cases, peaks with varying read depth were evident (as exemplified by Hd32 or Hd16 in S2 Fig). This differential read depth usually applied to proviral segments that contained more than one pair of DRs, as illustrated by proviral locus Hd11 (Fig 4A) or Hd32 and Hd16 (S2 Fig). Previous studies indicated that Hd11 contains two pairs of DRs, enabling the formation of two nested, circularized segments termed Hd11-1 (formed by recombination between DR1Left (DR1L) and DR1Right (DR1R)) and Hd11-2 (formed by recombination between DR2L and DR2R) (Fig 4A). Reads mapping to the Hd11 locus (bounded by DR1L and DR2R) exhibited three relatively uniform plateaus of different values. Two plateaus corresponded to reads mapping to the predicted locations of Hd11-1 (235X) and Hd11-2 (111X), while the central region with higher read depth (311X) corresponded to reads mapping to both nested segments (Fig 4A). This differential read depth would not be expected if reads mapped only to Hd11 chromosomal DNA. Consequently, the pattern of proviral segment amplification suggested part of the read depth values were due to reads arising from amplification intermediates and/or circularized dsDNAs that were also present in our DNA samples. Some amplified HdIV loci contain both an IVSPER and proviral segments. Two of these loci resided on Scaffold-11 (Hd29, IVSPER-2, Hd24, and Hd33, Hd15, IVSPER-1 (Fig 4B)). For these loci, the amplification curves spanned the length of the amplified region (yellow dotted line in Fig 4B) but were interrupted by peaks corresponding to the length of proviral segments. This pattern suggested amplification levels of the chromosomal form of the proviral segments could correspond to levels comparable to the surrounding region and to the IVSPER amplification curves, but were higher because of reads also mapping to circular dsDNAs or amplification intermediates.

Graph: Fig 4 HdIV amplified regions in Scaffold-11.(A) Detail of the amplified region at the Hd11 locus. Hd11 (red bar) represents two overlapping segments Hd11-1 (defined by left and right DRs, DR1L and DR1R respectively, blue bar) and Hd11-2 (defined by DR2L and DR2R, orange bar). (B) Detail of two other amplified regions containing IVSPERs and HdIV proviral loci. In (B), amplification curves of IVSPERs are highlighted in yellow. In (A) and (B), amplification curves represent the ratio of the CPM values (calculated for 10 bp intervals) obtained in pupal stage 3 compared to pupal stage 1. For each locus, amplification values at the summit of the peaks (bold type) and at the start and end positions of HdIV segments are indicated with arrows. Each amplification curve figure was generated by Integrated Genome Viewer (IGV) [[31]].

Amplification of H. didymator IV genome components in extensive wasp genome domains with unde...

Since our read depth data indicated amplified regions were larger than the annotated HdIV loci (Figs 3 and S1), we used the MACS2 peak calling program, originally developed for chromatin immunoprecipitation sequencing experiments, to identify areas in the H. didymator genome that were enriched for reads when compared to a control [[32]]. Amplification peaks were called with MACS2 using alignments from stage 3 pupae as the treatment and alignments from stage 1 pupae as the control. MACS2 identified a total of 165 regions that were enriched for reads (i.e. were amplified) (S2 Table). All HdIV genome components that we had annotated in our earlier study [[16]] were included in these regions. Among the other regions identified by MACS2, manual curation (see Materials and methods section) indicated three were proviral segment loci that we named Hd52, Hd53, and Hd54. Five others contained intronless genes with features suggesting origins from the IV ancestor, that were outside of IVSPERs. We thus named these potential IV replication genes U38, U39, U40, U41, and U42. The remaining regions detected by MACS2 either contained predicted wasp genes or lacked any features that identified them as IV replication genes or proviral segments. Altogether, the MACS2 algorithm predicted a total of 55 regions in the H. didymator genome containing HdIV loci. Two proviral segments (Hd45.1 on Scaffold-4 and Hd2-like on Scaffold-7) escaped MACS2 detection, possibly because they were located too close to the ends of each scaffold. However, our read mapping data clearly indicated these two segments are amplified in stage 3 (Table 1) with a profile similar to other prosegments (S1 Fig). In total, read mapping and MACS2 peaks indicated the H. didymator genome contains 67 HdIV loci (56 proviral segments, five IVSPERs, and six predicted IV replication genes that reside outside of IVSPERs) that are all amplified in calyx cells at pupal stage 3 and embedded in a total of 57 large and locally amplified regions (Fig 3 and Table 1).

Graph

Table 1 HdIV loci amplified in calyx cells from stage 3 pupae identified by read mapping and/or the MACS2 algorithm. For each scaffold, the position and size of the HdIV loci are indicated. Loci newly identified in the present work are marked with asterisks. Corresponding amplified regions (i.e., the peak predicted by the MACS2 algorithm) are provided for each locus or groups of loci. Start and end positions delimiting the HdIV loci and the amplified regions detected by MACS2 are indicated. The distance between the start or the end of the amplified region and the locus is presented. Asterisks indicate the HdIV loci identified in the present work. For each HdIV locus and amplified region detected by MACS2, read depth values are provided for calyx cell samples collected from stage 1 or stage 3 pupae. Read depth is based on the length of the HdIV locus or the amplified region. ND indicates amplified regions not detected by MACS2.

Annotated proviral locusAmplified region (MACS2 peak prediction)
ScaffoldHdIV locusLocus startLocus endLocus size (bp)Read depth Pupal stage 1Read depth Pupal stage 3MACS2 Peak namePredicted region startDistance from locus start (bp)Predicted region endDistance from locus end (bp)Region size (bp)Read depth Pupal stage 1Read depth Pupal stage 3
Scaffold-1Hd2613,523,61713,528,6345,0173723Hd2613,517,3176,30013,534,3925,75817,0752217
Hd3820,666,61220,670,2753,66331,090Hd38+Hd3620,656,41710,19520,681,70911,43425,2922246
Hd3620,670,79120,674,5283,737350114,3747,181
Hd5027,247,70427,253,4905,7862123Hd5027,241,0466,65827,259,5336,04318,487242
Scaffold-2Hd492,124,3242,129,5885,2642505Hd492,117,1657,1592,135,1105,52217,9452155
U372,735,0402,736,8781,838388U37+Hd46+Hd432,728,2076,8332,758,00821,13029,8013132
Hd462,737,5362,741,6444,10821889,32916,364
Hd432,744,1622,748,3204,158336515,9559,688
Hd225,839,4675,843,6444,1772152Hd225,832,2077,2605,850,0656,42117,858240
Hd44.116,284,64516,287,6533,0083218Hd44.1+Hd44.216,278,9815,66416,302,69015,03723,709260
Hd44.216,289,60516,294,4354,830312210,6248,255
Scaffold-3Hd30419,458423,6214,16331,502Hd30410,4349,024429,8136,19219,3793336
Hd4816,689,74516,699,4179,6723184Hd4816,679,12110,62416,706,5357,11827,414270
Scaffold-4Hd196,231,0956,235,5344,4393428Hd196,229,4981,5976,243,2487,71413,7502143
Hd4112,119,88212,127,8347,9522618Hd4112,113,3726,51012,135,0967,26221,7242233
Hd45.212,329,91712,331,9672,050183Hd45.212,321,4188,49912,340,7158,74819,297218
Hd45.115,910,20815,914,4214,2131186NDNDNDNDNDNDND
Scaffold-5Hd135,876,2655,882,0215,7563385Hd135,869,1057,1605,889,8867,86520,7812112
Scaffold-6Hd402,988,7722,992,2653,493254Hd402,983,6995,0732,997,2665,00113,567216
Hd97,229,7747,247,66517,8912148Hd97,223,1106,6647,254,6096,94431,499387
U42*9,084,5429,085,6301,088320U42*9,075,6008,9429,095,67110,04120,07128
Hd111,708,97711,723,74714,7707284Hd111,702,9506,02711,728,7525,00525,8025168
Hd31-3411,868,29211,872,4104,1183945Hd31-3411,862,4305,86211,878,8326,42216,4022242
Scaffold-7U38*4,768,4184,770,1451,727230U38*4,758,4779,9414,776,7976,65218,320215
Hd276,697,6486,701,6494,00132,407Hd276,688,1779,4716,708,6777,02820,5002479
Hd52*6,772,3996,776,4694,0702141Hd52*6,765,6136,7866,783,7347,26518,121235
Hd57,084,9457,098,65713,7122220Hd57,078,9585,9877,104,5045,84725,5462124
Hd477,900,8157,905,3174,5023248Hd477,894,3076,5087,912,0186,70117,711267
IVSPER-59,178,2749,179,9031,629322IVSPER-59,168,10810,1669,191,03011,12722,92229
IVSPER-39,252,9039,278,33425,4312342IVSPER-39,243,1669,7379,287,9189,58444,7522221
U39*11,747,26611,750,5203,254392U39*11,738,0629,20411,764,02513,50525,963333
IVSPER-413,191,25813,207,06915,811261IVSPER-413,176,55014,70813,216,1079,03839,557333
Hd713,968,91813,976,9838,0652157Hd713,965,5513,36713,978,6031,62013,052298
Hd214,085,67214,099,60813,9363373Hd214,079,4326,24014,106,1616,55326,7292205
Hd614,221,05614,231,51610,4603473Hd614,218,1712,88514,238,1676,65119,9963254
Hd2-like14,388,34314,389,8791,5363528NDNDNDNDNDNDND
Scaffold-8Hd201,942,4071,949,2706,8632189Hd201,934,9607,4471,956,5527,28221,592265
Hd86,823,7456,831,1007,3552404Hd86,818,1655,5806,837,9546,85419,7892156
Hd47,097,0097,107,33410,3252780Hd47,090,1106,8997,114,2426,90824,1322340
Hd179,012,1069,019,8357,7292342Hd179,006,3185,7889,026,0096,17419,6912138
Hd54*9,214,2079,220,1255,9182319Hd54*9,208,8145,3939,228,0427,91719,2282104
Hd189,364,7949,369,4894,6952771Hd189,360,6124,1829,376,7787,28916,1662229
Scaffold-9U40*5,723,8665,725,7821,916337U40+Hd395,714,0599,8075,757,03231,25042,973237
Hd395,744,9555,749,0764,121224330,8967,956
U41*7,049,4767,051,9462,470222U417,040,2659,2117,060,0188,07219,753211
Scaffold-10Hd23.23,553,2033,556,5643,361390Hd23.23,545,3527,8513,561,4954,93116,143327
Hd23.13,592,5273,596,9834,4562229Hd23.13,587,2035,3243,605,0128,02917,809263
Hd259,399,1729,403,3454,1733496Hd259,393,5595,6139,412,0078,66218,4482117
Scaffold-11Hd10453,966460,4726,5062403Hd10447,7126,254465,3234,85117,6112154
Hd11768,148777,3379,1892361Hd11764,3113,837783,9166,57919,6054175
Hd162,262,9412,270,6447,70321,298Hd16+Hd122,257,2645,6772,292,89922,25535,6352318
Hd122,280,0312,285,9325,901217722,7676,967
Hd293,146,2733,150,6284,3553587Hd29+IVSPER-2+Hd243,139,1317,1423,193,13642,50854,0052153
IVSPER-23,155,2283,181,83826,610215316,09711,298
Hd243,182,7483,187,4444,69631963,128,7435,692
Hd333,234,2453,238,0793,8343244Hd33+Hd15+IVSPER-13,229,6814,5643,277,20339,12447,5222115
Hd153,249,0513,254,0374,986336319,37023,166
IVSPER-13,255,2663,269,28514,019214125,5857,918
Hd143,759,5773,764,7725,1952325Hd143,753,8705,7073,770,3775,60516,5072107
Hd323,811,9433,819,8587,9152402Hd323,807,2164,7273,823,3793,52116,1632200
Hd426,043,7646,046,9203,1562184Hd426,039,4904,2746,052,3075,38712,817248
Hd216,076,3486,080,7154,3672312Hd216,067,1229,2266,086,9046,18919,782275
Hd53*6,184,7786,190,1265,3483178Hd53*6,177,7107,0686,199,5059,37921,795249
Hd36,685,1116,695,12410,0132807Hd36,678,5856,5266,701,1556,03122,5702367
Hd376,786,2066,789,9133,7072220Hd376,779,6876,5196,795,6935,78016,006255
Scaffold-12Hd28308,843313,4564,613240Hd28305,6323,211316,3242,86810,692219
Hd352,941,8762,945,5853,70931,062Hd352,935,4306,4462,954,0298,44418,5992219

Our results also indicated all amplified regions in the H. didymator genome containing HdIV loci consist of the annotated HdIV locus along with flanking wasp sequences consistent with our detailed analysis of the wasp gene XRCC1 that is located in close proximity to IVSPER-1 (Fig 2B). Across all HdIV loci, we determined that the flanking regions containing wasp sequences that were amplified varied from 7,000 to 15,000 bp (Table 1). The total size of the amplified regions ranged from 10,692 bp (Hd28 on Scaffold-12) to 54,005 bp (IVSPER-2 on Scaffold-11). Most amplified regions contained a single HdIV locus, but seven contained a mix of HdIV genome components (Table 1). Three amplified regions contained the neighboring and closely related proviral segments mentioned above (e.g., Hd36 and Hd38 on Scaffold-1, Hd44.1 and Hd44.2 on Scaffold-2, Hd12 and Hd16 on Scaffold-11). In addition to the two examples noted above on Scaffold 11 (see Fig 4B), two other amplified regions also contained both IVSPERs and proviral segments (U37, Hd46, and Hd43 on Scaffold-2; U40 and Hd39 on Scaffold-9). The simultaneous amplification of both types of HdIV loci within the same amplicons suggested regulation by shared molecular mechanisms. In turn, we further hypothesized similar molecular mechanisms also regulate the amplification of isolated proviral segments and IVSPERs.

Lastly, we searched for sequence signatures that potentially identify the amplification boundaries for each HdIV locus. However, our analysis identified only low-complexity A-tract sequences, which were not specific to HdIV components as they were also found in random non-amplified wasp genomic sequences (S3 Fig). Thus, no motifs were identified that distinguished the amplification boundaries of HdIV loci.

RNAi knockdown of U16 inhibits virion morphogenesis

We selected the gene U16 located on H. didymator IVSPER-3 as a factor with potential functions in activating IV replication. U16 is conserved among all IV-producing wasps for which genome or transcriptome data are available (Fig 5). In H. didymator calyx cells, U16 is also one of the most transcribed IV genes detected in calyx cells from stage 1 pupae [[16]]. Sequence analysis using the basic local alignment search tool and DeepLoc2.0 predicted all U16 family members contain a C-terminal alpha-helical domain (PriCT-2) of unknown function that is present in several primases [[33]] and a nuclear localization signal (Fig 5 and S3 Dataset). We next assessed the effects of knocking down U16 by RNAi on virion morphogenesis in calyx cells. We injected newly pupated wasps with dsRNAs that specifically targeted U16 using previously established methods [[17]]. RT-qPCR analysis indicated transcript abundance in the calyx of newly emerged adult females was reduced by more than 90% when compared to control wasps that were injected with dsGFP (Fig 6A). Inspection of the ovaries further indicated that the calyx lumen of control wasps contained blue 'calyx fluid' indicative of HdIV virions being present, whereas almost no calyx fluid was seen in dsU16-injected wasps (Fig 6A). Examination of calyx cell nuclei by transmission electron microscopy similarly showed that calyx cells in one-day-old control females contained an abundance of subvirions, whereas no subvirions were observed in treatment wasps (Fig 6B). We thus concluded that U16 is required for virion morphogenesis.

MAP: Fig 5 U16 proteins analysis.U16 proteins were identified in the campoplegine Hyposoter didymator [[12]], Campoletis sonorensis [[16]], and Bathyplectes anurus [[18]], and in two banchine wasps Glypta fumiferanae [[13]] and Lissonota sp. [[34]]. (A) Phylogenetic tree of U16 proteins inferred using a maximum-likelihood method. Branch supports are labeled near the nodes. (B) Identity heat map of U16 proteins. (C) Alignment of U16 proteins. The predicted PriCT-2 domain is indicated by the red outline. In panels (B) and (C), increased blue intensity depicts a stronger degree of identity between the proteins. Abbreviations: Hd, H. didymator; Cs, C. sonorensis; Ba, B. anurus; Gf, G. fumiferanae; Lsp, Lissonota sp.

Graph: Fig 6 RNAi knockdown of U16.(A) RT-qPCR data showing relative expression of U16 in dsGFP (control) and dsU16 injected females. ** p<0.01. Images of ovaries dissected from newly emerged adult females that were injected with dsGFP (left) or dsU16 (right). Note the blue color in the oviduct of the dsGFP control indicating the presence of HdIV virions. (B) Schematics and electron micrographs showing that (a) calyx cell nuclei (N) from females treated with dsGFP-injected contain subvirions (V) while (b) calyx cell from a dsU16-injected wasps do not. This results in no accumulation of virions in the calyx lumen as illustrated in the schematic images. Abbreviations: CL, calyx lumen; Cyt, cytoplasm. Scale bars = 5 μm, zooms = 1 μm.

RNAi knockdown of U16 also disables amplification of HdIV loci

Since U16 contained a domain found in primases, we investigated whether RNAi knockdown also disabled amplification of HdIV genome components. We injected newly pupated wasps with dsU16 or dsGFP, followed by isolation and deep sequencing of calyx cell DNA from stage 3 pupae in three independent replicates. Mapping the reads from dsGFP-treated calyx samples to the H. didymator genome indicated all HdIV loci were amplified as evidenced by higher read depth values when compared to random regions of the wasp genome (Fig 7A). Silencing U16 significant decreased read depth values when compared to dsGFP samples for both IVSPER and proviral segment loci (Fig 7A). However, read depth values remained higher for HdIV loci compared to other regions of the wasp genome in dsU16-treated samples (Fig 7A), suggesting that dsU16 treatment did not completely abolish U16 expression across the three replicates. Upon individual analysis of each HdIV locus, we observed that read depth for every HdIV genome component (including IVSPERs, isolated IV replication genes, or HdIV proviral segments) was lower in the dsU16-treated samples compared to the dsGFP-treated samples (Fig 7B and 7C and S3 Table; statistical analyses at https://github.com/flegeai/EVE_amplification).

Graph: Fig 7 Impact of U16 RNAi knockdown on DNA proviral amplification.(A) Comparative distribution of read depths in dsGFP- and dsU16-injected females. For each of the three replicates, read depth values are given for HdIV loci (V) and random genomic regions outside of HdIV loci (W) with equal size distribution. IVSPERs and IV replication genes loci are shown in the left panel, while proviral segment loci are shown in the right panel. (B) Read depth values for IVSPERs and IV replication genes residing outside an IVSPER in three biological replicates treated with dsU16- or dsGFP. Names of HdIV loci and the scaffold (Scaf-) they are located are indicated. (C) Read depth values for proviral segment loci in the three biological replicates of the dsU16 and dsGFP samples. For better visualization, only the scaffold (Scaf-) on which the proviral segments are located is indicated. The list of the proviral segment loci within each scaffold is available in Table 1. The y-axis was transformed by the log function for better data visualization. In (A), significance levels are indicated as follows: **p<0.01, and ***p<0.001. In (B) and (C), all differences between dsU16 and dsGFP samples are statistically significant at p<0.05. Statistical test results are available at https://github.com/flegeai/EVE_amplification/blob/main/notebooks/Count_tables.ipynb.

We extended our analysis by injecting dsGFP or dsU16 into newly formed pupae followed by isolation of DNA from calyx cells and hind legs where no HdIV replication occurs. We then used specific primers and qPCR assays that measured DNA abundance of three wasp genes, selected HdIV replication genes inside and outside of IVSPERs, and selected HdIV genes in different proviral segments. As anticipated, no genes were amplified in hind legs from either control or treatment wasps (Fig 8 and Table D in S2 Dataset). In dsGFP-injected control wasps, all HdIV genes were amplified in calyx cell samples (Fig 8 and Table D in S2 Dataset). Among the wasp genes, only XRCC1 exhibited significant amplification, consistent with its location within the IVSPER-1 amplified region (Fig 8 and Table D in S2 Dataset). In contrast, when examining calyx cell DNA from wasps injected with dsU16, none of the HdIV genes nor XRCC1 were amplified (Fig 8 and Table D in S2 Dataset). Altogether, our results indicate that U16 is required for amplification of all HdIV loci.

Graph: Fig 8 Impact of U16 RNAi knockdown on amplification of selected wasp and HdIV genes.Relative genomic amplification of select HdIV genes in two-day-old females injected with dsGFP or dsU16. The wasp gene XRCC1, located within the amplified region of the IVSPER-1 locus, was incorporated into the analysis. Wasp histone (H1) and ribosomal protein (rpl) genes served as controls. Samples were obtained from calyx cells (where virion are produced) and hind legs (control). Statistical significance levels are denoted as follows: ns = non-significant, *p<0.05, **p<0.01, and ***p<0.001. Statistical test results are available in Table D in S2 Dataset. The y-axis values were transformed using the square root function for better data visualization.

Impact of DNA amplification on IV replication gene transcription levels and abundance of circ...

We hypothesized that amplification of IV replication genes would increase transcript abundance which in turn would be affected by inhibiting HdIV DNA amplification. We thus compared transcript abundance of various genes in IVSPER-1, -2, and -3, in calyx RNA samples that were collected from wasps treated with dsU16 or dsGFP. U16 knockdown reduced the expression of every HdIV replication gene we examined (Fig 9A and Table F in S2 Dataset).

Graph: Fig 9 Impact of U16 RNAi knockdown on HdIV replication gene expression and proviral segment amplification.(A) Relative expression of nine IVSPER genes in 2-day-old adult females injected with dsGFP (control) or dsU16. (B) Relative DNA amplification of the integrated linear (proviral) and circularized (episomal) forms of viral segment Hd29 in 2-day-old adult females injected with dsGFP (control) or dsU16. The left panel illustrates the position of primer pairs designed to selectively amplify the proviral form (Proviral Left and Right, indicated by red and black arrows), the circularized form (Episomal, red arrows), or both (Proviral + Episomal, brown arrows). Note that the primers designed to amplify circular molecules may also anneal to concatemers generated during segment amplification and processing. The right panel presents the relative amplification of each form using DNA from dsGFP- and dsU16-injected females. In both (A) and (B), significance levels are indicated as follows: ns = non-significant, *p<0.005, **p<0.01, and ***p<0.001. Statistical test results are available in Tables F (A) and H (B) in S2 Dataset. The y-axis values were transformed using the square root function for better data visualization.

Finally, we investigated the impact of U16 knockdown on the abundance of the circularized dsDNAs that are processed from amplified proviral segments. For this assay, we used PCR primers that specifically amplified the proviral form, circularized (episomal) or concatemeric form or both forms of Hd29 (Fig 9B and Table H in S2 Dataset). Results showed a significant reduction in both the proviral and circularized/concatemeric forms of Hd29 in calyx cell DNA from wasps injected with dsU16 when compared to DNA from wasps injected with dsGFP (Fig 9B and Table H in S2 Dataset). Our results thus indicated U16 is required for proviral segment amplification which is also required for production of circularized segments.

Discussion

This study significantly advances our understanding of IV replication by generating a chromosome-level assembly for the H. didymator genome, presenting several lines of evidence on the timing of HdIV loci amplification in calyx cells, and identifying U16 as an essential gene for amplification of all HdIV loci and virion formation. Read mapping to genomic DNA extracted from H. didymator pupal stages 1 and 3 further shows that all HdIV genome components are concurrently amplified in calyx cells while our qPCR data indicate amplification begins between pupal stage 1 and stage 2 and reaches its maximum in stage 4. This temporal pattern is overall similar to what has previously been shown to occur for BVs [[27], [35]]. In the braconid wasp Chelonus inanitus, DNA content of calyx cells also increases through endoreduplication [[36]], which we have not yet examined in H. didymator but is a future goal. Overall, our results indicate that DNA amplification of HdIV genome components in calyx cells is an essential feature of virion morphogenesis in H. didymator.

A second important advance from this study is our finding that HdIV loci are amplified with non-discrete boundaries that extend variable distances into flanking wasp DNA. This result differs from other integrated viruses such as polyomavirus where DNA undergoes "onion skin" replication, which is distinguished by replication forks that terminate at discrete boundaries [[37]]. In contrast, IVSPER amplification does share similarities with chorion genes in Drosophila follicle cells that are amplified in regions that extend ~100 kb beyond the genes themselves without discrete termination boundaries [[38]]. Similar to IVSPERs, levels of DNA amplification in Drosophila follicle cells also vary between amplicons [[39]]. Amplification of chorion gene loci is associated with multiple origins of replication (ORs) that are interspersed within each gene cluster. This results in overlapping bidirectional replication forks progressing outward on either side of the ORs [[40]]. Thus, IVSPERs and IV replication genes may also contain multiple ORs but additional approaches, such as nascent strand sequencing based on λ-exonuclease enrichment [[41]], will be necessary to determine if this is the case.

Our study also highlights that amplification of proviral segment loci is associated with a significant increase in read depth at the Direct Repeat (DR) positions bordering the proviral segments, which serve as sites for homologous recombination and circularization of the segments. This suggests that a portion of the rapid increase in read depth is due to reads mapping to amplification intermediates and circularized segments. The presence of concatemeric intermediates and circularized HdIV DNAs in the genomic DNA samples we sequenced is supported by our qPCR results for segment Hd29 (Fig 9B). Quantifying the proportion of reads mapping to the chromosomal form of HdIV segments was not possible in this study because our paired-end read data cannot discriminate between chromosomal HdIV DNA, potential replication intermediates, or circularized DNA. Long-read data as generated for some BVs [[42]] will be required to identify HdIV replication intermediates. Nonetheless, our results overall suggest HdIV proviral segment amplification involves both local chromosomal amplification and amplification of intermediates related to producing the circular dsDNAs that are packaged into capsids.

Our interest in U16 stemmed from previous results indicating it is transcriptionally upregulated in calyx cells before the appearance of envelope and capsid components [[17]]. Sequence analysis during this study revealed a PriCT-2 domain in U16, known from primases in herpesviruses, whose function is unknown but may facilitate the association of the large primase domain (AEP) with DNA [[33], [43]]. Although other known primase domains were not identified in the U16 sequence, the presence of a PriCT-2 domain suggested this protein might play a role in the replication of HdIV genome components. Additionally, our RNAi experiments demonstrate that U16 knockdown resulted in the complete absence of virion production in calyx cell nuclei and calyx fluid. These observations indicated an essential role for U16 in the early stages of viral replication, potentially involved in the amplification of HdIV genome components and/or the transcriptional regulation of IV replication genes. Subsequently, we analyzed the genome-wide impact of RNAi knockdown of U16 on HdIV loci amplification, revealing that this gene is crucial for the amplification of all H. didymator IV genome components. In the case of IV replication genes, reduced amplification was accompanied by a simultaneous significant reduction in transcript abundance, likely resulting in insufficient amounts of HdIV structural proteins. However, amplification and transcription abundance levels did not fully correlate with each other. For instance, U11 and IVSP3-1 (both located on IVSPER-2) exhibit similar amplification patterns (Fig 2), but earlier findings showed that transcript abundances were not the same in calyx cells [[16]]. Thus, differences in gene expression observed among genes located within the same amplified regions (Fig 2) could also be affected by promoter strength or other factors. On the other hand, inhibition of proviral segment loci amplification had consequences for the abundance of the circularized dsDNA that is packaged into capsids, which were drastically reduced. Thus, our results identify U16 as an essential protein for virion morphogenesis. However, its precise role in viral replication remains to be understood. Questions to be addressed in the future include whether U16 acts at the initiation or elongation step of HdIV DNA replication, whether it interacts directly with DNA, or with proteins from the replisome complex, which itself could be composed of a mixture of HdIV and wasp proteins.

BVs share some features with IVs but also exhibit differences. Notably, in contrast to IVs, where most core genes with functions in virion morphogenesis reside in IVSPERs, many BV core replication genes are widely dispersed in the genomes of wasps [[44], [46]] and are not amplified in calyx cells during virion morphogenesis [[41]]. However, some BV-producing wasp genomes contain a ~400 Kb DNA domain housing several nudiviral core genes, known as the nudivirus-like cluster, potentially marking the integration site of the nudivirus ancestor of BVs in the common ancestor of microgastroid braconids [[9]]. Notably, the nudivirus-like cluster is amplified with non-discrete boundaries [[42]], similar to what is reported for IV genome components in this study. The observed similarity in the amplification pattern between the BV nudivirus cluster and the proviral components of IVs could suggest they are amplified through a common mechanism, even though the specific genes involved differ.

BV genomes also contain proviral segment loci with boundaries defined by flanking DRs and amplified in regions that include flanking regions outside of each DR. However, unlike IV proviral segments, the amplified flanking regions in BVs contain very precise nucleotide junctions that identify the boundaries of amplification [[42], [47]]. It is also known that some BV proviral segments are amplified as head-to-tail concatemers, consistent with a rolling circle amplification mechanism, while others are amplified as head-to-head and tail-to-tail concatemers, suggesting amplification by different mechanisms. However, all of these concatemers are similarly processed into circular DNAs by recombination at a precise site within DRs, which is a tetramer conserved in all BV segments [[42], [47]]. Nudiviral genes encoding tyrosine recombinases are further known to mediate this homologous recombination event [[26]]. These types of molecules could also be present in IV genomes and need to be discovered. Currently, a detailed comparison between BV and IV proviral segment amplification is challenging and will require more information about the machinery involved in the processing of IV proviral segments into circular dsDNAs that are packaged into capsids.

Collectively, our results identify U16 as a gene deriving from the IV ancestor that is required for HdIV DNA replication. This indicates that viral regulatory factors required for DNA amplification other than U16 have been preserved in parasitoid genomes, which may also interact with wasp cellular machinery in regulating viral replication. Furthermore, this work emphasizes the value of studying original endogenized viruses, such as those found in parasitoids, to unveil new regulators of DNA processing.

Materials and methods

Insects

H. didymator was reared as previously outlined by [[48]]. Female pupae obtained from cocoons were staged using pigmentation patterns: stage 1, corresponding to hyaline pupae (approximately 3-day-old pupae); stage 2, had a pigmented thorax (4-day-old); stage 3, had a pigmented thorax and abdomen (5-day-old); stage 4, were pharate adults just before emergence.

Dovetail Omni-C Library preparation and sequencing

DNA from 10 male offspring (i.e., haploid genomes) from a single female H. didymator was sent on dry ice to Dovetail Genomics for Omni-C library construction. In the process of constructing the Dovetail Omni-C library, chromatin was fixed in place within the nucleus using formaldehyde and subsequently extracted. The fixed chromatin was digested with DNAse I followed by repair of chromatin ends and ligation to a biotinylated bridge adapter. Proximity ligation of adapter-containing ends ensued. Post-proximity ligation, crosslinks were reversed, and the DNA was purified. The purified DNA underwent treatment to eliminate biotin not internal to ligated fragments. Sequencing libraries were generated utilizing NEBNext Ultra enzymes and Illumina-compatible adapters. Fragments containing biotin were isolated using streptavidin beads before PCR enrichment of each library. The library was sequenced using the Illumina HiSeqX platform, which generated approximately 30x read depth. Subsequently, HiRise utilized reads with a mapping quality greater than 50 (MQ>50) for scaffolding purposes.

Scaffolding the assembly with HiRise

The de novo assembly from [[16]], and the Dovetail OmniC library reads served as input data for HiRise, a specialized software pipeline designed for leveraging proximity ligation data to scaffold genome assemblies, as outlined by [[49]]. The sequences from the Dovetail OmniC library were aligned to the initial draft assembly using the bwa tool (available at https://github.com/lh3/bwa). HiRise then analyzed the separations of Dovetail OmniC read pairs mapped within the draft scaffolds. This analysis generated a likelihood model for the genomic distance between read pairs. The model was subsequently employed to identify and rectify putative misjoins, score potential joins, and execute joins above a specified threshold. A contact map was generated from a BAM file by utilizing read pairs where both ends were aligned with a mapping quality of 60.

Genomic DNA (gDNA) extraction for high throughput sequencing

Comparative analysis of two pupal stages. Genomic DNA (gDNA) was extracted from pooled calyx samples dissected from H. didymator female pupae at stage 1 (~60 females) and stage 3 (~50 females). Since the aim was to compare the two developmental pupal stages, a single replicate was done for each stage. Impact of U16 knockdown. Genomic DNA from calyces was collected from stage 3 female pupae that were injected with dsGFP and dsU16. This experiment involved three biological replicates, each corresponding to 30 to 50 calyx samples. Genomic DNA was extracted using the phenol-chloroform method. Briefly, calyx samples were incubated in proteinase K (Ambion, 0.5 μg/μl) and Sarkosyl detergent (Sigma, 20%), followed by treatment with RNAse (Promega, 0.3 μg/μl). Total genomic DNA was then extracted through phenol-chloroform extraction and ethanol precipitation. Following extraction, gDNA was quantified using a QBIT fluorometer (ThermoFisher) and subsequently sent for sequencing to Genewiz/Azenta company. Paired-end sequencing was carried out using Illumina technology and NovaSeq 2x150bp platform.

NGS data analyses

Illumina reads were aligned to the updated version of the H. didymator genome using bwa mem [[50]], version 0.7.17, with default parameters. Subsequently, the aligned reads were converted to BAM files utilizing samtools view (version 1.15) [[51]].

Prediction of the amplified regions

Amplification peaks were identified using MACS2 [[31]] by comparing the pupal stage 3 alignment file as treatment and the pupal stage 1 alignment file as control. The specified parameters for this analysis were:—broad—nomodel -g 1.8e8 -q 0.01—min-length 5000. Out of the 165 predicted peaks (i.e., amplified regions), only those with a fold change (FC) higher than 2 were retained for further analyses, resulting in a total of 59 peaks. These 59 peaks encompassed all known proviral loci, except for Hd40, which had a slightly lower value than the specified threshold (FC = 1.9), and Hd45.1 and Hd2-like, located too close to the scaffold end and potentially missed. For the predicted peaks with FC>2 that did not correspond to known proviral loci, a manual curation was performed to determine whether these regions corresponded to HdIV loci. Proviral segments were identified by their flanking direct repeats (DRs) and gene contents, specifically the presence of genes belonging to IV segment conserved gene families. To identify putative core IV replication genes, genes present in the MACS2 peak were analyzed. Only those with no similarity to wasp proteins and that were transcribed in calyx cells (based on the available transcriptome from [[17]]) were retained.

Comparison of read depths between HdIV loci and the rest of the wasp genome

Read depths were computed for various types of virus-related genomic regions, including each locus (IVSPERs, IV replication genes outside IVSPERs, proviral segments), and each MACS2-detected amplified region. To calculate read depth for non-viral regions of the wasp genome, one hundred sets of random regions mimicking the size distribution of HdIV loci were generated using the shuffle tool from bedtools version 2.27 [[52]]. This was achieved by utilizing the bed file of HdIV loci (56 for proviral segments and 11 for IVSPERs) as parameters for the shuffle tool. Raw read counts were determined for each proviral and wasp genomic region using featureCounts [[53]] from the Subread package (version 2.0.1) with the parameters (-c -P -s 0 -O). Subsequently, read depth values were computed for each pupal stage (stage 1, St1 and stage 3, St3), as well as for each experiment (dsGFP and dsU16) and each replicate with a custom script available at https://github.com/flegeai/EVE%5famplification. Read depth values for each region were calculated by dividing the number of fragments mapped to the region by the size of the region (expressed in kilobase pairs, kbp), and further normalized by the depth of the library (expressed in million reads).

Genome read depths per position on H. didymator scaffolds (Counts per Million, CPM) and Maxim...

Genome read depths per position in 10 bp bins were acquired using the BamCoverage tool from the deeptools package [[54]] with the options:—normalizeUsing CPM and -bs 10. Subsequently, for each 10 bp bin, the pupal stage 3 (St3) versus stage 1 (St1) ratio was computed through an in-house script available at https://github.com/flegeai/EVE%5famplification. This script utilized the pyBigWig python library from deeptools [[54]]. To determine the maximal counts per million (CPM) at each stage for every proviral locus, an in-house script importing the pyBigWig python library was employed. The maximum CPM value for the "stage 3 / stage 1" ratio was then calculated based on the 10 bp bin bigwig file, specifically for the position displaying the highest CPM value at stage 3 (summit).

Search for motifs at the HdIV amplified regions boundaries

The MEME suite [[55]] was employed for analyses using default parameters and a search for six motifs. A dataset comprising a total of 110 sequences, each spanning 1,000 nucleotides on both sides of the start and end positions of the 55 HdIV amplified regions predicted by the MACS2 algorithm, was utilized for this analysis. As a control, a parallel analysis was conducted using 110 sequences, each 2,000 nucleotides in length, randomly selected from locations within the H. didymator genome but outside the HdIV loci. This control dataset allowed for the comparison of motif patterns between the HdIV amplified regions and randomly chosen genomic regions.

Genomic DNA extraction for gDNA amplification analyses by quantitative real-time PCR

To assess the level of DNA amplification, total genomic DNA (gDNA) was extracted using the DNeasy Blood & Tissue Kit (Qiagen) following the manufacturer's protocol. Ovaries (ovarioles removed) and hind legs, representing the negative control, were dissected from ten pupae at four different stages. Three replicates were generated for each pupal stage. Quantification of target gene amplification was conducted through quantitative PCR, utilizing LightCycler 480 SYBR Green I Master Mix (Roche) in 384-well plates (Roche). The total reaction volume per well was 3 μl, comprising 1.75 μl of the reaction mix (1.49 μl SYBR Green I Master Mix, 0.1 μl nuclease-free water, and 0.16 μl diluted primer), and 1.25 μl of each gDNA sample diluted to achieve a concentration of 1.2 ng/μl. Primers used are listed in S4 Table. The gDNA levels corresponding to the viral genes and the housekeeping wasp gene (elongation factor (ELF-1)) were determined using the LightCycler 480 System (Roche). The cycling conditions involved heating at 95°C for 10 min, followed by 45 cycles of 95°C for 10 s, 58°C for 10 s, and 72°C for 10 s. Each sample was evaluated in triplicate. The obtained DNA levels were normalized with respect to the wasp gene ELF-1. Raw data are provided in S2 Dataset.

Total RNA extraction

Total RNA was extracted from ovaries (ovarioles removed) dissected from pupae at different stages using the Qiagen RNeasy extraction kit in accordance with the manufacturer's protocol. To control for gene silencing, total RNAs were also extracted from individual adult wasp abdomens (2 to 4 days old). For this, Trizol reagent (Ambion) was initially used followed by extraction using the NucleoSpin RNA kit (Macherey-Nagel). Isolated RNA was then subjected to DNase treatment using the TURBO DNA-free Kit (Life Technologies) to assure removal of any residual genomic DNA from the RNA samples.

Protein sequence analyses

Conserved domains of U16 were identified using the CD-search tool available through NCBI's conserved domain database resource [[56]]. Subcellular localization predictions were made using the DeepLoc—2.0 tool, a deep learning-based approach for predicting the subcellular localization of eukaryotic proteins [[58]]. Protein multiple sequence alignment and phylogenetic tree were generated using MEGA11 software [[59]], using MUSCLE for protein alignment and Maximum Likelihood method to generate the phylogenetic tree. The alignment was visualized using Jalview [[60]]. Structure predictions for U16 were carried out using the MPI Bioinformatics Toolkit [[61]].

RNA interference (RNAi)

Gene-specific double-stranded RNA (dsRNA) used for RNAi experiments was prepared using the T7 RiboMAX Express RNAi System (Promega). Initially, a 350–450 bp fragment corresponding to the U16 sequence was cloned into the double T7 vector L4440 (a gift from Andrew Fire, Addgene plasmid # 1654). Subsequently, an in vitro transcription template DNA was PCR amplified with a T7 primer, and this template was used to synthesize sense and antisense RNA strands with T7 RNA polymerase at 37°C for 5 hours. The primers used for dsRNA production are listed in S4 Table. After annealing and DNase treatment using the TURBO DNA-free Kit (Life Technologies), the purified dsRNAs were resuspended in nuclease-free water, quantified using a NanoDrop ND-1000 Spectrophotometer (Thermo Scientific), and examined by agarose gel electrophoresis to ensure their integrity. Injections were performed in less than one-day-old female pupae using a microinjector (Fentojet Express, Eppendorf) and a micromanipulator (Narishige). Approximately 0.3–0.6 μl of 500 ng/μl dsRNA was injected into each individual. Control wasps were injected with a non-specific dsRNA homologous to the green fluorescent protein (GFP) gene. Treated pupae were kept in an incubator until adult emergence, which occurred approximately 5 days after injection.

Transmission electron microscopy

Ovaries were dissected from adult wasps between 2 and 3 days after emergence, following the procedures outlined in [[17]]. To ensure consistency of the observed phenotype, at least three females (taken at different microinjection dates) were observed for each tested dsRNA. For transmission electron microscopy (TEM) observations, calyces were fixed in a solution of 2% glutaraldehyde in PBS for 2 hours and then post-fixed in 2% osmium tetroxide in the same buffer for 1 hour. Tissues were subsequently bulk-stained for 2 hours in a 5% aqueous uranyl acetate solution, dehydrated in ethanol, and embedded in EM812 resin (EMS). Ultrathin sections were double-stained with Uranyless (DeltaMicroscopy) and lead citrate before examination under a Jeol 1200 EXII electron microscope at 100 kV (MEA Platform, University of Montpellier). Images were captured with an EMSIS Olympus Quemesa 11 Megapixels camera and analyzed using ImageJ software [[62]].

Reverse-transcriptase quantitative real-time PCR (RT-qPCR)

For RT-qPCR assays, 400 ng of total RNA was reverse-transcribed using the SuperScript III Reverse Transcriptase kit (Life Technologies) and oligo(dT)15 primer (Promega). The mRNA transcript levels of selected IVSPER genes were measured by quantitative reverse transcription-PCR (qRT-PCR) using a LightCycler 480 System (Roche) and SYBR Green I Master Mix (Roche). Expression levels were normalized relative to a housekeeping wasp gene (elongation factor 1 ELF-1). Each sample was evaluated in triplicate, and the total reaction volume per well was 3 μl, including 0.5 μM of each primer and cDNA corresponding to 0.88 ng of total RNA. The amplification program consisted of an initial step at 95°C for 10 min, followed by 45 cycles of 95°C for 10 s, 58°C for 10 s, and 72°C for 10 s. The primers used for this analysis are listed in S4 Table.

qPCR data analysis

Data were acquired using Light-Cycler 480 software. PCR amplification efficiency (E) for each primer pair was determined by linear regression of a dilution series (5x) of the cDNA pool. Relative expression, using the housekeeping gene ELF-1 as a reference, was calculated through advanced relative quantification (Efficiency method) software provided by Light-Cycler 480 software. For statistical analyses, Levene's and Shapiro-Wilk tests were employed to verify homogeneity of variance and normal distribution of data among the tested groups. Differences in gene relative expression between developmental stages and between dsGFP and dsU16-injected females were assessed using a two-tailed unpaired t-test for group comparison. In cases where homogeneity of variance was not assumed, the Welch-test was used to compare gene relative expression between groups. A p-value < 0.05 was considered significant. All statistical analyses were conducted using R [[63]]. Detailed statistical analyses of qPCR results are provided in S2 Dataset.

Supporting information

S1 Dataset

Hyposoter didymator Hi-C genome assembly.

The dataset includes: Fig A. Figure depicting the Hi-C scaffold contact map; Table A. Table presenting the Hi-C scaffolds containing HdIV loci; Fig B. Figure displaying the pairwise comparisons of HdIV segments located in close proximity within the H. didymator scaffolds.(DOCX)

S2 Dataset

Raw data and statistical analyses of qPCR analyses.

The dataset includes raw data and statistical analyses for: Genomic DNA amplification of IVSPER genes at four different H. didymator pupal stages, Table A. Raw data and Table B. Statistical analyses; Genomic DNA amplification of IVSPER and HdIV segment genes in dsGFP and dsU16-injected wasps, Table C. Raw data and Table D. Statistical analyses; RNA quantification of IVSPER genes in dsGFP and dsU16-injected wasps, Table E. Raw data and Table F. Statistical analyses; DNA amplification of Hd29 segment in dsGFP and dsU16-injected wasps, Table G. Raw data and Table H. Statistical analyses.(DOCX)

S3 Dataset

Sequence analysis and alignment of the U16 gene from H. didymator to four other wasp species that harbor IVs.

The dataset includes: Table A. Raw sequence of U16 proteins. Table B. Position of the PriCT-domain in U16 proteins. Fig A. Detail of the predicted secondary structure of the PricT-2 domain in the H. didymator U16 protein. Fig B. Subcellular localization of U16 predicted by DeepLoc 2.0.(DOCX)

S1 Table

Read depth of HdIV loci on each scaffold of the H. didymator genome.

(DOCX)

S2 Table

List of the peaks predicted in H. didymator genome scaffolds using MACS2 algorithm.

(DOCX)

S3 Table

Read depth of HdIV amplified regions in calyx cell DNA from dsGFP- and dsU16-injected female pupae.

(DOCX)

S4 Table

List of primers used in the present work.

(DOCX)

S1 Fig

DNA amplification patterns of HdIV loci in calyx cells of H. didymator.

(DOCX)

S2 Fig

HdIV amplified regions in Scaffold-11.

(DOCX)

S3 Fig

MEME analysis of boundaries of the predicted MACS2 HdIV amplified regions.

(DOCX)

The insects used in the experiments were provided by Raphaël BOUSQUET and Gaétan CLABOTS from the DGIMI insect rearing facility. The authors also thank Dr Jean-Claude OGIER for his assistance in generating the phylogenetic tree of U16 proteins. All RNAi experiments were conducted in the insect quarantine platform (PIQ) of DGIMI lab, which is a member of the Montpellier Vectopole Sud network (https://www.vectopole-sud.fr/). Microscopy observations were facilitated by the Montpellier MEA platform (https://mea.edu.umontpellier.fr/). All qPCR analyses were performed with the assistance of the Montpellier Genomix qPHD platform (http://www.pbs.univ-montp2.fr/).

Footnotes 1 The authors have declared that no competing interests exist. References Katzourakis A, Gifford RJ. Endogenous viral elements in animal genomes. PLoS Genet. (2010) Nov 18; 6(11):e1001191. doi: 10.1371/journal.pgen.1001191, 21124940 2 Kryukov K, Ueda MT, Imanishi T, Nakagawa S. Systematic survey of non-retroviral virus-like elements in eukaryotic genomes. Virus Res. (2019) Mar; 262:30–36. doi: 10.1016/j.virusres.2018.02.002, 29425804 3 Frank JA, Feschotte C. Co-option of endogenous viral sequences for host cell function. Curr Opin Virol. (2017) Aug; 25:81–89. doi: 10.1016/j.coviro.2017.07.021, 28818736 4 Feschotte C, Gilbert C. Endogenous viruses: insights into viral evolution and impact on host biology. Nat Rev Genet. (2012); 13(4):283–296. doi: 10.1038/nrg3199, 22421730 5 Gilbert C, Feschotte C. Genomic fossils calibrate the long-term evolution of hepadnaviruses. PLoS Biol. (2010) Sep; 8(9):e1000495. doi: 10.1371/journal.pbio.1000495, 20927357 6 Drezen JM, Bézier A, Burke GR, Strand MR. Bracoviruses, ichnoviruses, and virus-like particles from parasitoid wasps retain many features of their virus ancestors. Curr Opin Insect Sci. (2022) Feb; 49:93–100. doi: 10.1016/j.cois.2021.12.003, 34954138 7 Stoltz DB, Vinson SB. Viruses and parasitism in insects. Adv Virus Res. (1979); 24:125–71. doi: 10.1016/s0065-3527(08)60393-0, 389003 8 Webb BA, Strand MR. The biology and genomics of polydnaviruses. In: Comprehensive Molecular Insect Science, Vol. 6, ed. Iatrou K, Gill S, pp. 323–60. Amsterdam: Pergamon. (2005). 9 Bézier A, Annaheim M, Herbinière J, Wetterwald C, Gyapay G, Bernard-Samain S, et al. Polydnaviruses of braconid wasps derive from an ancestral nudivirus. Science. (2009) Feb 13; 323(5916):926–30. doi: 10.1126/science.1166788, 19213916 Pichon A, Bézier A, Urbach S, Aury JM, Jouan V, Ravallec M, et al. Recurrent DNA virus domestication leading to different parasite virulence strategies. Sci Adv. (2015) Nov 27; 1(10):e1501150. doi: 10.1126/sciadv.1501150, 26702449 Burke GR. Common themes in three independently derived endogenous nudivirus elements in parasitoid wasps. Curr Opin Insect Sci. (2019) Apr; 32:28–35. Epub 2018 Oct 23. doi: 10.1016/j.cois.2018.10.005, 31113628. Volkoff AN, Jouan V, Urbach S, Samain S, Bergoin M, Wincker P, et al. Analysis of virion structural components reveals vestiges of the ancestral ichnovirus genome. PLoS Pathog. (2010) May 27; 6(5):e1000923. doi: 10.1371/journal.ppat.1000923, 20523890 Béliveau C, Cohen A, Stewart D, Periquet G, Djoumad A, Kuhn L, et al. Genomic and Proteomic Analyses Indicate that Banchine and Campoplegine Polydnaviruses Have Similar, if Not Identical, Viral Ancestors. J Virol. (2015) Sep; 89(17):8909–21. doi: 10.1128/JVI.01001-15, 26085165 Koonin EV, Dolja VV, Krupovic M, Varsani A, Wolf YI, Yutin N, Zerbini FM, Kuhn JH. Global organization and proposed megataxonomy of the virus world. Microbiol Mol Biol Rev. (2020) Mar 4; 84(2):e00061–19. doi: 10.1128/MMBR.00061-19, 32132243 Volkoff A-N, Huguet E. Polydnaviruses (Polydnaviridae). In: Bamford DH, Zuckerman M, editors. Encyclopedia of Virology (Fourth Edition). Academic Press, Oxford; (2021). pp. 849–857. Legeai F, Santos BF, Robin S, Bretaudeau A, Dikow RB, Lemaitre C, et al. Genomic architecture of endogenous ichnoviruses reveals distinct evolutionary pathways leading to virus domestication in parasitic wasps. BMC Biol. (2020) Jul 24; 18(1):89. doi: 10.1186/s12915-020-00822-3, 32703219 Lorenzi A, Ravallec M, Eychenne M, Jouan V, Robin S, Darboux I, et al. RNA interference identifies domesticated viral genes involved in assembly and trafficking of virus-derived particles in ichneumonid wasps. PLoS Pathog. (2019) Dec 13; 15(12):e1008210. doi: 10.1371/journal.ppat.1008210, 31834912 Robin S, Ravallec M, Frayssinet M, Whitfield J, Jouan V, Legeai F, et al. Evidence for an ichnovirus machinery in parasitoids of coleopteran larvae. Virus Res. (2019); 263: 189–206. doi: 10.1016/j.virusres.2019.02.001, 30738799 Cui L, Webb BA. Homologous sequences in the Campoletis sonorensis polydnavirus genome are implicated in replication and nesting of the W segment family. J Virol. (1997) Nov; 71(11):8504–13. doi: 10.1128/JVI.71.11.8504-8513.1997 Rattanadechakul W, Webb BA. Characterization of Campoletis sonorensis ichnovirus unique segment B and excision locus structure. J Insect Physiol. (2003) May; 49(5):523–32. doi: 10.1016/s0022-1910(03)00053-2, 12770631 Blissard GW, Smith OP, Summers MD. Two related viral genes are located on a single superhelical DNA segment of the multipartite Campoletis sonorensis virus genome. Virology. (1987) Sep; 160(1):120–34. doi: 10.1016/0042-6822(87)90052-3, 3629971 Theilmann DA, Summers MD. Molecular analysis of Campoletis sonorensis virus DNA in the lepidopteran host Heliothis virescens. J Gen Virol. (1986) Sep; 67(Pt 9):1961–9. doi: 10.1099/0022-1317-67-9-1961, 3746255 Webb BA, Strand MR, Dickey SE, Beck MH, Hilgarth RS, Barney WE, et al. Polydnavirus genomes reflect their dual roles as mutualists and pathogens. Virology. (2006) Mar 30; 347(1):160–74. doi: 10.1016/j.virol.2005.11.010, 16380146 Dorémus T, Cousserans F, Gyapay G, Jouan V, Milano P, Wajnberg E, et al. Extensive transcription analysis of the Hyposoter didymator ichnovirus genome in permissive and non-permissive lepidopteran host species. PLoS One. (2014) Aug 12; 9(8):e104072. doi: 10.1371/journal.pone.0104072, 25117496 Volkoff AN, Ravallec M, Bossy JP, Cerutti P, Rocher J, Cerutti M, Devauchelle G. The replication of Hyposoter didymator polydnavirus: Cytopathology of the calyx cells in the parasitoid. Biology of the Cell. (1995); 83(1):1–13. Burke GR, Thomas SA, Eum JH, Strand MR. Mutualistic polydnaviruses share essential replication gene functions with pathogenic ancestors. PLoS Pathog. (2013); 9(5):e1003348. doi: 10.1371/journal.ppat.1003348, 23671417 Lorenzi A, Arvin MJ, Burke GR, Strand MR. Functional characterization of Microplitis demolitor bracovirus genes that encode nucleocapsid components. J Virol. (2023) Oct 25:e0081723. doi: 10.1128/jvi.00817-23, 37877717 Rocher J, Ravallec M, Barry P, Volkoff AN, Ray D, Devauchelle G, Duonor-Cérutti M. Establishment of cell lines from the wasp Hyposoter didymator (Hym., Ichneumonidae) containing the symbiotic polydnavirus H. didymator ichnovirus. J Gen Virol. (2004) Apr; 85(Pt 4):863–868. doi: 10.1099/vir.0.19713-0 Voorrips RE. MapChart: software for the graphical presentation of linkage maps and QTLs. J Hered. (2002) Jan-Feb; 93(1):77–8. doi: 10.1093/jhered/93.1.77, 12011185 Krell PJ, Summers MD, Vinson SB. Virus with a multipartite superhelical DNA genome from the ichneumonid parasitoid Campoletis sonorensis. J Virol. (1982) Sep; 43(3):859–70. doi: 10.1128/JVI.43.3.859-870.1982, 16789230 Robinson JT, Thorvaldsdóttir H, Winckler W, Guttman M, Lander ES, Getz G, Mesirov JP. Integrative Genomics Viewer. Nat Biotechnol. (2011); 29:24–26. doi: 10.1038/nbt.1754, 21221095 Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. (2008); 9(9):R137. doi: 10.1186/gb-2008-9-9-r137, 18798982 Iyer LM, Koonin EV, Leipe DD, Aravind L. Origin and evolution of the archaeo-eukaryotic primase superfamily and related palm-domain proteins: structural insights and new members. Nucleic Acids Res. (2005) Jul 15; 33(12):3875–96. doi: 10.1093/nar/gki702, 16027112. Burke GR, Hines HM, Sharanowski BJ. The presence of ancient core genes reveals endogenization from diverse viral ancestors in parasitoid wasps. Genome Biol Evol. (2021) Jul 6; 13(7):evab105. doi: 10.1093/gbe/evab105, 33988720. Marti D, Grossniklaus-Bürgin C, Wyder S, Wyler T, Lanzrein B. Ovary development and polydnavirus morphogenesis in the parasitic wasp Chelonus inanitus. I. Ovary morphogenesis, amplification of viral DNA and ecdysteroid titres. J Gen Virol. (2003) May; 84(Pt 5):1141–1150. doi: 10.1099/vir.0.18832-0 Wyler T, Lanzrein B. Ovary development and polydnavirus morphogenesis in the parasitic wasp Chelonus inanitus. II. Ultrastructural analysis of calyx cell development, virion formation and release. J Gen Virol. (2003); 84:1151–63. doi: 10.1099/vir.0.18830-0 Baran N, Neer A, Manor H. "Onion skin" replication of integrated polyoma virus DNA and flanking sequences in polyoma-transformed rat cells: termination within a specific cellular DNA segment. Proc Natl Acad Sci U S A. (1983) Jan; 80(1):105–9. doi: 10.1073/pnas.80.1.105, 6296858 Spradling AC. The organization and amplification of two chromosomal domains containing Drosophila chorion genes. Cell. (1981) Nov; 27(1 Pt 2):193–201. doi: 10.1016/0092-8674(81)90373-1, 6799210 Kim JC, Nordman J, Xie F, Kashevsky H, Eng T, Li S, et al. Integrative analysis of gene amplification in Drosophila follicle cells: parameters of origin activation and repression. Genes Dev. (2011) Jul 1; 25(13):1384–98. doi: 10.1101/gad.2043111, 21724831 Tower J.Developmental gene amplification and origin regulation. Annu Rev Genet. (2004); 38:273–304. doi: 10.1146/annurev.genet.37.110801.143851, 15568978 Foulk MS, Urban JM, Casella C, Gerbi SA. Characterizing and controlling intrinsic biases of lambda exonuclease in nascent strand sequencing reveals phasing between nucleosomes and G-quadruplex motifs around a subset of human replication origins. Genome Res. (2015) May; 25(5):725–35. doi: 10.1101/gr.183848.114, 25695952 Burke GR, Simmonds TJ, Thomas SA, Strand MR. Microplitis demolitor Bracovirus proviral loci and clustered replication genes exhibit distinct DNA amplification patterns during replication. J Virol. (2015) Sep; 89(18):9511–23. doi: 10.1128/JVI.01388-15, 26157119 Weller SK, Kuchta RD. The DNA helicase-primase complex as a target for herpes viral infection. Expert Opin Ther Targets. (2013) Oct; 17(10):1119–32. doi: 10.1517/14728222.2013.827663, 23930666 Burke GR, Walden KK, Whitfield JB, Robertson HM, Strand MR. Widespread genome reorganization of an obligate virus mutualist. PLoS Genet. (2014) Sep; 10(9):e1004660. doi: 10.1371/journal.pgen.1004660, 25232843 Gauthier J, Boulain H, van Vugt JJFA, Baudry L, Persyn E, Aury JM, et al. Chromosomal scale assembly of parasitic wasp genome reveals symbiotic virus colonization. Commun Biol. (2021) Jan 22; 4(1):104. Erratum in: Commun Biol. 2021 Jul 30;4(1):940. doi: 10.1038/s42003-020-01623-8, 33483589 Mao M, Strand MR, Burke GR. The complete genome of Chelonus insularis reveals dynamic arrangement of genome components in parasitoid wasps that produce bracoviruses. J Virol. (2022) Mar 9; 96(5):e0157321. doi: 10.1128/JVI.01573-21, 34985997 Louis F, Bézier A, Periquet G, Ferras C, Drezen JM, Dupuy C. The bracovirus genome of the parasitoid wasp Cotesia congregata is amplified within 13 replication units, including sequences not packaged in the particles. J Virol. (2013) Sep; 87(17):9649–60. doi: 10.1128/JVI.00886-13, 23804644 Visconti V, Eychenne M, Darboux I. Modulation of antiviral immunity by the ichnovirus HdIV in Spodoptera frugiperda. Mol Immunol. (2019) Apr; 108:89–101. doi: 10.1016/j.molimm.2019.02.011, 30784767 Putnam NH, O'Connell BL, Stites JC, Rice BJ, Blanchette M, Calef R, et al. Chromosome-scale shotgun assembly using an in vitro method for long-range linkage. Genome Res. (2016) Mar; 26(3):342–50. doi: 10.1101/gr.193474.115, 26848124 Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv:1303.3997v2 [q-bio.GN]. Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, et al. Twelve years of SAMtools and BCFtools. Gigascience. (2021) Feb 16; 10(2):giab008. doi: 10.1093/gigascience/giab008, 33590861 Quinlan AR, Hall IM. BEDTools: A flexible suite of utilities for comparing genomic features. Bioinformatics. (2010) Mar 15; 26(6):841–2. doi: 10.1093/bioinformatics/btq033, 20110278 Liao Y, Smyth GK, Shi W. featureCounts: An efficient general-purpose program for assigning sequence reads to genomic features. Bioinformatics. (2014) Apr 1; 30(7):923–30. doi: 10.1093/bioinformatics/btt656, 24227677 Ramírez F, Ryan DP, Grüning B, Bhardwaj V, Kilpert F, Richter AS, et al. deepTools2: A next-generation web server for deep-sequencing data analysis. Nucleic Acids Res. (2016) Jul 8; 44(W1):W160–5. doi: 10.1093/nar/gkw257, 27079975 Bailey TL, Johnson J, Grant CE, Noble WS. The MEME Suite. Nucleic Acids Res. (2015) Jul 1; 43(W1):W39–49. doi: 10.1093/nar/gkv416, 25953851 Marchler-Bauer A, Bo Y, Han L, He J, Lanczycki CJ, Lu S, et al. CDD/SPARCLE: functional classification of proteins via subfamily domain architectures. Nucleic Acids Res. (2017) Jan 4; 45(D1):D200–D203. doi: 10.1093/nar/gkw1129, 27899674 Lu S, Wang J, Chitsaz F, Derbyshire MK, Geer RC, Gonzales NR, et al. CDD/SPARCLE: the conserved domain database in 2020. Nucleic Acids Res. (2020) Jan 8; 48(D1):D265–D268. doi: 10.1093/nar/gkz991, 31777944 Thumuluri V, A Armenteros JJ, Johansen AR, Nielsen H, Winther O. DeepLoc 2.0: multi-label subcellular localization prediction using protein language models. Nucleic Acids Research. (2022). doi: 10.1093/nar/gkac278, 35489069 Tamura K, Stecher G, Kumar S. MEGA11: Molecular Evolutionary Genetics Analysis Version 11. Mol Biol Evol. (2021) Jun 25; 38(7):3022–3027. doi: 10.1093/molbev/msab120, 33892491 Waterhouse AM, Procter JB, Martin DMA, Clamp M, Barton GJ. Jalview Version 2—A multiple sequence alignment editor and analysis workbench. Bioinformatics. (2009); 25:1189–1191. doi: 10.1093/bioinformatics/btp033, 19151095 Gabler F, Nam SZ, Till S, Mirdita M, Steinegger M, Söding J, et al. Protein sequence analysis using the MPI Bioinformatics Toolkit. Curr Protoc Bioinformatics. (2020) Dec; 72(1):e108. doi: 10.1002/cpbi.108, 33315308 Rasband WS. ImageJ. National Institutes of Health, Bethesda, Maryland, USA. 1997–2018. http://imagej.nih.gov/ij R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. R Core Team. 2023. URL https://www.R-project.org/.

By Ange Lorenzi; Fabrice Legeai; Véronique Jouan; Pierre-Alain Girard; Michael R. Strand; Marc Ravallec; Magali Eychenne; Anthony Bretaudeau; Stéphanie Robin; Jeanne Rochefort; Mathilde Villegas; Gaelen R. Burke; Rita Rebollo; Nicolas Nègre and Anne-Nathalie Volkoff

Reported by Author; Author; Author; Author; Author; Author; Author; Author; Author; Author; Author; Author; Author; Author; Author

Titel:
Identification of a viral gene essential for the genome replication of a domesticated endogenous virus in ichneumonid parasitoid wasps.
Autor/in / Beteiligte Person: Lorenzi, Ange ; Legeai, Fabrice ; Jouan, Véronique ; Girard, Pierre-Alain ; Michael R Strand ; Ravallec, Marc ; Eychenne, Magali ; Bretaudeau, Anthony ; Robin, Stéphanie ; Rochefort, Jeanne ; Villegas, Mathilde ; Gaelen R Burke ; Rebollo, Rita ; Nègre, Nicolas ; Volkoff, Anne-Nathalie
Link:
Zeitschrift: PLoS Pathogens, Jg. 20 (2024-04-01), Heft 4, S. e1011980
Veröffentlichung: Public Library of Science (PLoS), 2024
Medientyp: academicJournal
ISSN: 1553-7366 (print) ; 1553-7374 (print)
DOI: 10.1371/journal.ppat.1011980
Schlagwort:
  • Immunologic diseases. Allergy
  • RC581-607
  • Biology (General)
  • QH301-705.5
Sonstiges:
  • Nachgewiesen in: Directory of Open Access Journals
  • Sprachen: English
  • Collection: LCC:Immunologic diseases. Allergy ; LCC:Biology (General)
  • Document Type: article
  • File Description: electronic resource
  • Language: English

Klicken Sie ein Format an und speichern Sie dann die Daten oder geben Sie eine Empfänger-Adresse ein und lassen Sie sich per Email zusenden.

oder
oder

Wählen Sie das für Sie passende Zitationsformat und kopieren Sie es dann in die Zwischenablage, lassen es sich per Mail zusenden oder speichern es als PDF-Datei.

oder
oder

Bitte prüfen Sie, ob die Zitation formal korrekt ist, bevor Sie sie in einer Arbeit verwenden. Benutzen Sie gegebenenfalls den "Exportieren"-Dialog, wenn Sie ein Literaturverwaltungsprogramm verwenden und die Zitat-Angaben selbst formatieren wollen.

xs 0 - 576
sm 576 - 768
md 768 - 992
lg 992 - 1200
xl 1200 - 1366
xxl 1366 -