Science & Technology

Binary vector copy number engineering improves -mediated transformation

Posted by

Tamunofiniarisa

On November 5, 2024

0 comments

Technology tamfitronics

Main

Agrobacterium-mediated transformation (AMT) is an indispensable tool in both plant and fungal biotechnology for transgene insertion into target cells^1,2,3. By replacing the native oncogenic genes used by the bacterial pathogen Agrobacterium tumefaciens with user-defined DNA sequences, the innate DNA transfer capacity of this bacterium can be exploited for untargeted DNA insertions into diverse plant, fungal and mammalian cell lines^4,5,6,7. Initial improvements to AMT involved relocating two DNA sequences known as the left and right border (LB and RB) from the native ~200-kb tumor-inducing (Ti) plasmid to a smaller helper plasmid known as a binary vector⁸. Any DNA contained between the LB and RB is mobilized as a transfer DNA (T-DNA) and inserted into target cells with help from virulence (for) genes contained within the Ti plasmid, enabling tractable engineering by cloning different sequences into the T-DNA region¹. The simplicity of changing the transgene target by customizing the sequence between the LB and RB has made AMT an important tool for agricultural biotechnology, bioenergy crop engineering and synthetic biology.

Despite the genetic revolution that AMT initiated, efficient transformation is still a considerable bottleneck for genetic engineering in most plant species. Decades of optimization of AMT have identified induction conditions^9,10strains of A. tumefaciens⁴ and enhancements to for gene expression that have increased AMT efficiency in numerous plant species¹¹. In addition to the transgene-harboring binary vector, some protocols have used a second introduced plasmid, termed a ternary vector, which overexpresses various genes involved in Agrobacterium virulence to improve AMT efficiencies in recalcitrant plants such as maize¹¹. Such work has inspired more recent research to completely refactor the Ti plasmid that harbors the for genes¹²laying the groundwork for fine-tuned engineering of individual virulence components to further enhance AMT. Even with these improvements, transformation efficiencies remain low for most genetic backgrounds and limit the scale and throughput of genetic engineering projects, presenting a need for improved tools to better control and enhance the AMT process.

While many improvements to AMT have focused on either altering for gene expression or sequences within the T-DNA^1,2,13,14relatively little work has been conducted to engineer the binary vector backbone itself. Binary vectors consist of a T-DNA region, a bacterial selectable marker and an origin of replication (ORI) to enable proper replication and maintenance of the plasmid in Agrobacterium. Within plasmid terminology, the term ORI refers to both the origin of vegetative replication (oriV, as cataloged by Dong et al.¹⁵) where plasmid replication initiates and the coding sequences for specific proteins that bind to the oriV or host replication factors to regulate plasmid copy number, stability and partitioning such as Rep, Stb and Par proteins, respectively. In this manuscript, the term ORI refers to the entire DNA sequence that mediates plasmid replication, which includes the oriV and trans-acting factors such as Rep proteins¹⁶. For binary vectors, the ORI, which dictates both host range and copy number, is known to impact AMT efficiency in plants^16,17,18,19. Zhi et al. postulated a direct relationship between binary vector copy number and maize transformation efficiency through a comparison of three binary vector ORIs, suggesting that higher copy numbers could be used to improve AMT¹⁷. A similar comparison of four ORIs conducted by Oltmanns et al. in Arabidopsis thaliana and a different cultivar of maize uncovered a more complicated relationship that implicated the strain of A. tumefaciensthe ORI and the plant genetic background as factors that independently influence transformation efficiencies¹⁸. Notably, however, neither study evaluated the impact of copy number variants within the same ORI, making it difficult to isolate the impact of copy number alone while controlling for intrinsic differences inherent to each ORI. While there is evidence that different ORI copy numbers influence AMT efficiency, there has never been an attempt to systematically modify this variable for the binary vector. The best attempt to date at direct manipulation of the binary vector ORI to alter AMT outcomes was by Vaghchhipawala et al., who evaluated a single higher-copy-number mutant of the pRi ORI and found no improvement in stable transformation¹⁹. Thus, a more comprehensive screen of ORI variants is needed to better understand the relationship among ORI identity, plasmid copy number and AMT transformation outcomes.

In prokaryotic synthetic biology, it has long been known that the copy number of a plasmid can dramatically impact engineered metabolic pathways and the functionality of synthetic circuits^{20,21,22,23,24}. As plasmid copy number influences transgene expression magnitude and metabolic load, previous work has been conducted to engineer plasmid copy number, particularly in narrow-host-range ORIs commonly used in Escherichia coli such as pMB1 and pSC101 derivatives^21,25,26. For broad-host-range ORIs that are used within A. tumefaciensengineering efforts have been extremely limited with only a few mutants isolated^{20,27,28,29,30}. To evaluate the impact of plasmid copy number across multiple broad-host-range ORIs used for AMT, a high-throughput screen is needed to identify and characterize numerous copy number variants. To date, however, no generally applicable method exists that can be applied to diverse ORIs to systematically screen for copy number diversity.

To specifically evaluate the impact of binary vector copy on AMT, we sought to generate copy number variants in four broad-host-range origins (RK2, pVS1, pSa and BBR1) that have been widely used in Agrobacterium and prokaryotic biotechnology more broadly. To accomplish this, we leveraged a high-throughput growth-coupled selection assay to rapidly identify ORI mutations that influence copy number. Across all four origins, we were able to use a transient expression assay in Nicotiana benthamiana to rapidly screen for mutants that could improve AMT efficiency. Using top candidate plasmid variants from this screen, we were able to notably improve stable transformation in both A. thaliana and the oleaginous yeast Rhodosporidium toruloides. Thus, our work demonstrates the impact of binary vector backbone engineering on AMT across kingdoms, creating an easily deployable strategy to improve transformation.

Results

Directed evolution pipeline to diversify plasmid copy number

As previous work suggested a relationship between binary vector copy number and AMT transformation efficiency¹⁷we sought to develop a method to systematically select for higher-copy-number mutants across diverse ORIs. Despite numerous intrinsic differences, many ORIs use analogous but nonhomologous replication initiation proteins to control plasmid replication and copy number. These nonhomologous RepA proteins share a general function of binding in cis to a motif within the oriV of the plasmid ORI to recruit various endogenous bacterial factors to enable plasmid replication³¹. Previous studies demonstrated that mutations impacting RepA proteins can alter plasmid copy number for diverse ORIs including pSC101 (ref. ²¹), BBR1 (ref. ²⁰), RK2 (refs. ^29,30) and pVS1 (ref. ³²). To leverage this general property, the entire beet open reading frame (ORF) was randomly mutagenized using error-prone PCR (epPCR) for the pVS1, pSa, RK2 and BBR1 ORIs. These ORIs are derived from unique incompatibility groups and their respective RepA proteins share no sequence homology (Supplementary Fig. 1). For each ORI, these mutagenized ORFs were then built into a selection vector and ~100,000 colonies were pooled per ORI to create four mutant libraries within the C58C1 strain of A. tumefaciens.

To select for higher-copy-number variants within these libraries, we use a directed evolution assay that coupled plasmid copy number to bacterial antibiotic tolerance (Fig. 1). This survival-coupled selection identified wild type (WT)-lethal conditions that were permissible for growth of the mutant population, enriching higher-copy-number mutants within this population (Supplementary Fig. 2). For each selected library, the entire population’s beet ORFs were sequenced using an Illumina MiSeq alongside unselected library controls to evaluate the selective pressure of each RepA residue on survival.

Technology tamfitronics figure 1 — **Fig. 1: Directed evolution pipeline to generate plasmid copy number variants using next-generation sequencing.**

This whole-population sequencing approach enabled the quantification of mutant enrichment at every residue of diverse RepA proteins (Fig. 2 and Supplementary Fig. 3). By comparing mutant distribution frequencies between the unselected and selected population, residues that contributed to the survival of the mutants in WT-lethal conditions can be identified. For the RK2 and BBR1 ORIs, large fold-change enrichments for specific nucleotide positions within the selected population were found that reached 63.8-fold and 26.7-fold over the unselected population. The pVS1 and pSa ORIs had notably lower fold-change enrichments reaching 7.2-fold and 4.4-fold, respectively. Across the four ORIs, we observed>2-fold enrichment for 38, 118, 126 and 150 nucleotide positions for pSa, pVS1, BBR1 and RK2. Mutations that were significantly enriched in the selected population compared to the unselected control were putatively associated with a higher-copy-number phenotype (Supplementary Table 1).

Technology tamfitronics figure 2 — **Fig. 2: Enriched residue sites after copy number variant selection.**

Mapping selected residues onto a RepA AlphaFold-generated model showed a significant enrichment of mutation sites on the predicted dimerization interfaces for all four ORIs (Supplementary Fig. 4). Some RepA proteins are postulated to regulate plasmid copy number through a ‘handcuffing’ process in which monomeric RepA promotes plasmid replication while the dimerized form inhibits further polymerase activity, finalizing plasmid copy number³³. It has been hypothesized that mutations that weaken the affinity of RepA to itself may reduce dimerization and enable additional replication, resulting in a higher final plasmid copy number²¹; the enrichment of selected residues on the dimerization interface for the four nonhomologous RepA proteins screened supports this notion. From this dataset, we chose ~20 residues per ORI that were found to be highly enriched for a mutation corresponding to an amino acid substitution to further characterize copy number and AMT performance in a N. benthamiana transient expression assay.

ORI copy mutants enhance plant transient transformation

Residues that were found to be significantly enriched in our selection were cloned into uniform plant expression binary vectors. For a given ORI, identical binary vectors consisting of a constitutive plant promoter driving green fluorescent protein (GFP) were created, varying by one single-nucleotide polymorphism (SNP) within the beet ORF corresponding to one of the selected mutations. These vectors were transformed into the EHA105 strain of A. tumefaciens and used to screen the impact of single beet SNPs in a transient expression assay in N. benthamiana (Fig. 3a). Because the binary vectors for a given ORI were identical except for a single SNP in beetdifferences in plant GFP output were attributed to the impact of this SNP on AMT efficiency. A total of 71 candidates across all origins were screened, and mutants for all four ORIs were identified that significantly increased GFP output relative to their WT forms (Fig. 3b and Supplementary Table 1).

Technology tamfitronics figure 3 — **Fig. 3: Screening RepA mutants for improved N. *benthamiana* transient transformation.**

We found the pSa ORI to have the largest fold-change increase in expression, with the E90K mutant having a 6.9-fold increase in GFP output relative to WT pSa. Furthermore, pSa had the greatest number of mutants that showed improvement over the WT form with 13 of the 19 mutants tested having significantly higher expression. This ORI also had the largest dynamic range of GFP expression, which spanned 28-fold over the 19 total mutants, significantly extending above and below WT output levels. These results are consistent with the known low WT copy number of this ORI in Agrobacteriumwhich potentially enables large improvements to be made by increasing the copy number¹⁸.

The RK2 origin also showed a substantial increase in plant GFP output, with the S20F mutant showing a 5.4-fold increase in expression. Notably for this ORI, six of the 19 mutants screened had extremely low GFP expression coupled with an observed slow growth phenotype, forming two distinct groups of high-expressing or low-expressing mutants rather than the gradient of outputs observed in other ORIs.

Of the four ORIs used in this study, pVS1 had the highest GFP output in its WT form (Supplementary Fig. 6). As RK2 was used in a pilot experiment and found to increase GFP expression 5.4-fold above WT levels for the best mutant, a lower-strength constitutive promoter, pCL2 (ref. ³⁴), was used to screen pVS1 mutants to prevent saturating the machine’s GFP channel. A 2.1-fold increase in GFP output was observed for the best-performing pVS1 mutant, R106H. The BBR1 ORI was also found to have a high level of expression in its WT form; thus, the same pCL2 promoter was used to drive GFP expression in planta. For the 16 BBR1 mutants screened, ten had significantly lower GFP outputs than the WT form, making this the only ORI with a majority of mutants having decreased expression. The top-performing mutant, E182V, showed a 1.7-fold increase in GFP output relative to WT BBR1. As no previous work established the copy number of BBR1 in A. tumefaciensthis result suggests that increasing the copy number may not be as beneficial for this origin.

Collectively, our results demonstrate that manipulation of the binary vector ORI greatly expands the dynamic range of T-DNA delivery to target cells, creating an additional orthogonal variable to regulate transient expression that is independent of promoter and Agrobacterium strain selection. All ORIs screened had mutants with variable expression levels that significantly increased or decreased GFP output relative to WT levels, indicating that the selection of ORI variants can be used to dynamically tailor transient expression levels up to 28-fold for a given ORI. The ability for mutants to increase transient expression in an additional strain of A. tumefaciens was also validated using the strain GV3101 and RK2. Within N. benthamianamutants that were assayed within GV3101 were found to increase or decrease transient expression levels in the same manner as the EHA105 strain (Supplementary Fig. 6). The fold-change difference for EHA105, however, was found to be larger than for GV3101, perhaps because of the hypervirulent nature of EHA105. Superior mutants were also tested in stable transformation systems using GV3101 for A. thaliana and EHA105 for R. toruloideswith both improving stable transformation, as discussed later in the manuscript. This demonstrates the utility of these mutants in two commonly used strains of A. tumefaciens. Additionally, top-performing mutants, along with their WT form, were assayed in EHA105 in another transient expression system, Lettuce sativa (Supplementary Fig. 6), and superior mutants increased GFP output for RK2 and pSa. Taken together, these results demonstrate that engineering the binary vector ORI can enable user-controlled manipulation of T-DNA transfer, refining our control over AMT in a transient system.

Growth rate and copy number impact on AMT is origin specific

To determine the relationship between binary vector copy number and N. benthamiana transient expression efficiency, we used digital PCR (dPCR) to quantify the plasmid copy number of each construct (Supplementary Fig. 7 and Supplementary Table 1). While growing the cultures for copy number quantification, we observed that some samples had notably different growth rates compared to the WT ORIs. Thus, we also quantified the growth rates of A. tumefaciens harboring WT or mutant ORIs in a time-course plate growth assay in both Luria–Bertani (LB) and a MOPS minimal medium + glucose (Supplementary Figs. 7 and 8). While previous work postulated that increased binary vector copy numbers would correspond to higher AMT efficiencies because of increased T-DNA delivery to target cells¹⁷our results demonstrate a more complex relationship among binary vector copy number, strain growth rate and AMT-mediated GFP output (Fig. 4).

Technology tamfitronics figure 4 — **Fig. 4: Relationship among copy number, *Agrobacterium* growth rate and transient transformation.**

Of the ORIs tested in this study, pSa showed the most direct relationship between plasmid copy number and GFP output. The WT copy number of 4.5 increased to 49 copies per cell and no plateau was observed for the regression line between copy number and GFP output, suggesting that the copy number can be increased further to enhance AMT performance. No significant relationship was observed between copy number and growth rate or between growth rate and GFP output for pSa, making copy number the only measured variable of importance.

Unlike the trend observed for pSa, RK2 exhibited an increase in GFP output with higher copy numbers; however, beyond an optimal level, excessively high copy numbers led to a decline in performance. The WT RK2 copy number of 1.2 increased to a maximum of 18 copies per cell in the I305K mutant but this construct showed a notably lower AMT performance. Intermediate copy numbers (that is, 5–15 copies) showed the best results for RK2, making the optimal copy number for this ORI lower than that of pSa, which demonstrates ORI-specific impacts on AMT that fall outside of copy number alone. The growth rates of RK2 mutants with higher copy numbers exceeded the WT growth rate until around seven copies per cell, after which the growth rates declined. There was a direct correlation between growth rate and GFP output, suggesting a tight relationship among these three variables for this ORI. The measured RK2 WT copy number of 1.2 was substantially lower than the value of 7–10 previously reported in the literature^29,30. We, thus, conducted a verification experiment to determine the WT copy number using three optical densities (ODs; 0.25, 0.5 and stationary phase) and two extraction kits, and we produced a consistent result of a WT copy number of around one per cell. (Supplementary Fig. 9). As the RK2 ORI has undergone progressive truncations over the decades to remove components that are not absolutely essential for plasmid replication, our quantification pertains to the mini-RK2 variant consisting of only the oriV and the trfA repA ORF²⁸.

In a similar manner to RK2, increased copy numbers of pVS1 increased GFP output until an optimal number that, if exceeded, saw declines in transient GFP expression. The WT copy number of 9.5 increased to a maximum of 66 copies and copy numbers between 30 and 40 produced the highest GFP outputs. There was a direct negative relationship between copy number and growth rate with high-copy-number mutants growing more slowly on average than the WT ORI. Unlike RK2, however, there was no observed relationship between growth rate and AMT performance, and slower-growing mutants outperformed their WT counterparts.

BBR1 was the only ORI that did not follow our hypothesis, and no relationship was found between plasmid copy number and GFP output. The WT form of BBR1 replicates at a rather high 52 copies per cell in Agrobacterium and further increasing this value saw no benefit to AMT performance. Rather than copy number, growth rate was the only measured variable of importance, with faster-growing mutants tending to outperform their slower-growing counterparts. Interestingly, plasmid copy number had no relationship with growth rate, implying that the mutants grew at different rates independent of the copy number of the plasmid. The top-performing mutant, E182V, had a copy number of eight, demonstrating that lower copy numbers can potentially outperform the WT form for BBR1.

We also sought to understand whether mutation fold-change enrichment in our initial screen could predict downstream AMT performance. When we assessed the Pearson’s correlation of mutation enrichment with tobacco GFP, growth rate and copy number, only weak correlations between GFP and enrichment and between copy number and enrichment were found to be significant for the pSa origin alone, suggesting that the fold-change enrichment within the checkerboard selection assay is not predictive of any downstream character (Supplementary Fig. 10).

Taken as a whole, our results demonstrate that increased copy numbers improve AMT efficiencies for three of the four ORIs screened. Excessively high copy numbers likely impose some form of a metabolic burden on a cell as previously described²²resulting in deleterious effects for pVS1 and RK2 that would likely emerge for higher copies of pSa. Growth rate of the host bacteria is also an important variable for some but not all ORIs and is the only measured variable of importance for BBR1. The intrinsic differences between ORIs and their interaction with host machinery in vivo likely contribute to the unique relationships observed for each ORI. As the four RepA proteins mutagenized in this study are nonhomologous and likely influence the dynamics of plasmid replication differently from one another, we are not surprised by the distinct relationships among copy number, growth rate and GFP output observed. Our findings demonstrate that there are no unifying principles that can explain how different ORI copy numbers correlate with AMT efficiency, highlighting the need for empirical, in vivo testing of each mutant variant to evaluate what is an otherwise unpredictable impact on AMT performance.

Mutant ORIs improve plant and fungal stable transformation

To evaluate whether the results from the transient tobacco screen could be translated into stable transformation systems for both plants and fungi, stable transformation experiments were conducted for A. thaliana and R. toruloides. As pVS1 typically achieves the highest transformation efficiency of commonly used ORIs, we selected three mutants and the WT form of pVS1 to determine whether this highly efficient ORI could be improved. Additionally, the top mutant and WT of RK2 and pSa were selected to demonstrate potential improvement in other ORIs. For the A. thaliana transformation, 20 plants were floral-dipped per construct and the seeds were bulked into a single stock. A seed weight corresponding to ~26,500 seeds was selected on kanamycin medium and assayed for total plant recovery. The pVS1 R106H mutant increased the A. thaliana stable transformation efficiency by 60% over WT pVS1 (Fig. 5). RK2 and pSa had more dramatic increases at 2,800% and 280%, respectively, but with lower total transformation efficiencies than pVS1. The second-highest-expressing and third-highest-expressing pVS1 mutants were also tested and found to increase stable transformation by 35% and 24% (Supplementary Fig. 11).

Technology tamfitronics figure 5 — **Fig. 5: Binary vector origin mutants improve stable transformation of both plants and fungi.**

To further validate this result, a construct delivering a T-DNA conferring hygromycin resistance along with a constitutively expressed Ruby reporter was used to floral-dip 20 additional plants for the WT and R106H forms of pVS1. From an additional ~25,000 seeds, the R106H mutant improved the transformation efficiency in this experiment by 103% over WT and the red pigmentation of the plantlets confirmed their status as transgenic (Supplementary Fig. 12).

For the oleaginous yeast R. toruloidesthe top constructs from pVS1 and RK2 were used alongside their WT counterparts. Both mutants improved the stable transformation efficiency by 390% (pVS1) and 510% (RK2) compared to their WT ORIs. Taken together, these results demonstrate that mutants with increased T-DNA delivery to target cells, as identified in the N. benthamiana screen, also have increased stable transformation efficiencies, making N. benthamiana a viable platform for rapidly screening ORI mutant impact on AMT.

Discussion

Using a directed evolution selection assay, this study created and characterized a large library of binary vector copy number variants, surpassing variant levels found for these ORIs in any other bacterial species. As the four ORIs engineered are from unique incompatibility groups, these variants will likely be useful for various prokaryotic engineering efforts. RepA proteins from these ORIs are distinct in their form, regulation and function within the host; therefore, the growth-coupled mutational pipeline is likely amenable to any ORI that replicates with a RepA protein, creating a high-throughput tool to engineer numerous broad-host-range and narrow-host-range plasmids. As plasmid copy number can dramatically impact genetic engineering in bacteria, the libraries of characterized copy number mutants will likely serve to enable more precise genetic control in nonmodel bacteria where no such variants existed previously.

The copy number libraries created in this study enable us to study the impact of binary vector copy number on AMT at a resolution not previously possible. We demonstrated that single SNPs in a binary vector backbone can vary transient expression levels by 28-fold for a given ORI within tobacco, demonstrating a large gradient of T-DNA delivery to target cells. We demonstrated that the results from this tobacco transient expression screen can be translated to improvements in stable transformation efficiency in both plants and fungi. Thus, engineering single SNPs into the binary vector backbone can notably improve AMT efficiency, presenting an easily deployable method for enhancing downstream transformation.

For such t ransformations, single T-DNA insertions are often desired and previous studies determined that different ORIs that replicate at varying copy numbers alter the average number of T-DNA insertions per cell¹⁸. This library should enable a user to titrate variable amounts of T-DNA into a cell to find the ideal copy number for both optimized transformation efficiency and desired insertion number. Future work should focus on characterizing the large range of copy numbers (nearly two orders of magnitude) to determine the relationship among binary vector copy number, stable transformation efficiency and quality event generation.

From a gene-editing perspective, the number of repair templates delivered to a cell is thought to limit the efficiency of homology-directed repair (HDR)^35,36. Transient delivery of increased levels of repair template using a higher-output mutant along with Cas9 or another targeted nuclease may enhance HDR efficiencies, and higher-copy-number mutants from each ORI can be used as tools to evaluate this. Further extensions of this work through stacking individual mutations and exploring different amino acid substitutions at enriched residues should enable an even greater range of copy numbers and AMT performances, providing a route for further optimization. This is particularly true for pSa, which did not have an observed plateau for GFP output with higher copy numbers, suggesting that this number can be raised even further to improve AMT. Taken as a whole, we have created a toolkit of copy number variants derived from single beet SNPs that can successfully tailor plant transient expression levels and improve stable transformation in plant and fungal systems, expanding and refining our control over the critical process of AMT.

Methods

Media, chemicals and culture conditions

Routine bacterial cultures were grown in LB Miller medium (BD Biosciences). E. coli was grown at 37 °C and A. tumefaciens was grown at 28 °C shaking at 200 rpm unless otherwise noted. Cultures were supplemented with kanamycin (50 mg L⁻¹; Sigma-Aldrich), gentamicin (30 mg L⁻¹; Thermo Fisher Scientific), spectinomycin (100 mg L⁻¹; Sigma-Aldrich) or rifampicin (100 mg L⁻¹; Teknova) when indicated. All other compounds unless otherwise specified were purchased through Sigma-Aldrich.

Strains and plasmids

All bacterial strains and plasmids used in this work are listed in Supplementary Table 2. All strains and plasmids created in this work are viewable through the public instance of the Joint BioEnergy Institute (JBEI) registry (https://public-registry.jbei.org/folders/847). All strains and plasmids created in this work can be requested from the strain archivist at JBEI with a signed material transfer agreement. All plasmids generated in this paper were designed using Device Editor, which is part of the DIVA suite version 6.1.2 and Open Vector Editor software version 18.3.6, while all primers used for the construction of plasmids were designed using j5 software version 3.4.0 (refs. ^37,38,39). Plasmids were assembled by Gibson assembly using standard protocols⁴⁰Golden Gate assembly using standard protocols⁴¹ or restriction digest followed by ligation with T4 ligase⁴². Plasmids were routinely isolated using the QIAprep spin miniprep kit (Qiagen) and all primers were purchased from Integrated DNA Technologies (IDT). Plasmid sequences were verified using whole-plasmid sequencing (Primordium Labs). Agrobacterium was routinely transformed by electroporation as described previously using a 1-mm cuvette and a 2.4-kV, 25-μF, 200-Ω pulse⁴³.

Mutating RepA proteins using epPCR

For each origin used in this study, the RepA or RepA-like protein was randomly mutagenized using epPCR⁴⁴. Briefly, the RepA ORFs were amplified with a high-fidelity Phusion polymerase (New England Biolabs (NEB), M0530S) using primers to add BsaI restriction sites to the 5′ and 3′ ends of the product. Bands of the proper size were gel-purified (Zymo Research, D4007) and the resulting product was used as a template. An epPCR master mix was made, containing MnCl₂ and unequal nucleotide ratios (10 mM TrisCl pH 8.3, 50 mM KCl, 7 mM MgCl₂1 mM deoxycytidine triphosphate, 1 mM deoxythymidine triphosphate, 0.2 mM deoxyadenosine triphosphate, 0.2 mM deoxyguanosine triphosphate, 0.5 mM MnCl₂1 μM forward and reverse primers, 20 pg μl⁻¹ purified RepA template and 1 μl of Taq polymerase (Thermo Fisher Scientific, EP0401) per 50-μl reaction). For each reaction, a 12-cycle amplification was run to constrain the mutation number in each product strand to around 1–3 mutations. The appropriate bands were excised and gel-extracted and the resulting product was digested with BsaI and cleaned by column purification (NEB, T1030S)

Constructing mutant libraries

To make selection vectors, a gentamicin resistance gene (gentamicin-3-acetyltransferase) driven by a salicylic-acid-inducible promoter was originally cloned into pGingerBS-NahR⁴⁵ and then variants were created for each of the pVS1, pSa and RK2 origins using a Gibson-like assembly with NEBuilder HiFi DNA assembly (NEB, E2621L). The entire vector except the RepA protein was then amplified in a Phusion PCR reaction with primers containing PaqCI restriction overhangs designed to have complementary sticky ends with the epPCR products from above. These PCR products were gel-purified and digested with PaqCI before being column-purified and ligated with the digestion product from the epPCRs using T7 DNA ligase (NEB, M0318S).

After 30 min of room temperature incubation, the ligation reaction was column-purified and 1 μl was electroporated into E. coli per reaction (NEB, C3020K). Following a 1-h recovery in the supplied medium, the electroporated samples were placed into 50 ml of LB + spectinomycin. A 1:100 dilution was made for each flask and was plated onto solidified LB + spectinomycin to allow for an estimate of library size. Around five flasks were prepared for each origin to create a library size of at least 100,000 independent transformants. After an overnight growth at 37 °C shaking at 200 rpm, 5-ml aliquots from each flask were miniprepped (Qiagen, 27104) and all samples from each origin were combined into single tubes, comprising the mutant vector library of ~100,000 mutants.

Next, 1 μl of these libraries per reaction were then electroporated into A. tumefaciens C58C1 electrocompetent cells. Using the same method as above with 50-ml recovery flasks, cells were recovered at 30 °C overnight shaking at 200 rpm and enough flasks were prepared to obtain a library size of around 150,000 mutants. Then, 5 ml of cells from each flask were combined and made into glycerol stocks for use in the higher-copy-number selection screen.

Selecting higher-copy-number mutants from constructed libraries

A checkerboard assay was used to select for higher-copy-number variants. The mutant libraries described above for each ORI contain A. tumefaciens C58C1 cells that each harbor a selection vector comprised of a constitutively expressed spectinomycin resistance gene, a salicylic-acid-inducible gentamicin resistance gene and the ORI of interest with a beet gene that was subjected to epPCR to induce random mutations. A 1-ml glycerol stock of each of the mutant libraries was grown in 50 ml of LB + spectinomycin (50 mg L⁻¹) overnight at 30 °C shaking at 200 rpm along with a culture of the WT strain for each ORI. Spectinomycin was added to maintain the plasmid within the population during this overnight growth before the actual selection began with gentamicin.

For each ORI, 100 ml of LB + spectinomycin was inoculated with saturated culture from the mutant library or WT control in a 1:200 ratio and 500 μl of this solution was added to each well of a 96-well block. Gentamicin was then added to all wells in the block following a serial dilution row-wise from rows A to H with a dilution factor of 2, starting from 3,000 mg L⁻¹ and ending with 23.4 mg L⁻¹. Salicylic acid was then added to all wells in the plate following a serial dilution column-wise from columns 1 to 12 with a dilution factor of 2, starting from 5 μM in column 12 and ending with 2.44 nM in column 1. This created 96 unique selection conditions within the block corresponding to different gentamicin and salicylic acid concentrations, enabling selection of the population over a large dynamic range of selection conditions.

The resulting 96-well blocks were then covered in vent film and incubated overnight at 30 °C shaking at 200 rpm. Growth was determined by calculating the OD₆₀₀ of each well the following day using a Genesys 50 (Thermo Fisher Scientific) and the mutant population was compared to its WT counterpart to determine gentamicin–salicylic acid combinations that were lethal to the WT ORI but that permitted growth of the mutant population. Two selection conditions that were WT lethal per ORI were chosen and grown in triplicate in 10-ml cultures of LB + spectinomycin + gentamicin + salicylic acid inoculated with a fresh glycerol stock of the mutant population. A culture containing just WT plasmid was also prepared for each selection condition as a negative control. Additionally, a triplicate of cultures containing the mutant library without gentamicin + salicylic acid selection were grown as an unselected control.

The following selection conditions were used: pVS1, 750 mg L⁻¹ gentamicin + 5 μM salicylic acid and 375 mg L⁻¹ gentamicin + 156 nM salicylic acid; pSa, 1,500 mg L⁻¹ gentamicin + 5 μM salicylic acid and 750 mg L⁻¹ gentamicin + 625 nM salicylic acid; BBR1, 2,250 mg L⁻¹ gentamicin + 5 μM salicylic acid and 1,000 mg L⁻¹ gentamicin + 78 nM salicylic acid; RK2, 1,500 mg L⁻¹ gentamicin + 5 μM salicylic acid. The overnight cultures were grown at 30 °C shaking at 200 rpm and plasmid-prepped according to a modified Qiagen protocol (https://www.qiagen.com/us/resources/resourcedetail?id=95083ccb-9583-489e-b215-99bd91c0604e&lang=en).

These samples were then used for the following Tagmentation procedure to barcode the RepA ORFs to allow for identification of enriched mutations within the selected population relative to the unselected control.

Sequencing mutant library survivors to identify enriched RepA mutations

The extracted plasmids from the selected populations were sequenced using an Illumina MiSeq to identify enriched mutations that may have contributed to their survival in WT-lethal conditions. The entire RepA ORF + 150 bp upstream and downstream was PCR-amplified and gel-purified. Purified samples were then fragmented into random 600-bp pieces and barcoded using the Illumina bead-linked transposon Tagmentation kit (Illumina, 20060059) according to the protocol provided by the manufacturer. The concentration of the barcoded fragments was then measured using a Qubit fluorometer with the Qubit dsDNA HS assay kit (Invitrogen). Library size analysis was conducted using a Bioanalyzer (Agilent Technologies). To enhance the quality of Illumina reads for regions with low variations in the conserved PCR amplicons, 20% PhiX Control v3 DNA (Illumina, FC-110-3001) was added to the library. The libraries were evenly pooled and the library, along with the PhiX mixture at a concentration of 18 pM, was loaded onto the MiSeq platform using the MiSeq Reagent Kit v3 (paired-end, 2 × 300-bp reads; Illumina).

Mapping of Illumina reads was performed as described previously⁴⁶. Briefly, quality control for the reads from this run was performed using FastQC version 0.11.8 and MultiQC version 1.10.1. Read trimming was performed using trimmomatic version 0.39 (ref. ⁴⁷). Trimmed reads were aligned to their corresponding reference file using Bowtie version 2.4.5 (ref. ⁴⁸). Mutations in sequencing reads were identified using pysamstats version 1.1.2 and pysam version 0.18.0. All code from this work is publicly available on GitHub (https://github.com/shih-lab/origin_story) and all sequencing reads are available on the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA) under BioProject accession PRJNA1031697.

Building plant GFP expression vectors with enriched RepA mutations from the MiSeq data

To construct GFP expression vectors, promoters of variable strengths derived from the PCONS suite of constitutive plant promoters were used³⁴. Expression vectors containing a GFP construct driven by the constitutive plant promoters pCM2 (for RK2 and pSa) or pCL2 (pVS1 and BBR1) along with a 35S::KanR component for stable-line selection were made using Gibson assembly. RK2 was run as a pilot for this project using the stronger pCM2 promoter to drive GFP. Because of the high GFP output of the WT forms of pVS1 and BBR1, a very large fold-change increase in GFP expression could potentially saturate the GFP channel of the plate reader; as such, the weaker pCL2 was used for these origins. As pSa had a weaker starting WT expression, pCM2 was used for this origin. To demonstrate the performance of the highest-performing mutants expressed from the same promoter for all origins, an additional set of vectors were cloned with pCM2::GFP for pVS1 and BBR1 (Supplementary Fig. 6). For the Ruby vector described in Supplementary Fig. 12, the strongly expressing pCH5 promoter was used to drive the Ruby reporter⁴⁹.

Single SNPs were introduced into the RepA coding sequence using PCR with the SNP of interest designed into a primer overhang. Proper PCR band lengths were verified in a high-throughput manner using an Agilent ZAG (zero-gel electrophoresis). PCR products were column-purified and assembled using NEBuilder HiFi DNA assembly (NEB, E2621L). The resulting assemblies were transformed by heat shock into XL1-Blue E. coli and grown overnight at 37 °C on solidified LB + kanamycin plates. Single colonies were selected, grown in a 4 ml of LB + kanamycin liquid culture and miniprepped; then, the integrity of the plasmid sequence including the proper SNP was verified using whole-plasmid sequencing from Primordium Labs (https://www.primordiumlabs.com/). Verified plasmids were then transformed into the EHA105 strain of A. tumefaciens by electroporation and selected on plates of solidified LB + kanamycin (50 mg L⁻¹) and rifampicin (100 mg L⁻¹) at 28 °C shaking at 200 rpm.

Screening origin mutants in N. benthamiana

A previously established method for transiently expressing genes in N. benthamiana was used to screen the impact of the mutant ORIs in planta⁵⁰. Single colonies of transformed EHA105 were selected and used to inoculate 5 ml of LB + kanamycin and rifampicin for overnight growth at 28 °C shaking at 200 rpm. The following morning, 10 ml of fresh LB + kanamycin + rifampicin was added to each vial and the cultures were grown for 2 h. An aliquot from each culture was taken and used to measure the OD₆₀₀. Then, 10 ml of each culture was centrifuged at 3,200g for 15 min. The supernatant was removed and cells were diluted to an OD₆₀₀ of 1.0 using an appropriate volume of tobacco infiltration buffer (10 mM MgCl₂10 mM MES and 200 μM acetosyringone (added fresh), pH 5.7). Resuspended cultures were allowed to induce for 2 h at room temperature while gently rocking.

N. benthamiana was grown at 25 °C under long-day conditions (16 h of light, 8 h of darkness) of 150 μmol m⁻² s⁻¹ photosynthetically active radiation (PAR; wavelength: 400–700 nm). Sunshine no. 4 growing mixture supplemented with Osmocote was used as the planting medium. For each origin mutant, six 4-week old N. benthamiana plants were syringe-infiltrated on the fourth and fifth leaves from the top of the plant. Infiltrated plants were watered and returned to the growth room for 3 days. After this 3-day period, four leaf discs of infiltrated tissue per leaf were taken using a standard 6-mm hole puncher and these discs were placed abaxial side up into a 96-well culture plate filled with 320 μl of water such that the discs were floating on the surface. These 96-well plates were then assayed for GFP fluorescence using a BioTek Synergy H1 microplate reader set for conditions of 483-nm excitation and 512-nm emission. GFP fluorescence readings for each disc were analyzed in R to compare mutant performances compared to the WT origins.

L. sativa transient expression assays

Buttercrunch lettuce plants were grown in 18-cell flats at 25 °C under long-day conditions of 150 μmol m⁻² s⁻¹ PAR using the same Sunshine no. 4 + Osmocote planting medium. Then, 5-week-old plants were infiltrated with a blend of two EHA105 strains consisting of the same GFP binary vector strains as the tobacco experiments and a strain harboring a binary vector for the P19 gene. P19 is a viral gene that suppresses plant gene silencing and coinfiltration of this construct aids in transient expression, particularly in more recalcitrant backgrounds⁵¹. These two strains were grown overnight at 28 °C shaking at 200 rpm and were brought to an OD₆₀₀ of 1.0 and mixed in a 1:1 ratio before inducing in the tobacco infiltration medium mentioned above for 2 h. The fifth leaf of each plant was infiltrated and a 4-day incubation period at 25 °C was used before isolating leaf discs for GFP quantification.

Copy number quantification

Plasmid copy number was determined using nanoplate dPCR. A QIAcuity EvaGreen PCR kit (Qiagen, 250111) was used for all dPCR reactions. Samples were transferred into an 8,500-partition 96-well QIAcuity nanoplate and loaded into a QIAcuity One system. dPCR reactions were conducted with a standard protocol (2 min at 95 °C followed by 40 cycles of 15 s at 95 °C, 15 s at 56 °C and 15 s at 72 °C). The nanoplate was imaged with an exposure duration of 150 ms and gain of 2. Images were analyzed with the QIAcuity Software Suite. The concentration of plasmids were calculated using Poisson statistical methods by the QIAcuity Software Suite. For each biological replicate, two dPCR reactions were run, one targeting the single-copy rpoB gene found within A. tumefaciens C58 and one targeting the kanamycin resistance (KanR) gene located on the binary vector. The final copy number was determined as the ratio of KanR to rpoB copies, which served as a measurement of the number of plasmid counts to genomic counts respectively. The following primers were ordered from IDT and used for the dPCR reactions: KanR forward, GATCATCCTGATCGACAAGACCGG; KanR reverse, CTGCCGAGAAAGTATCCATCATGGC; rpoB forward, GAGTACCGGAATCTCGTCAAAGCC; rpoB reverse, CGAAGATCTCTACGGCAACTACCTGG.

Growth rate quantification

The growth rates of all WT and mutant strains were evaluated in both a rich LB medium and a minimal salts medium (MOPS minimal + 10 mM glucose)⁵². For each, a glycerol stock was used to inoculate a 10-ml culture of LB that was grown overnight at 28 °C shaking at 200 rpm. This saturated culture was spun down and resuspended in either MOPS minimal salts + 10 mM glucose or LB medium at a concentration that was 1:200 of the original saturated stock. Next, 150 μl of this bacterial culture was added to each well of a 96-well clear plate in quadruplicate (Falcon, 353072) and the cultures were grown shaking linearly at 1,000 cpm at 28 °C in a BioTek Synergy H1 plate reader. The OD₆₀₀ was measured every 6 min for 24 h; using the data from the plate reader, the growth rate for each culture was calculated using the GrowthCurver package in R. The growth rate in this study is reported as doublings per hour, which is 60 divided by the doubling time.

A. thaliana stable transformation

A. thaliana was transformed using the floral dip method of transformation as previously described⁵³. Arabidopsis seeds were sprinkled over the surface of wet Sunshine no. 4 growing mixture and allowed to grow for 12 days. After this period, 18-pot flats were prepared with wet Sunshine no. 4 growing mixture with Osmocote and five Arabidopsis seedlings were transplanted per pot in an X pattern. Flats were grown at 22 °C under short-day conditions (8 h of light, 16 h of darkness) of 150 μmol m⁻² s⁻¹ PAR. After 3 weeks, the flats were moved to a 22 °C chamber with long-day lighting conditions of the same intensity. The first floral meristem from each plant was excised to promote axillary shoot growth. After three more weeks, all plants began flowering with multiple inflorescences and these were used for the floral dip procedure.

The top-performing origin, pVS1, and lower-expressing origins RK2 and pSa were selected for an analysis of impact of enhanced binary vector copy number on stable transformation. The same plasmids used in the tobacco screen were used for this experiment as they contained a 35S::KanR component that allows for the selection of T1 plants with kanamycin. These plasmids were electroporated into the A. tumefaciens strain GV3101 and selected on solidified plates of LB + rifampicin, kanamycin and gentamicin grown for 3 days at 30 °C. Then, 300-ml cultures of each strain were grown overnight at 30 °C shaking at 200 rpm in LB + rifampicin, kanamycin and gentamicin. These cultures were centrifuged for 20 min at 3,200g and the supernatant was discarded and replaced with 300 ml of floral dip medium consisting of 5% sucrose with 0.02% Silwet (50 g of sucrose and 200 μl of Silwet brought to 1 L with water).

Pots containing five A. thaliana plants were gently inverted and the inflorescences were submerged into the Agrobacterium solution for 15 s with light agitation. All pots dipped in the same construct were then laid sideways in an empty planting tray and were covered with a plastic lid that had been misted with water. Plants were left at room temperature overnight in this humidity chamber before being returned to an upright position in the 22 °C long-day chamber. Then, 1 week after the initial floral dip, this process was repeated with the same plants to further expose newly grown buds to the bacteria. For each construct, four pots containing a total of 20 plants were dip ped; following the second dip, a single stake was placed into one pot and the numerous inflorescences were gently tied together in a single mass. Next, 2 weeks after the second dip, a paper bag was used to cover the inflorescence heaps and the plants were moved to a drying rack to die and desiccate.

Following desiccation, seeds were sifted multiple times to separate them from any silique debris. Then, 200 mg of seeds per sample were measured and placed into tubes for the plating experiment. Seeds were then sterilized by first submerging in 70% ethanol for 1 min followed by shaking in a solution of 50% bleach from a 12.5% sodium hypochlorite stock with a drop of Triton-20 surfactant for 7.5 min before rinsing with sterile water three times. A moderately dense layer of seeds that were distributed in close but not overlapping proximity to each other was spread onto the surface of selection plates consisting of Murashige and Skoog salts + 1 g L⁻¹ MES brought to a pH of 5.7 along with 50 mg L⁻¹ kanamycin and 200 mg L⁻¹ timentin to kill residual A. tumefaciens and 8 g L⁻¹ agar for solidification. Each 200 mg of seeds approximately covered 6 5-inch plates. The plates were then wrapped in vent tape, placed into a long-day 22 °C chamber and allowed to grow for 3 weeks before quantification of successfully growing nonchlorotic plants.

R. toruloides transformation

To construct a R. toruloides expression vector, the plasmid JPUB_013523 (ref. ⁵⁴) was modified to contain the WT or mutant forms of the pVS1 or RK2 ORIs (R106H and S20F mutants, respectively). Origin variants from this base vector were created using Gibson assembly and were electroporated into the EHA105 strain of A. tumefaciens. Single colonies were used to inoculate LB + kanamycin and rifampicin for overnight growth at 28 °C shaking at 200 rpm. AMT of R. toruloides was conducted as previously described⁵⁵.

Selection was conducted on plates containing nourseothricin; for each transformation, plates with a 1:10 dilution along with a full-concentration plating of remaining cells were made. Plates were incubated at 30 °C for 2 days and then imaged using a AnalytikJena UVP GelSolo followed by colony count quantification using Fiji (https://imagej.net/software/fiji/downloads).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Next-generation sequencing data are publicly available from the NCBI SRA under BioProject accession PRJNA1031697. All plasmid sequences used in this study were published on the JBEI public repository (https://public-registry.jbei.org/folders/847). Source data are provided with this paper.

Code availability

All code used for next-generation sequencing data analysis can be found on GitHub (https://github.com/shih-lab/origin_story).

References

Nester, E. W. Agrobacterium: nature’s genetic engineer. Front. Plant Sci. 5730 (2014).

PubMed Google Scholar
Gelvin, S. B. Agrobacterium-mediated plant transformation: the biology behind the ‘gene-jockeying’ tool. Microbiol. Mol. Biol. Rev. 6716–37 (2003).

Article CAS PubMed PubMed Central Google Scholar
Thompson, M. G. et al. Agrobacterium tumefaciens: a bacterium primed for synthetic biology. Biodes. Res. 20208189219 (2020).

Article PubMed PubMed Central Google Scholar
Hwang, H.-H., Yu, M. & Lai, E.-M. Agrobacterium-mediated plant transformation: biology and applications. Arabidopsis Book 15e0186 (2017).

Article PubMed PubMed Central Google Scholar
de Groot, MJ, Bundock, P., Hooykaas, PJ & Beijersbergen, AG Agrobacterium tumefaciens-mediated transformation of filamentous fungi. Nat. Biotechnol. 16839–842 (1998).

Article PubMed Google Scholar
Kunik, T. et al. Genetic transformation of HeLa cells by Agrobacterium. Proc. Natl Acad. Sci. USA 981871–1876 (2001).

Article CAS PubMed PubMed Central Google Scholar
Zambryski, P. et al. Ti plasmid vector for the introduction of DNA into plant cells without alteration of their normal regeneration capacity. EMBO J. 22143–2150 (1983).

Article CAS PubMed PubMed Central Google Scholar
Lee, L.-Y. & Gelvin, S. B. T-DNA binary vectors and systems. Plant Physiol. 146325–332 (2008).

Article CAS PubMed PubMed Central Google Scholar
Dillen, W. et al. The effect of temperature on Agrobacterium tumefaciens-mediated gene transfer to plants. Plant J. 121459–1463 (1997).

Article CAS Google Scholar
Winans, S. C., Kerstetter, R. A. & Nester, E. W. Transcriptional regulation of the virA and VirG genes of Agrobacterium tumefaciens. J. Bacteriol. 1704047–4054 (1988).

Article CAS PubMed PubMed Central Google Scholar
Anand, A. et al. An improved ternary vector system for Agrobacterium-mediated rapid maize transformation. Plant Mol. Biol. 97187–200 (2018).

Article CAS PubMed PubMed Central Google Scholar
Thompson, M. et al. Genetically refactored Agrobacterium-mediated transformation. Preprint at bioRxiv https://doi.org/10.1101/2023.10.13.561914 (2023).
Komari, T. et al. Binary vectors and super-binary vectors. Methods Mol. Biol. 34315–41 (2006).

CAS PubMed Google Scholar
Altpeter, F. et al. Advancing crop transformation in the era of genome editing. Plant Cell 281510–1520 (2016).

CAS PubMed PubMed Central Google Scholar
Dong, M.-J., Luo, H. & Gao, F. DoriC 12.0: an updated database of replication origins in both complete and draft prokaryotic genomes. Nucleic Acids Res. 51D117–D120 (2023).

Article CAS PubMed Google Scholar
Thomas, C. M. et al. Annotation of plasmid genes. Plasmid 9161–67 (2017).

Article CAS PubMed Google Scholar
Zhi, L. et al. Effect of Agrobacterium strain and plasmid copy number on transformation frequency, event quality and usable event quality in an elite maize cultivar. Plant Cell Rep. 34745–754 (2015).

Article CAS PubMed Google Scholar
Oltmanns, H. et al. Generation of backbone-free, low transgene copy plants by launching T-DNA from the Agrobacterium chromosome. Plant Physiol. 1521158–1166 (2010).

Article CAS PubMed Google Scholar
Vaghchhipawala, Z. et al. Mutation of the RepB C-terminus of a pRi-repABC binary vector affects plasmid copy number in Agrobacterium and transgene copy number in plants. PLoS ONE 13e0200972 (2018).

Article PubMed PubMed Central Google Scholar
Tao, L., Jackson, R. E. & Cheng, Q. Directed evolution of copy number of a broad host range plasmid for metabolic engineering. Metab. Meadow. 710–17 (2005).

Article CAS PubMed Google Scholar
Thompson, M. G. et al. Isolation and characterization of novel mutations in the pSC101 origin that increase copy number. Sci. Rep. 81590 (2018).

Article PubMed PubMed Central Google Scholar
Rouches, M. V., Xu, Y., Cortes, L. B. G. & Lambert, G. A plasmid system with tunable copy number. Nat. Common. 133908 (2022).

Article CAS PubMed PubMed Central Google Scholar
Cook, T. B. et al. Genetic tools for reliable gene expression and recombineering in Pseudomonas putida. J. Ind. Microbiol. Biotechnol. 45517–527 (2018).

Article CAS&nbsp ; PubMed Google Scholar
Lee, T. S. et al. BglBrick vectors and datasheets: a synthetic biology platform for gene expression. J. Biol. Eng. 512 (2011).

Article CAS PubMed PubMed Central Google Scholar
Wadood, A., Dohmoto, M., Sugiura, S. & Yamaguchi, K. Characterization of copy number mutants of plasmid pSC101. J. Gen. Appl. Microbiol. 43309–316 (1997).

Article CAS PubMed Google Scholar
Chaillou, S. et al. Directed evolution of colE1 plasmid replication compatibility: a fast tractable tunable model for investigating biological orthogonality. Nucleic Acids Res. 509568–9579 (2022).

Article CAS PubMed PubMed Central Google Scholar
Itoh, Y., Soldati, L., Leisinger, T. & Haas, D. Low- and intermediate-copy-number cloning vectors based on the Pseudomonas plasmid pVS1. Antonie Van Leeuwenhoek 54567–573 (1988).

Article CAS PubMed Google Scholar
Murai, N. Review: plant binary vectors of Ti plasmid in Agrobacterium tumefaciens with a broad host-range replicon of pRK2, pRi, pSa or pVS1. Am. J. Plant Sci. 04932–939 (2013).

Article Google Scholar
Fang, F. C., Durland, R. H. & Helinski, D. R. Mutations in the gene encoding the replication-initiation protein of plasmid RK2 produce elevated copy numbers of RK2 derivatives in Escherichia coli and distantly related bacteria. Gene 1331–8 (1993).

Article CAS PubMed Google Scholar
Durland , RH , Toukdarian , A. , Fang , F. & Helinski , DR Mutations in the trfA replication gene of the broad-host-range plasmid RK2 result in elevated plasmid copy numbers. J. Bacteriol. 1723859–3867 (1990).

Article CAS PubMed PubMed Central Google Scholar
del Solar, G., Giraldo, R., Ruiz-Echevarría, MJ, Espinosa, M. & Díaz-Orejas, R. Replication and control of circular bacterial plasmids. Microbiol. Mol. Biol. Rev. 62434–464 (1998).

Article PubMed PubMed Central Google Scholar
Heeb, S. et al. Small, stable shuttle vectors based on the minimal pVS1 replicon for use in Gram-negative, plant-associated bacteria. Mol. Plant Microbe Interact. 13232–237 (2000).

Article CAS PubMed Google Scholar
Gasset-Rosa, F. et al. Negative regulation of pPS10 plasmid replication: origin pairing by zipping-up DNA-bound RepA monomers. Mol. Microbiol. 68560–572 (2008).

Article CAS PubMed Google Scholar
Zhou, A. et al. A suite of constitutive promoters for tuning gene expression in plants. ACS Synth. Biol. 121533–1545 (2023).

Article CAS PubMed Google Scholar
Hahn, F., Eisenhut, M., Mantegazza, O. & Weber, A. P. M. Homology-directed repair of a defective glabrous gene in Arabidopsis with Cas9-based gene targeting. Front. Plant Sci. 9424 (2018).

Article PubMed PubMed Central Google Scholar
Aird, E. J., Lovendahl, K. N., St Martin, A., Harris, R. S. & Gordon, W. R. Increasing Cas9-mediated homology-directed repair efficiency through covalent tethering of DNA repair template. Commun. Biol. 154 (2018).

Article PubMed PubMed Central Google Scholar
Ham, T. S. et al. Design, implementation and practice of JBEI-ICE: an open source biological part registry platform and tools. Nucleic Acids Res. 40e141 (2012).

Article PubMed PubMed Central Google Scholar
Chen, J., Densmore, D., Ham, TS, Keasling, JD & Hillson, NJ DeviceEditor visual biological CAD canvas. J. Biol. Eng. 61 (2012).

Article PubMed PubMed Central Google Scholar
Hillson, N. J., Rosengarten, R. D. & Keasling, J. D. j5 DNA assembly design automation software. ACS Synth. Biol. 114–21 (2012).

Article CAS PubMed Google Scholar
Gibson, D. G. et al. Enzymatic assembly of DNA molecules up to several hundred kilobases. Nat. Methods 6343–345 (2009).

Article CAS PubMed Google Scholar
Engler, C., Kandzia, R. & Marillonnet, S. A one pot, one step, precision cloning method with high throughput capability. PLoS ONE 3e3647 (2008).

Article PubMed PubMed Central Google Scholar
Green, M. R. & Sambrook, J. Molecular Cloning: A Laboratory Manual 4th edn, Vols.1–3 (Cold Spring Harbor Laboratory Press, 2012).
Kámán-Tóth, E., Pogány, M., Dankó, T., Szatmári, Á. & Bozsó, Z. A simplified and efficient Agrobacterium tumefaciens electroporation method. 3 Biotech 8148 (2018).

Article PubMed PubMed Central Google Scholar
Wilson, D. S. & Keefe, A. D. Random mutagenesis by PCR. Curr. Protoc. Mol. Biol. Chapter 8Unit 8.3 (2001).
Pearson, A. N. et al. The pGinger family of expression plasmids. Microbiol. Spectr. 11e0037323 (2023).

Article PubMed Google Scholar
Waldburger, L. et al. Transcriptome architecture of the three main lineages of agrobacteria. mSystems 8e0033323 (2023).

Article PubMed Google Scholar
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 302114–2120 (2014).

Article CAS PubMed PubMed Central Google Scholar
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9357–359 (2012).

Article CAS PubMed PubMed Central Google Scholar
He, Y., Zhang, T., Sun, H., Zhan, H. & Zhao, Y. A reporter for noninvasively monitoring gene expression and plant transformation. vegetable garden nothing 7152 (2020).

Article PubMed PubMed Central Google Scholar
Belcher, M. S. et al. Design of orthogonal regulatory systems for modulating gene expression in plants. Nat. Chem. Biol. 16857–865 (2020).

Article CAS PubMed Google Scholar
Jay, F., Brioudes, F. & Voinnet, O. A contemporary reassessment of the enhanced transient expression system based on the tombusviral silencing suppressor protein P19. Plant J. 113186–204 (2023).

Article CAS PubMed Google Scholar
LaBauve, A. E. & Wargo, M. J. Growth and laboratory maintenance of Pseudomonas aeruginosa. Curr. Protoc. Microbiol. Chapter 6Unit 6E.1 (2012).
Clough, S. J. & Bent, A. F. Floral dip: a simplified method for Agrobacterium-mediated transformation of Arabidopsis thaliana. Plant J. 16735–743 (1998).

Article CAS PubMed Google Scholar
Geiselman, G. M. et al. Conversion of poplar biomass into high-energy density tricyclic sesquiterpene jet fuel blendstocks. Microb. Cell Fact. 19208 (2020).

Article CAS PubMed PubMed Central Google Scholar
Zhang, S. et al. Engineering Rhodosporidium toruloides for increased lipid production. Biotechnol. Bioeng. 1131056–1066 (2016).

Article CAS PubMed Google Scholar

Download references

Acknowledgements

We would like to thank J. Dalton for excellent maintenance and management of our growth room and S. Sirirungruang for assistance with PyMOL. Figure cartoon representations were sourced from BioRender.com. M.G.T. is a Simons Foundation Awardee of the Life Sciences Research Foundation. L.M.W. and L.D.K. are funded through the National Science Foundation Graduate Research Fellowship. The lab of J.O.B. is supported through National Institutes of Health grant R01-GM145814 and M.B. is supported by US Department of Agriculture National Institute of Food and Agriculture postdoctoral fellowship GRANT13367270. This work was part of the JBEI (https://www.jbei.org) supported by the US Department of Energy (DOE), Office of Science, Office of Biological and Environmental Research, through contract DE-AC02-05CH11231 between Lawrence Berkeley National Laboratory and the DOE. The funders had no role in manuscript preparation or the decision to publish. The views and opinions of the authors expressed herein do not necessarily state or reflect those of the United States government or any agency thereof. Neither the United States government nor any agency thereof, nor any of their employees, makes any warranty, expressed or implied, assumes any legal liability or responsibility for the accuracy, completeness or usefulness of any information, apparatus, product or process disclosed or represents that its use would not infringe privately owned rights. The United States government retains and the publisher, by accepting the article for publication, acknowledges that the United States government retains a nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this manuscript or allow others to do so for United States government purposes. The DOE provides public access to these results of federally sponsored research in accordance with the DOE public access plan (http://energy.gov/downloads/doe-public-access-plan).

Author information

Authors and Affiliations

Joint BioEnergy Institute, Emeryville, CA, USA

Matthew J. Szarzanowicz, Lucas M. Waldburger, Liam D. Kirkpatrick, Rita C. Kuo, Joshua McCauley, Hamreet Pannu, Ruoming Cui, Shuying Liu, Nathan J. Hillson, Jay D. Keasling, John M. Gladden, Mitchell G. Thompson & Patrick M. Shih
Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA

Matthew J. Szarzanowicz, Lucas M. Waldburger, Liam D. Kirkpatrick, Hamreet Pannu, Ruoming Cui, Shuying Liu, Mitchell G. Thompson & Patrick M. Shih
Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, CA, USA

Matthew J. Szarzanowicz, Liam D. Kirkpatrick, Claudine Tahmin & Patrick M. Shih
Department of Bioengineering, University of California, Berkeley, Berkeley, CA, USA

Lucas M. Waldburger & Jay D. Keasling
Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA

Lucas M. Waldburger, Rita C. Kuo, Joshua McCauley, Nathan J. Hillson & Jay D. Keasling
Laboratory of Genetics, University of Wisconsin-Madison, Madison, WI, USA

Michael Busche & Jacob O. Brunkard
Sandia National Laboratories, Livermore, CA, USA

Gina M. Geiselman & John M. Gladden
Department of Chemistry, University of California, Davis, Davis, CA, USA

Alexander J. Kehl
Department of Chemical and Biomolecular Engineering, University of California, Berkeley, Berkeley, CA, USA

Jay D. Keasling
QB3, University of California, Berkeley, Berkeley, CA, USA

Jay D. Keasling
Center for Biosustainability, Danish Technical University, Kongens Lyngby, Denmark

Jay D. Keasling
Innovative Genomics Institute, Berkeley, CA, USA

Patrick M. Shih

Contributions

Conceptualization, M.G.T. and M.J.S.; methodology, M.J.S., M.G.T., L.D.K., L.M.W., M.A.B., R.C.K. and G.M.G.; investigation, M.J.S., M.G.T., L.D.K., G.M.G., L.M.W., M.A.B., R.C.K., A.K., C.T., H.P., R.C., S.L. and J.M.; writing—original draft, M.J.S.; writing—review and editing, all authors; resources and supervision, M.G.T., J.M.G., N.J.H., J.B., J.D.K. and P.M.S.

Corresponding authors

Correspondence to Mitchell G. Thompson or Patrick M. Shih.

Ethics declarations

Competing interests

A patent on optimal copy number binary vectors has been filed by Lawrence Berkeley National Laboratory with M.G.T., M.J.S., L.D.K., L.M.W. and P.M.S. as inventors. J.D.K. has financial interests in Amyris, Ansa Biotechnologies, Apertor Pharma, Berkeley Yeast, BioMia, Demetrix, Lygos, Napigen, ResVita Bio and Zero Acre Farms. N.J.H. has financial interests in TeselaGen Biotechnologies and Ansa Biotechnologies. The other authors declare no competing interests.

Peer review

Peer review information

Nature Biotechnology thanks Josh Cuperus and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Source data

Source Data Fig. 3

Raw data from N. benthamiana transient expression assay.

Source Data Fig. 4

Raw data for strain growth rate, plasmid copy number and mean GFP expression from N. benthamiana transient expression assay.

Source Data Fig. 5

Raw data for Rhodosporidium and Arabidopsis stable transformation assays.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Technology tamfitronics Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Szarzanowicz, M.J., Waldburger, L.M., Busche, M. et al. Binary vector copy number engineering improves Agrobacterium-mediated transformation. Nat Biotechnol (2024). https://doi.org/10.1038/s41587-024-02462-2

Are you over 18?

Access forbidden

Blog