Engineering of novel DNA polymerase variants for single enzyme quantitative multiplex reverse transcription-PCR

engineering-of-novel-dna-polymerase-variants-for-single-enzyme-quantitative-multiplex-reverse-transcription-pcr
Engineering of novel DNA polymerase variants for single enzyme quantitative multiplex reverse transcription-PCR

Introduction

The combination of PCR with reverse transcription (RT) laid the basis for various powerful methods in the research field of molecular biotechnology and clinical diagnostics1,2. RT-PCR is used to detect and quantify even low-abundant RNA molecules and is therefore the gold standard in gene expression analysis, generating cDNA libraries and the detection of viral or pathogen infections2,3,4,5,6. Moreover, its excellent potential is applied for both cancer diagnosis and profiling and was recently coupled to high throughput next-generation RNA sequencing (NGS) technologies7,8,9. In particular, during the global pandemic caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), RT-PCR gained strong popularity and the importance of improving established techniques became apparent10,11,12.

Classical approaches start with the RT from RNA to DNA catalysed by viral RTs, followed by DNA purification and DNA amplification2,13,14. Either fluorescent dyes that intercalate into the DNA product or fluorescently tagged hydrolysis probes (so called TaqMan probes) are commonly used to monitor the reaction15,16,17. To simplify the procedure, RT and the subsequent DNA amplification were combined in a single tube formulation18,19. This not only saves time, but also reduces the risk of contaminations and eliminates the need to check buffer systems for compatibility14,20. Furthermore, the usage of different primer sets with distinctly labelled fluorescence probes enabled the simultaneous targeting of two or even multiple RNA regions in one reaction (multiplexed RT-PCR)21,22,23,24. This method offers substantial advantages such as saving time, reduced consumption of sample material and lower costs25. This technique is successfully used to profile mRNA levels from multiple target genes26,27,28,29 detect several viral strains, subtypes or other pathogens simultaneously30,31 and monitor cancer-related biomarkers for personalised medicine approaches32,33. To further enhance the assay performance, the applied enzymes should meet specific requirements. These include high catalytic efficiency at elevated reaction temperatures, good fidelity, enzymatic stability after multiple thermocycling steps, and the ability to produce even longer cDNA products13. Taking these criteria into account, both RTs and DNA pols have been continuously engineered and improved over the last years:

Viral RTs from Moloney murine leukaemia virus (M-MLV) and avian myeloblastosis virus (AMV) are commonly used for cDNA synthesis13,14,34 along with those from HIV-1 and bacterial group II introns35,36. A key challenge is their limited thermostability, even when RT precedes the PCR cycle37. Higher temperatures allow for a more specific priming process, make complex RNA or DNA structures accessible, and inactivate disruptive factors such as RNases38,39. Technical advances now allow cDNA synthesis at 37–60°C13,40 with some HIV-1 RT variants functioning at up to 85°C41. However, RT-PCR still requires thermostable DNA pols for amplification, and there is evidence that RT and DNA pols may compete for template binding, potentially affecting reaction efficiency, particularly at low RNA input42.

As an alternative, highly thermostable DNA pols with improved RT activity have been developed43,44,45,46,47,48. Especially, Taq pol naturally has RT activity, which can be enhanced under certain reaction conditions49,50,51. However, its application is limited by two major challenges: replacement of Mg²⁺ by Mn²⁺ increases RT activity but reduces fidelity44,52 or its low RT efficiency requires either high RNA input or buffer optimisation49,53. Protein engineering has further improved the RT activity of Taq pol or its truncated form and other DNA pols, thus single enzyme real-time RT-PCR was finally demonstrated14,43,49,54,55,56,57,58,59,60,61. However, many existing approaches still rely on Mn²⁺ additives or enzyme fusions43,60 lack compatibility with hydrolysis probes54,55,57,58,59,61, or have not been demonstrated in multiplex applications49,56. To expand these capabilities, we aimed to develop a streamlined TaqMan probe-based RT-PCR system that allows multiplex RNA targeting and is as easy to implement.

This study reports the development and application of novel DNA pol variants that are able to catalyse RT and the DNA amplification simultaneously without the need of viral RTs, addition of Mn2+ ions or to be fused to other enzyme scaffolds. These enzymes derived from the DNA polymerase I of Thermus aquaticus (Taq), which is known for its thermostability, high processivity, and ability to digest TaqMan probes15,62,63,64. The enzymes were obtained by combinatorically investigating two independently discovered mutation pools that enhance the RT activity of the N-terminally truncated form of Taq pol termed KlenTaq (KTq) DNA polymerase57,61. Promising variants were selected after screening the mutant library for RT and PCR activity. We could show that the application of those enzymes in RT-PCR further simplifies the one-tube formulation by eliminating the need for viral RTs as a second enzyme. In this way, we were able to perform RT-PCR by using either SYBR green I as intercalating fluorescent dye or a TaqMan probe-based approach. Moreover, we were able to detect up to four RNA targets simultaneously with a detection limit of 20 copies. This method marks a significant advancement, enabling the simultaneous detection of four RNA targets through quadruplex RT-PCR, all achieved through the utilization of a single enzyme capable of catalysing both RT and DNA amplification.

Results

Screening for Taq pol variants with improved RT-PCR activity

Previous studies have identified two mutational pools from two previously engineered KTq pol variants: the RT-active KTq DNA polymerase (RT-KTq)57 and Mut_RT DNA polymerase (Mut_RT)61. RT-KTq carries 4 amino acid mutations namely L459M, S515R, I638F and M747K (Fig. 1A and C, amino acid side chains depicted in blue)57. The enzyme was obtained by a recombined DNA shuffling approach that combined seven distinct mutations previously reported to promote either RT or to expand the substrate spectrum55,65. After pre-selection for both PCR- and RT activity, RT-KTq showed up to 100-fold increased RT compared to the applied parental enzymes. Structural analysis of RT-KTq suggested that mutations such as L459M and S515R enhance thumb domain flexibility, while I638F and M747K may stabilize substrate binding through hydrophobic or electrostatic interactions57.

Fig. 1
figure 1

Crystal structures of RT-KTq/Mut_RT and Taq pol. (A) The overall crystal structure of RT-KTq and Mut_RT is shown as cartoon (adapted from PDB ID: 4BWM using PyMOL (Schrödinger, LLC, New York, NY)). Amino acid mutations specific for RT-KTq are coloured in blue and mutations specific for Mut_RT are coloured in green (side chains depicted as sticks). The KTq pol backbone is coloured in light grey. (B) The overall crystal structure of Taq pol is shown as cartoon (adapted from PDB ID: 1TAQ using PyMOL (Schrödinger, LLC, New York, NY)). The KTq pol backbone is coloured in light grey and the nuclease domain is depicted in dark grey. The amino acids that are changed to form the RT-Taq are coloured in blue. (C) Architecture of the KTq pol variants (top) and Taq pol (bottom). The positions of amino acid mutations are indicated in blue (RT-KTq, RT-Taq) and green (Mut_RT). (D) Combination strategy of the new Taq pol enzyme library. All eight amino acid mutations were combined to generate the wild-type Taq pol (without mutations), all single, double, triple and 4 − 8 fold mutants giving 256 Taq pol variants in total. The plasmid mix was transformed into E. coli cells, plated and 2660 single colonies were picked to form the library.

Full size image

In contrast, a combined rational and combinatorial enzyme library design revealed Mut_RT, carrying five mutations, namely N483K, E507K, K540Y, V586G and I614K (Fig. 1A and C, amino acid side chains depicted in green)61. This enzyme showed significantly increased activity on both RNA and DNA compared to the parental enzyme. Although the Mut_RT mutations were not structurally analysed, they were selected due to their proximity to the primer/template complex and their evolutionary conservation among family A DNA pols, suggesting a role in substrate interaction and catalytic efficiency61.

Taken together, RT-KTq and Mut_RT originated from two independently constructed enzyme libraries using different approaches. As consequence, two different pools of mutations were reported to successfully increase RT-activity. We hypothesized that recombining proficient mutations from both enzymes might result in an even more empowered RT-active DNA pol. In addition, since the full-length Taq DNA pol was shown to be more processive than its truncated version64,66 and insertion of RT-boosting mutations into the Taq pol backbone increased RT-activity compared to its respective KTq pol variant56 we used this scaffold. Thus, we introduced the 4 mutations of RT-KTq into the Taq pol backbone to form the RT-Taq (Fig. 1B and C bottom, amino acid side chains depicted in blue).

In preliminary experiments, RT-KTq and all 5 single mutants of the Mut_RT were tested individually to evaluate their contributions to enzyme performance. Not all mutation sites contributed equally: KTq pol K540Y showed neither primer extension nor RT-PCR activity (Supplementary Fig. S1, A, B, C, Supplementary Fig. S8, Supplementary Fig. S9 A). As consequence, this mutation was excluded from the following screening approach. However, the experiment demonstrated that the combination of different mutations in Mut_RT resulted in positive synergistic effects, which together improve the enzyme performance.

Based on these findings, a new enzyme library of Taq pol mutants was designed to recombine the differently evolved RT-enhancing mutation pools. The library included all possible combinations of L459M, N483K, E507K, S515R, V586G, I614K, I638F and M747K with the respective wild-type amino acids. Thus, the library comprised the wild-type Taq pol without mutations, as well as all single, double, triple and 4 − 8 fold mutants, resulting in a total of 256 Taq pol variants (Fig. 1D). In this way, the RT-Taq and the Taq pol variant bearing the Mut_RT specific mutations were included, as well. The genes of all DNA pol variants were synthesised in equimolar amounts, inserted into the pGDR11 vector, and used as a mixture. The plasmid mix was transformed in E. coli BL21 (DE3) cells and plated on agar plates. A total of 2660 single colonies were picked and arranged in a 348well format. Oversampling by a factor of more than 10 during colony picking ensured that more than 99% of all possible Taq pol variants were covered and included in the library67. Cell lysates from expression cultures were directly used after heat inactivation of E. coli host proteins.

The screen comprised a real-time RT-PCR approach that was previously established to detect the SARS-CoV-2 RNA in unprocessed swab samples10. Primers and TaqMan probes used in the study were similar to those applied in the FDA emergency-use authorized CDC (Centers for Disease Control and Prevention) diagnostic assay and target the nucleocapsid gene N168. After the precise and efficient PCR, a 72 bp DNA amplicon was generated. The SARS-CoV-2 RNA was ordered from the European Commission Joint Research Centre (CRM code: EURM-019) and included in vitro transcribed single-stranded RNA from the SARS-CoV-2. The formation of the cDNA could be followed by monitoring the increase in fluorescence signal since the hydrolysis probe is digested by the Taq pol during the ongoing PCR, leaving the unquenched FAM (6-carboxyfluorescein) dye in the reaction mix (Fig. 2A).

The reaction conditions were optimised for the use of expression lysates and the RNA template concentration of 1000 c/µL (copies/µL) was found to be sufficient to adequately monitor the enzyme performance (Supplementary Fig. S2). After screening 2660 cell lysates, promising Taq pol variants were selected based on lower Cq values than those of the RT-Taq and amplification curves following a sigmoidal shape (Fig. 2B).

Fig. 2
figure 2

Screening approach and selection for identifying Taq pol variants with increased RT-PCR activity (A) Principle of the screening assay. RT and amplification of the cDNA is catalysed by different Taq pol variants. The applied TaqMan probe is 5’- fluorescently labelled with FAM (6-carboxyfluorescein) and 3’- labelled with BHQ1 (Black Hole Quencher 1). Product formation can be followed by monitoring the increase in fluorescence signal as the probe is digested by the Taq pol variants and fluorescence is no longer quenched by BHQ1. (B) Selection of promising Taq pol variants are shown as an example for the first expression plate (amplification curve for lysate containing the RT-Taq depicted in red and amplification curve for lysates containing different Taq pol variants depicted in blue). Selection of promising variants were based on lower Cq values than those of the RT-Taq (red curve) and amplification curves following a sigmoidal shape. (C) Frequency of each mutation (L459M, N483K, E507K, S515R, V586G, I614K, I638F and M747K) is depicted in active vs. inactive Taq pol variants. Left: 15 different Taq pol variants were included in the group of active variants and right: 8 Taq pol variants were included in the group of inactive variants. (B) Mutation level (0-fold, 1-fold, 2-fold…) is plotted against their frequency. Left: 15 different Taq pol variants were included in the group of active variants and right: 8 Taq pol variants were included in the group of inactive variants.

Full size image

The best 379 variants were selected and corresponding cells were transferred to a new 384 deep well plate. Five positions on the plate were occupied by E. coli cells carrying the RT-Taq vector. The cell lysates were screened again by applying a RNA template dilution from 1000 − 1 c/µL and different forward primer to generate PCR products with increasing lengths (72, 100, 156, 208 and 254 bp). Finally, 20 Taq pol variants showed good performance in all reaction conditions (Supplementary Fig. S3) and 46 variants performed well in only 8 from 9 conditions. The expression levels were checked for uniformity by SDS-PAGE (Supplementary Fig. S4 A, Supplementary Fig. S9 B, C, Supplementary Fig. S10 A). In order to determine the mutation patterns of the 20 most promising Taq pol variants, plasmids from corresponding E. coli cells were isolated and sent for Sanger sequencing. 15 different DNA genes were found since some E. coli cells carried the same plasmid encoding for the same Taq pol variant (Table 1).

Table 1 Composition of the mutation sites found in the 20 most promising Taq pol variants. Corresponding (corres.) cell lysates are indicated. Mutation sites originate from RT-KTq are coloured in blue and mutation sites originate from mut_rt are coloured in green. Positions with mutations are highlighted in bold letters.

Full size table

Moreover, some plasmids from E. coli cells whose lysates showed no activity were also analysed by Sanger sequencing (Table 2). To ensure that lower activity was not due to low expression levels, only those lysates were selected that show protein expression upon SDS-PAGE analysis (Supplementary Fig. S4 B, Supplementary Fig. S10).

Table 2 Composition of the mutation sites found in inactive Taq pol variants. Composition of the mutation sites found in inactive Taq pol variants. Mutation sites originate from RT-KTq are coloured in blue and mutation sites originate from Mut_RT are coloured in green. Positions with mutations are highlighted in bold letters.

Full size table

When comparing the mutation composition of active and inactive Taq pol variants, it is striking that the mutation I638F only occurs in the inactive variants (Fig. 2C). I614K was the most common mutation in the active variants, while it was less present in the inactive ones. Interestingly, most of the well performing variants carried only three mutations in their amino acid sequence (Fig. 2D, left). However, within the group of inactive variants, there were some enzymes with a higher mutation load (Fig. 2D, right). These two aspects indicate that not only the composition of RT-promoting mutations, but also a moderate mutation load of about three mutations contributes to a well-performing enzyme.

Further characterization of promising Taq pol variants

For further studies, all 15 promising RT-active Taq pol variants were purified and subjected to a similar screen as described above for the study of the most promising E. coli cell lysates. For this purpose, we used a RNA template dilution from 1000 − 1 c/µL to further increase the stringency of the assay conditions. Furthermore, RT-PCR was performed with different forward primers to generate PCR products of increasing length (72, 100, 156, 208 and 254 bp). Of note, the same reaction conditions were selected for all approaches without any further optimisation, even if there was low input of RNA or longer amplicons had to be generated. Taq pol V2, V3, V7, V9, V11, V12, V13 and V15 showed the best amplification curves over all conditions (Fig. 3).

Fig. 3
figure 3

Amplification curves after RT-PCR from SARS-CoV-2 RNA catalysed by purified Taq pol variants. 30 nM enzyme, 100 nM Taq pol aptamer, 670 nM primer, 170 nM TaqMan probe and SARS-CoV-2 RNA (concentrations as indicated) were present in the reaction mix. Forward primer was varied to generate PCR products with different lengths (as indicated). The same mastermix was used for the control reaction (con.), but the amount of RNA template was replaced by Milli-Q water. The reactions with diluted RNA template were performed twice and reactions forming longer amplicons were conducted once.

Full size image

Product formation was observed even in more demanding reactions, such as those with very low RNA template input or the synthesis of longer amplicons. Similar to reactions performed with E. coli lysates, RT-Taq was not able to amplify from RNA templates with low concentrations and to form longer PCR products (Fig. 3, red curves).

Next, we investigated if the identified Taq pol variants tolerate SYBR green I in real-time PCR. Therefore, all 15 Taq pol variants were applied again in the RT-PCR assay using SARS-CoV-2 RNA with different combinations of reverse and forward primers. Similar to observations above Taq pol V2, V3, V7, V9, V11, V12, V13 and V15 showed the best amplification curves over all reactions (Fig. 4A). Furthermore, the experiment showed that the improved RT-PCR activity of the identified Taq pol variants is not only due to a higher nuclease activity, but most importantly due to an improved PCR activity.

Fig. 4
figure 4

Amplification curves after RT-PCR from SARS-CoV-2 RNA and from the artificial RNA used in preliminary studies. (A) 1000 c/µL SARS-CoV-2 RNA was used as template and different forward primers were applied to generate PCR products of different lengths (as indicated). 30 nM purified Taq pol variants (as indicated) and 100 nM Taq pol aptamer was used for catalysis. The reaction was monitored by SYBR green I intercalating into the forming cDNA. (B) 100 nM artificial RNA template was present in the reaction. 100 nM purified Taq pol variants (as indicated) and 100 nM Taq pol aptamer was used for catalysis. The reaction was monitored by SYBR green I intercalating into the forming cDNA. A)/B) For the control reaction (con.), the same mastermix was used but the amount of RNA template was replaced by Milli-Q water. The reactions from the SARS-CoV-2 RNA template were performed twice and reactions from the artificial RNA template were conducted three times.

Full size image

In order to examine whether the purified Taq pol variants can catalyse RT-PCR without a prior RT step, we performed amplification from an RNA oligonucleotide under time-limited conditions. Here, the cDNA was obtained by direct heating to 95°C and subsequent conducting a two-step amplification. For comparison, the RT-KTq, Mut_RT and RT-Taq and all 15 Taq pol variants were tested using the same reaction parameters as shown in Supplementary Fig. S1 B. The amplification curves measured for Taq pol V2, V3, V7, V9, V12 and V15 passed the threshold at the same time or earlier than RT-KTq, Mut_RT or RT-Taq (Fig. 4B). The highest activity was observed for Taq pol V2 and V3 starting the DNA amplification after a low number of cycles and generating a high fluorescence intensity. Indeed, this experiment demonstrated that the most effective Taq pol variants identified in the screening did not require an additional RT step prior to amplification. Considering all RT-PCR experiments with purified enzyme, Taq pol V2, V3, V12 and V15 showed the best performance in terms of low Cq values and amplification curves with high fluorescence intensities.

Taq pol variants are able to catalyse multiplexed RT-PCR from the SARS-CoV-2 RNA

Having established that Taq pol V2, V3, V12, V15 can faithfully catalyse RT-PCR from the SARS-CoV-2 RNA targeting the nCov_N1 gene, we investigated whether other targets could be detected using this approach. We performed RT-PCR using the primer and probe sets for the nCov_N2 and nCov_E genes that correspond to the FDA-approved CDC emergency diagnostic assay68. Initially, all gene targets were monitored individually in order to find a suitable reaction condition for all of them. Moreover, we have introduced the additional mutation I707L into the backbone of the Taq pol V2 (henceforth referred to as Taq pol V2 IL) to investigate whether this can further improve PCR activity. Barnes and colleagues had shown that I707L reduce the polymerase activity at lower temperatures, while retaining normal activity at the optimal reaction temperature of the KTq pol enzyme69. Taq pol V2 were selected as an example because of its excellent performance in previous experiments.

All variants of the Taq pol were able to catalyse the reaction from each SARS-CoV-2 gene under the same optimised reaction condition (Fig. 5A). Taq pol V15 is partially excluded here, as it did not show an increase in the fluorescence signal for the Texas Red dye and thus detection of the nCov_E gene was not successful. The parental enzyme RT-Taq was only able to amplify products from the nCov_N2 gene and to a much lesser extent from the nCov_N1 gene (Fig. 5A, red curves). Of note, detecting the nCov_E gene is more challenging than detecting the other targets because the other amplicons are shorter in length (72 bp for nCov_N1, 67 bp for nCov_N2 and 113 bp for nCov_E). This experiment benefited from the exceptional thermostability of the enzymes since RT was carried out at 65 °C.

Next, we challenged the Taq pol variants by gradually reducing the template concentration to determine the limit of detection. With this reaction setup, it should be noted that uniform reaction conditions were defined for all genes to allow simultaneous detection. Consequently, the conditions for nCoV_E were not individually optimised. Since the Taq pol V2 IL had previously shown an improved performance on the nCov_E gene compared to the Taq pol V2, this variant was considered further. The Cq values were plotted against the decadic logarithm of the template concentration and linear regression was applied (Fig. 5C, Supplementary Table S4). The parental RT-Taq pol was able to detect 100 copies of the SARS‑CoV‑2 template (5 c/µL) by monitoring the nCov_N2 gene. The dilution series was amplified with a PCR efficiency of 91%. Other targets could not be detected adequately (Fig. 5C, left). Taq pol V2, V3 and V2 IL showed the lowest detection limits with 20, 10 and 20 copies respectively (Fig. 5C, three plots right). All dilution series had a PCR efficiency above 90% except for the reaction from the nCov_E gene. These results render Taq pol V2, V3 and V2 IL as promising candidates.

In the next step, we investigated if the nCov_N1, nCov_N2 and the nCov_E gene could be monitored simultaneously in one reaction. The promising Taq pol variants V2, V3, V12, V15 and V2 IL were applied in multiplexed RT-PCR under similar conditions as in the singleplex reaction setup (Fig. 5B). At a template concentration of 10.000 c/µL, all DNA pols, except Taq pol V15 and the parental RT-Taq, successfully amplified all three genes. As in the previous experiments, the concentration of the SARS-CoV-2 template was gradually decreased, the corresponding Cq values were plotted against the decadic logarithm of the template concentration and linear regression was applied (Fig. 5D, Supplementary Table S4).

Fig. 5
figure 5

Amplification curves after RT-PCR from SARS-CoV-2 RNA in singleplex and multiplex reaction set-up. (A) 10.000 c/µL SARS-CoV-2 RNA was used as template and the genes nCoV_N1, N2 and E were analysed independently (as indicated). 120 nM purified Taq pol variants (as indicated) and 400 nM Taq pol aptamer was used for catalysis. 670 nM primer, 170 nM TaqMan probe (in case of nCoV_N1/N2) or 850 nM primer, 216 nM TaqMan probe (in case of nCov_E) were present in the reaction mix. Reactions were monitored by measuring the fluorescence from the 6-FAM dye (nCoV_N1), the Sun dye (nCoV_N2) and the Texas Red dye (nCov_E). (B) 10.000 c/µL SARS-CoV-2 RNA was used as template and the genes nCoV_N1, N2 and E were analysed simultaneously (triplex RT-PCR). The reaction conditions were similar to reactions described in A), but all primer sets were present in the same reaction. (C) SARS-CoV-2 RNA template was diluted stepwise and the genes nCoV_N1, N2 and E were analysed independently (singleplex RT-PCR). (D) SARS-CoV-2 RNA template was diluted stepwise and the genes nCoV_N1, N2 and E were analysed simultaneously (triplex RT-PCR). C)/D) The same reaction conditions were use as in A) or B). Respective average Cq values were plotted against the decadic logarithm of the template concentration. Linear regression was applied. Linear functions, R2 values and the PCR efficiencies are indicated in Supplementary Table S4. Error bars indicating the standard deviation of duplicates are shown in black. A)/B)/C)/D) The same mastermix was used for the control reaction (con.), but the amount of RNA template was replaced by water. Reactions were performed in duplicates.

Full size image

When monitoring the nCov_N2 gene, the parental RT-Taq showed a reduced sensitivity from 100 (singleplex) to 2000 copies (multiplex). PCR efficiency was determined to be 104%. However, Taq pol V2 and V2 IL were still able to detect 100 copies, while Taq pol V3 detected only 20 copies of RNA. Thus, the latter is 100 times more sensitive than the parental RT-Taq. The PCR efficiencies are consistently above 92%, with the exception of the reactions from the nCov_E gene (Supplementary Table S4). Overall, this demonstrates that Taq pol V2, V3 and V2 IL are able to catalyse RT-PCR in singleplex and multiplex reaction formulations and independently of the RNA target sequence.

We then tested whether the amplification from the human RNase P transcript could be used as an internal control reaction to verify the integrity of the patient samples in the future. For this purpose, RT-Taq, Taq pol V2, V3 and V2 IL were used in multiplexed RT-PCR targeting the SARS-CoV-2 nCov_N1, nCov_N2, nCov_E gene and the RNase P transcript simultaneously (quadruplex RT-PCR). The universal human reference RNA (Thermo Fisher Scientific) was used as template and the primer/probe sets from the FDA authorized CDC diagnostic assay were applied for catalysis68.

Fig. 6
figure 6

Amplification curves after multiplex RT-PCR from SARS-CoV-2 RNA and human RNase P transcript. (A) 10.000 c/µL SARS-CoV-2 RNA was used as template. 1 ng/µL universal human reference RNA was used to monitor amplification from the RNase P transcript. The genes nCoV_N1, N2 and E, as well as the RNase P transcript were analysed simultaneously (quadruplex RT-PCR). 120 nM purified Taq pol variants (as indicated) and 400 nM Taq pol aptamer was used for catalysis. 670 nM primer, 170 nM TaqMan probe (for nCoV_N1/N2, RNase P) and 850 nM primer, 216 nM TaqMan probe (for nCov_E) were present in the reaction mix. Reactions were monitored by measuring the fluorescence from the 6-FAM dye (nCoV_N1), the Sun dye (nCoV_N2), the Texas Red dye (nCov_E) and the Cy5 dye (RNase P). (B) SARS-CoV-2 RNA template was diluted stepwise while keeping the human reference RNA constant. The genes nCoV_N1, N2, E and RNase P transcript were analysed simultaneously (quadruplex RT-PCR). Respective average Cq values were plotted against the decadic logarithm of the template concentration. Linear regression was applied. Linear functions, R2 values and the PCR efficiencies are indicated in Supplementary Table S4. Error bars indicating the standard deviation of duplicates are shown in black. A)/B) The same mastermix was used for the control reaction (con.), but the amount of RNA template was replaced by water. Reactions were performed in duplicates.

Full size image

At the template concentrations of 10.000 c/µL of SARS-CoV-2 and 1 ng/µL human reference RNA, all DNA pols were able to successfully amplify the 4 target sequences (Fig. 6A). However, the parental RT-Taq was only able to catalyse the reaction from the nCov_N2 gene and the RNase P transcript (Fig. 6A, red curves). Of note, the reaction conditions for the quadruplex reaction did not need to be optimized again.

To further investigate the enzyme performance, we diluted the SARS-CoV-2 RNA to 500 c/µL in 4 steps, while keeping the human reference RNA constant at 1 ng/µL. The respective Cq values were plotted against the decadic logarithm of the template concentration and linear regression was applied (Fig. 6B, Supplementary Table S4). The best performance was observed for Taq pol V3, as all 4 RNA targets were successfully monitored at each dilution step and PCR efficiencies ranged from 93 to 104%. Furthermore, the Cq values for reactions from the RNase P transcript remained constant at approximately 13.9. The successful detection of SARS-CoV-2 observed in the triplex RT-PCR was therefore also achieved in the presence of a different RNA target, demonstrating the potential applicability of an internal standard in the reaction.

Taken together, we were able to demonstrate that the newly developed Taq pol variants can be successfully used in single- tube RT-PCR approaches without the need for a second enzyme. As the human reference RNA used in this study is a mixture of RNA extracts from 10 cell lines, SARS-CoV-2 was successfully detected from a complex mixture of RNA. Moreover, our data shows an example of a one-step quadruplex RT-PCR using only one single DNA polymerase without the addition of manganese ions.

Furthermore, we investigated the polymerase fidelity within the RT-PCR process. Therefore, the parental enzymes RT-Taq, Mut_RT, Taq pol V2, V3 and V2 IL were used to reverse transcribe and amplify a 100 nt segment of the nCov_N1 gene and the resulting amplicons were subsequently subjected to NGS (Supplementary Fig. S5). RT-Taq and Mut_RT showed comparable error rates and revealed an average error of 0.07%. Taq pol V2, V3 and V2 showed slightly increased error patterns compared to the parental enzymes resulting in average errors of 0.21%, 0.19%, and 0.14%, respectively. It should be noted that the KTq pol generally has a higher accuracy than the full-length Taq pol64. The introduction of the I707L mutation into the Taq V2 pol variant increased the polymerase fidelity by a factor of 1.5. Of note, naturally occurring RTs are known to have lower fidelity than DNA-dependent DNA polymerases13. Therefore, the catalysis during RT-PCR is likely to be more error prone. The Taq pol variants used in this study are DNA pols capable of accepting and processing RNA while successfully amplifying DNA. Thus, errors might not be primarily introduced in the first linear RT step, but also during amplification. Examination of the error distribution for each nucleobase identity separately showed that all DNA pols had the highest error rates for U, followed by A (Supplementary Fig. S6). The highest fidelity was observed for the incorporation of nucleotides opposite G and C bases.

Finally, the best working enzymes should be compared to a commercial two enzyme one-step RTPCR system. Here, the Luna® Universal Probe One-Step RT-PCR Kit from NEB was used to catalyze the quadruplex RT-PCR from the SARS-CoV-2 RNA and the human RNaseP mRNA transcript by using the same primer and probes as described above. The reaction conditions and the cycling protocol were optimized according to the manufacture’s recommendations. For better comparability, Taq pol V2, V3, V2 IL, and RT-Taq were tested under the same reaction conditions optimised for the Luna® Universal Probe One-Step RT-PCR Kit. The enzyme concentrations of the Taq pol variants were consistent with those in the previous quadruplex RT-PCR experiment. Luna enzymes were used in the amounts specified in the manufacturer’s protocol. However, as the supplier does not provide specific information on enzyme concentration or quantity, direct comparability with the Taq pol variants is limited in this respect. All reactions were carried out in the reaction buffer from the Luna® Kit to ensure identical buffer compositions. Taq pol V2 and V3 showed comparable RT-PCR performances as the two enzyme system from the Luna® Universal Probe One-Step RT-PCR Kit targeting all three SARS-CoV-2 genes and the human RNaseP transcript (Supplementary Fig. S7). Taq pol V2 IL showed a slight increase in Cq levels in experiments using the SARS-CoV-2 RNA as template compared to Taq pol V2 and V3. In contrast, RT-Taq showed no activity in processing the SARS-CoV-2 nCov_E gene and demonstrated lower RT-PCR performance in amplifying from the human RNase P transcript compared to the other enzymes. Of note, all Taq pol variants were functional under the buffer and reaction conditions of the Luna® Universal Probe One-Step RT-PCR Kit, and optimising their enzyme concentration might further enhance performance.

Discussion

In this study, we report on the development of RT-PCR active DNA polymerases, Taq pol V2, V3 and V2 IL, that differ by only 3 − 4 amino acids from wild-type Taq pol. The enzymes are capable of catalysing one-step RT-PCR. Moreover, these DNA pols are also capable of performing multiplex RT-PCR and thereby represent enzymes that are capable of performing multiplex RT-PCR with a single enzyme without the addition of manganese ions. Their use in well-established RNA detection approaches can not only improve current laboratory procedures in terms of time and cost, but also offers the possibility of addressing complex RNA targets due to their high thermostability. The Taq pol variants were discovered in a screening approach that combined two independently discovered pools of mutations enhancing the RT activity of the parental KTq pol57,61. The full length Taq pol was used as library scaffold because it is known to be more processive than its truncated version56,64,66. Moreover, the Taq pol scaffold allows their employment in hydrolysis probe-based test systems that are widely used in molecular diagnostics17. Several DNA pol variants were identified that are capable of detecting the exemplary chosen SARS-CoV-2 genome in the N1 region, even when the RNA template was diluted and the enzyme had to generate longer DNA amplicons. Further analysis and characterisation lead to the identification of Taq pol V2, V3 and V2 IL carrying the mutations N483K, E507K, I614K (Taq pol V2); N483K, S515R, I614K (Taq pol V3); and N483K, E507K, I614K, I707L (Taq pol V2 IL). These enzymes were able to detect the N1, N2 and E gene of SARS-CoV-2, both individually and simultaneously with sensitivity of down to 10 (singleplex RT-PCR) and 20 (multiplex RT-PCR) copies of RNA. Application in a quadruplex RT-PCR assay demonstrated simultaneous monitoring of all three SARS-CoV-2 genes from a complex mixture of RNA and successful detection of the RNase P transcript to ensure the integrity of the patient sample. Finally, direct comparison showed that Taq pol V2 and V3 can achieve a RT-PCR performance comparable to the two enzyme Luna® Universal Probe One-Step RT-PCR Kit from NEB.

The one-enzyme formulation has several advantages over a two-enzyme based RT-PCR. Reaction parameters can be optimally adapted to a single DNA pol variant. This means that there is no need to find a compromise between the reaction conditions of two enzymes. Furthermore, the thermostability and the usage of the Taq pol aptamer enable a hot-start of the reaction minimizing the propensity for false amplification in case of high structured RNA70. Besides, RT-PCR does not necessarily require an extra RT step prior to amplification, which saves time. Moreover, there is no competition for the binding and thereby blocking of the RNA template, as may occur with two enzymes This aspect could potentially improve efficiency of detecting small amounts of the target molecules42. Since only 3 or 4 mutations need to be introduced into the wild-type Taq pol backbone, established production and purification processes can simply be retained, which saves current resources.

The mechanism by which these DNA pols acquire their properties is still unclear. All mutation sites are located within the polymerase domain, which likely allows for the proper binding of the DNA/RNA hybrid and/or the folding of the DNA pol into its active form after DNA/RNA binding57,61. It is known that amino acid N483 is located in the thumb domain of the Taq pol contacting the template strand71. Residues E507 and S515 are also part of the thumb domain and interact with the primer strand. Especially, the mutation E507K has been shown to contribute to a fast PCR cycling property of the Taq pol72. The amino acid I614 is located in the finger domain and forms part of the hydrophobic pocket that binds the nucleobase and deoxyribose of the incoming dNTP. Its mutation has been identified to affect the fidelity of nucleobase pairing73. Mutation I707L is placed in the finger domain and has been shown to reduce the polymerase activity at lower temperatures, while retaining normal activity at the optimal reaction temperature and thermostability at 95°C69. Overall, the mutation sites are distributed throughout the polymerase domain and might contribute to synergistic effects that are improving the properties of RT-PCR. Our results add new DNA pols to the toolbox of enzymes suitable to be used in RT-PCR methodologies and further demonstrate that DNA polymerase engineering is a viable strategy to obtain enzymes with suitable properties.

Materials and methods

Oligonucleotides

DNA oligonucleotides were ordered from IDT or Biomers in HPLC grade and directly used in RT-PCR assays and primer extension reactions. The RNA oligonucleotide was purchased PAGE purified from Purimex and was directly used. SARS-CoV-2 RNA was purchased from the European Commission Joint Research Centre (CRM code: EURM-019) and was directly used. The SARS-CoV-2 RNA was stored at −20 °C according to the manufacturer’s instructions and diluted samples were stored at −80 °C in aliquots. The universal human reference RNA was purchased from Invitrogen (catalogue number: QS0639) and was directly used. For primer extension reactions analysed by denaturing PAGE, oligonucleotides were purified by preparative PAGE prior to usage. Thereafter, radioactive labelling with [γ-32p]-ATP (Hartmann Analytic) and T4 polynucleotide kinase (NEB) was conducted following the manufacturer’s protocol. The concentration of the used oligonucleotides was determined by measuring the absorbance at 260 nm and applying the Lambert-Beer law. Used dNTPs were supplied by Jena Bioscience. DNA and RNA sequences of all oligonucleotides are shown in Supplementary Table S1.

Taq pol library preparation

256 pGDR11 plasmids carrying different Taq pol genes were purchased from Genewiz (Supplementary Table S2) and delivered in one single tube. Plasmids were synthesized in equimolar amounts using the trimer-controlled gene synthesis approach (smart library) from Genewiz and quality checked by sequencing 96 individual clones. Transformation of electro competent E. coli BL21 DE3 (Stratagene, Supplementary Table S3) cells was performed by adding 10 ng plasmids to 100 µL cells and applying an electric pulse with 1800 V, 200 Ω and a pulse length of around 5 ms by Gene Pulser Xcell (Bio-Rad). Cells were mixed with 1 mL SOC medium and incubated at 37 °C for 1 h. After plating and incubation of the cells (37 °C, overnight), 100 µL of LB medium containing 100 µg/mL carbenicillin disodium salt was inoculated with a single cell colony to end up with 7 × 384 deep-well plates. Cells were grown at 37 °C overnight, 100 µL glycerol was added in each well and library plates were stored at −80 °C.

Generation of E. coli cell lysates in 96 well plates

1 mL LB medium with 100 µg/mL carbenicillin disodium salt was inoculated with glycerol stocks of E. coli and cells were grown at 37 °C in a shaker. After reaching an OD600 of 0.6, expression was induced by the addition of IPTG (0.4 mM final concentration). Cells were incubated at 37 °C in a shaker for 3 h and then harvested by centrifugation with 4000 rpm at 4 °C for 30 min. Pellets were resuspended and lysed by the addition of 800 µL 1x lysis buffer I (50 mM Tris-HCl (pH 9.2), 16 mM (NH4)2SO4, 2.5 mM MgCl2, 0.1% (v/v) Tween 20, 0.5 mg/mL lysozyme) and incubation at 37 °C for 20 min. After denaturing of E. coli host proteins at 75 °C for 45 min, plates were centrifuged at 4000 rpm and 4 °C for 30 min and stored at 4 °C. Uniform expression was exemplary analysed by SDS-PAGE.

Large scale protein production and purification

400 mL LB medium with 100 µg/mL carbenicillin disodium salt was inoculated with 4 mL of the respective E. coli overnight culture and incubated at 37 °C in a shaker. After reaching an OD600 of 0.6–0.8, induction was performed by adding 1 mM IPTG. Cells were incubated at 37 °C in a shaker for 4 h. Then, cultures were harvested by centrifugation at 4 °C and 4000 rpm for 30 min and pellets were stored at −20 °C. Cell pellets were resuspended and lysed in 10 mL 1x lysis buffer II (10 mM Tris-HCl (pH 9.2), 300 mM NaCl, 2.5 mM MgCl2, 0.1% (v/v) Triton X-100, 1 mg/mL lysozyme) at 37 °C for 20 min. After heat inactivating of E. coli host proteins at 75 °C for 45 min, lysates were ultra-centrifuged at 20.000 rpm for 1 h. 5 mM imidazole was added to the supernatant and metal ion-based affinity purification was conducted by applying the cOmplete™ His-Tag purification resin (Roche). Here, 3 mL beads were washed 4 times with 5 mL calibration buffer (10 mM Tris-HCl (pH 9.2), 300 mM NaCl, 2.5 mM MgCl2, 0.1% (v/v) Triton X-100, 5 mM imidazole) by alternate mixing and centrifugation at 1000 rpm and 4 °C for 3 min. The cell lysate was mixed with the calibrated beads and shaken at 4 °C in an overhead shaker overnight. The protein bead solution was applied onto a 15 mL chromatography column and the flow through was collected. Then, beads were washed 4 times with 5 mL washing buffer (10 mM Tris-HCl (pH 9.2), 300 mM NaCl, 2.5 mM MgCl2, 0.1% (v/v) Triton X-100, 20 mM imidazole). Protein elution was performed by loading 1.6 mL elution buffer I (100 mM Tris-HCl (pH 9.2), 5 mM MgCl2, 200 mM imidazole) 6 times and collecting the flow through separately at 4 °C. Protein purification was analysed by SDS-PAGE. Pure enzyme fractions were combined and concentrated by using an Amicon filter tube (Amicon Ultra-15, 30 kDa MWCO, Millipore) at 4 °C and 4000 rpm. Remaining imidazole was removed by washing the protein solution 4 times with 20 mL elution buffer II (100 mM Tris-HCl (pH 9.2), 5 mM MgCl2) followed by a final concentration step to obtain an end volume of 0.4–0.5 mL. For storage, 20x storage buffer (100 mM Tris-HCl (pH 9.2), 320 mM (NH4)2SO4, 5 mM MgCl2, 2% (v/v) Tween 20) was added in an amount equal to 1/9th of the volume of the protein (final 2x storage buffer). Finally, autoclaved glycerol was added to reach a final concentration of 50% and enzyme stocks were stored in aliquots at −20 °C. Protein concentration was determined by employing a Bradford assay (Roti®Quant, Roth) by using BSA as reference protein.

Primer extension with radioactively labelled primer

Reactions were carried out in 10 µL end volume. The reaction mixture contained 225 nM RNA oligonucleotide template, 150 nM radioactively labelled DNA primer, 100 µM of dNTPs (each) and 2 nM KTq pol variant in 1x Taq reaction buffer (50 mM Tris-HCl (pH 9.2), 16 mM (NH4)2SO4, 2.5 mM MgCl2, 0.1% (v/v) Tween 20). First, template and primer were annealed by heating the mixture to 95 °C for 2 min and then cooling it down stepwise to 4 °C. Then, dNTPs in 1x Taq reaction buffer was added and pre-warmed to 55 °C. Primer extension was started by the addition of 4 µL KTq pol variant in 1x Taq reaction buffer. Reactions were allowed to proceed at 55 °C for indicated reaction times and were quenched by the addition of 20 µL stop solution (80% formamide, 20 mM EDTA, 0.25% (w/v) bromophenol blue, 0.25% (w/v) xylene cyanole). After denaturation at 95 °C for 5 min, reactions were analysed by 12% denaturing PAGE and visualised by phosphor imaging conducted on a Typhoon™ FLA 9500 (GE Healthcare Life Science). Reactions were repeated twice.

RT-PCR assay using the RNA oligonucleotide as template

RT-PCRs were performed in a reaction volume of 10 µL containing 100 pM RNA oligonucleotide as template, 100 nM forward and reverse primer (each), 200 µM dNTPs (each), 0.5 M betaine, 1x SYBR green I, 100 nM Taq pol aptamer and 100 nM of Taq pol variants in 1x Taq reaction buffer (50 mM TrisHCl (pH 9.2), 16 mM (NH4)2SO4, 2.5 mM MgCl2, 0.1% (v/v) Tween 20). RT-PCR was conducted following the thermocycling protocol: 95 °C for 60 s, then 40 cycles of 95 °C for 15 s and 62 °C for 30 s. Fluorescent intensities were measured after each of the 40 cycles using the LightCycler® 96 instrument (Roche) and the results were analysed using the Light Cycler® 96 SW 1.1 software (Roche, absolute quantification mode). Reactions were performed two or three times.

Screening assay for Taq pol variants with increased RT-PCR activity

For screening, 10 µL reaction mixture contained 250 µM dNTPs (each), 670 nM forward and reverse primer (each), 0.5 M betaine, 800 nM Taq pol aptamer, 3 µL of the respective E. coli cell lysate, 170 nM fluorescent probe and SARS-CoV-2 RNA template as indicated in 1x Taq reaction buffer (50 mM TrisHCl (pH 9.2), 16 mM (NH4)2SO4, 2.5 mM MgCl2, 0.1% (v/v) Tween 20). After preincubation at 61 °C for 150 s, 10 cycles with 95 °C for 5 s and 61 °C for 150 s were carried out. Then, 45 cycles with 95 °C for 10 s and 57 °C for 35 s followed. Fluorescent intensities were measured after each of the 45 cycles. PCR was monitored by using the CFX384 Touch Real-Time PCR Detection System (Biorad) and the Bio-Rad CFX Manager 3.1 software (single treshold mode). Reactions were performed in 384 well plates, thus 4 lysate plates (96 wells) could be analysed in parallel. For easy handling, pipetting was conducted by using the pipetting robot ASSIST plus from Integra. Enzyme performance was evaluated by assessing the Cq values of each Taq pol variant and the shape of the product curve. Product formation was exemplary analysed by agarose gel electrophoresis (2.5% agarose gel in 1x TAE buffer).

For reactions with purified Taq pol variants, 10 µL reaction mixture contained 250 µM dNTPs (each), 670 nM forward and reverse primer (each), 0.5 M betaine, 100 nM Taq pol aptamer, 30 nM respective Taq pol variant, 170 nM fluorescent probe or 1x SYBR green I and SARS-CoV-2 RNA template as indicated in 1x Taq reaction buffer (50 mM Tris-HCl (pH 9.2), 16 mM (NH4)2SO4, 2.5 mM MgCl2, 0.1% (v/v) Tween 20). After preincubation at 61 °C for 150 s, 10 cycles with 95 °C for 5 s and 61 °C for 150 s were carried out. Then, 35 cycles with 95 °C for 10 s and 57 °C for 35 s followed. Fluorescent intensities were measured after each of the 35 cycles. PCR was monitored by using the CFX384 Touch Real-Time PCR Detection System (Biorad) and the Bio-Rad CFX Manager 3.1 software (single treshold mode).

Site directed mutagenesis for introducing I707L mutation

For PCR, Phusion High-Fidelity DNA Polymerase (NEB) was used according to the manufacturer’s protocol. The reaction mixture contained 200 µM dNTPs (each), 0.5 µM forward primer (which had the desired point mutation in its DNA-strand), 0.5 µM reverse primer (with a phosphate residue on its 5’-ending), 2 ng/µL plasmid DNA, 3% (v/v) DMSO (NEB) and 0.02 U/µL Phusion DNA Polymerase (NEB) in 1x Phusion HF buffer (NEB). The reaction had a total volume of 20 µL and was performed in triplicates. An initial denaturation step was performed at 98 °C for 30 s, followed by 30 PCR cycles. One cycle consisted of a denaturation step at 98 °C for 10 s, an annealing step at 60 °C for 15 s and an extension step at 72 °C for 160 s. Finally, the PCR was terminated by a final extension step at 72 °C for 5 min. To eliminate the methylated paternal DNA plasmid, a DpnI enzymatic digestion was conducted by adding 1 µL DpnI (NEB) and 5 µL CutSmart buffer (NEB, finally 1x) to 44 µL PCR reaction, resulting in a final volume of 50 µL. The digestion was carried out at 37 °C for 1.5 h. The product was purified by agarose gel electrophoresis (0.8% agarose gel in 1x TAE), excision of the respective bands and applying the NucleoSpin Gel and PCR clean-up kit (Macherey-Nagel). Finally, the mutated plasmid was ligated by adding 1 µL T4 DNA Ligase (NEB) and 2 µL T4 DNA ligase reaction buffer (NEB, finally 1x) to 17 µL purified DNA. Ligation was performed at 23 °C for 2 h and the enzyme was denatured at 65 °C for 10 min. The circular plasmid was transformed into electro competent E. coli BL 21 (DE3) using the same method as described for the Taq pol library preparation. After plating and incubation of the cells (37 °C, overnight), 6 mL of LB medium containing 100 µg/mL carbenicillin disodium salt was inoculated with a single cell colony. Cells were grown at 37 °C overnight and a glycerol stock was generated (500 µL cells mixed with 500 µL glycerol). The presence of the I707L mutation was verified by isolating and sequencing the plasmid.

Singleplexed and multiplexed one-step RT-PCR using SARS-CoV-2 RNA as template

Singleplex RT-PCR

For the singleplex PCR, the CDC-approved DNA primers and fluorescent probes that target the SARS-CoV-2_N1, N2 and E genes were used in the reaction and are listed in Supplementary Table S1. 20 µL reaction mixture contained 250 µM dNTPs (each), 670 nM forward and reverse primer (each) addressing either the nCov_N1 or nCov_N2 gene, 0.5 M betaine, 0.5 mM MgCl2, 400 nM Taq pol aptamer, 120 nM respective Taq pol variant, 170 nM fluorescent probe addressing either the nCov_N1 or nCov_N2 gene and SARS-CoV-2 RNA template as indicated in 1x Taq reaction buffer (50 mM Tris-HCl (pH 9.2), 16 mM (NH4)2SO4, 2.5 mM MgCl2, 0.1% (v/v) Tween 20). When addressing the nCoV_E gene, the same reaction mix was used as described above, but 850 nM forward and reverse primer (each) and 216 nM fluorescent probe were applied. After preincubation at 65 °C for 150 s, 10 cycles with 95 °C for 5 s and 65 °C for 150 s were carried out. Then, 40 cycles with 95 °C for 10 s and 57 °C for 35 s followed. Fluorescent intensities were measured after each of the 40 cycles. The LightCycler® 96 System (Roche) was used to monitor the reaction and the Light Cycler® 96 SW 1.1 software (Roche, absolute quantification mode) was used to analyse the results. To determine the detection limits, the SARS-CoV-2 RNA template was diluted as indicated and applied in the singleplexed RT-PCR assay as explained above. Average Cq values were plotted against the decadic logarithm of the template concentrations and linear regression was employed. PCR efficiencies (E) were calculated using the slope (m) of the calculated linear function: (text{E}={text{10}}^{text{(}frac{text{-1}}{text{m}}text{)}}-1)

Linear functions, coefficient of determination and PCR efficiencies are shown in Supplementary Table S4. Product formation was analysed by agarose gel electrophoresis (2.5% agarose gel in 1x TAE buffer). Reactions were performed at least twice.

Triplex RT-PCR

For the triplex PCR, the CDC-approved DNA primers and fluorescent probes that target the SARS-CoV-2_N1, N2 and E genes were used in the reaction and are listed in Supplementary Table S1. The same reaction volume and reaction mix was used as for the singleplex RT-PCR, but all primer/probe sets targeting the nCov_N1, N2 and E gene were simultaneously added in the same concentrations as above. The same thermocycling protocol was conducted as for the singleplex RT-PCR above. The LightCycler® 96 System (Roche) was used to monitor the reaction and the Light Cycler® 96 SW 1.1 software (Roche, absolute quantification mode) was used to analyse the results. To determine the detection limits, the SARS-CoV-2 RNA template was diluted as indicated and applied in the multiplexed RT-PCR assay. Average Cq values were plotted against the decadic logarithm of the template concentrations and linear regression was employed. PCR efficiencies (E) were calculated as stated above. Linear functions, coefficient of determination and PCR efficiencies are shown in Supplementary Table S4. Product formation was analysed by agarose gel electrophoresis (2.5% agarose gel in 1x TAE buffer). Reactions were performed at least twice.

Quadruplex RT-PCR

For the quadruplex PCR, the CDC-approved DNA primers and fluorescent probes targeting the SARS-CoV-2_N1, N2 and E genes and the RNase P mRNA were used in the reaction (Supplementary Table S1). The same reaction volume and reaction mix was used as for the singleplex RT-PCR, but all primer/probe sets targeting the nCov_N1, N2 and E gene were simultaneously added in the same concentrations as before. Additionally, 670 nM forward and reverse primer (each), 170 nM fluorescent probe for the RNase P detection and 1 ng/µL universal human reference RNA were added. The same thermocycling protocol was conducted as for the singleplex RT-PCR above. The LightCycler® 96 System (Roche) was used to monitor the reaction and the Light Cycler® 96 SW 1.1 software (Roche, absolute quantification mode) was used to analyse the results. Furthermore, The SARS-CoV-2 RNA template was diluted as indicated and applied in the multiplexed RT-PCR assay. Average Cq values were plotted against the decadic logarithm of the template concentrations and linear regression was employed. PCR efficiencies (E) were calculated as stated above. Linear functions, coefficient of determination and PCR efficiencies are shown in Supplementary Table S4. Product formation was analysed by agarose gel electrophoresis (2.5% agarose gel in 1x TAE buffer). Reactions were performed at least twice.

Quadruplex RT-PCR with Luna® Universal probe one-step RT-PCR kit

For the quadruplex PCR, the CDC-approved DNA primers and fluorescent probes targeting the SARS-CoV-2_N1, N2 and E genes, as well as the RNase P mRNA, were used in the reaction (Supplementary Table S1). The reaction was optimized according to the manufacturer’s recommendations. 20 µL reaction mixture contained 670 nM of each forward and reverse primer and 170 nM of each fluorescent probe for the nCov_N1 and N2 genes, as well as 850 nM of each forward and reverse primer and 216 nM of fluorescent probe for the nCov_E gene. Additionally, 670 nM of each forward and reverse primer and 170 nM fluorescent probe were included for RNase P detection. The reaction was carried out with 1 ng/µL universal human reference RNA, 10.000 c/µL of the SARS-CoV-2 RNA template and 1x Luna WarmStart RT enzyme mix in 1x Universal Probe One-Step Reaction Mix (NEB). When using the engineered Taq pol variants, the Luna WarmStart RT enzyme mix was replaced with 120 nM of the respective Taq pol variant and 400 nM Taq pol aptamer. One-step RT-PCR was carried out using the following cycling protocol: After preincubation at 60 °C for 10 min, an initial denaturation step at 95 °C for 1 min was performed, followed by 45 cycles of 95 °C for 10 s and 60 °C for 1 min. Fluorescent intensities were measured after each of the 45 cycles. The LightCycler® 96 System (Roche) was used to monitor the reaction and the Light Cycler® 96 SW 1.1 software (Roche, absolute quantification mode) was used to analyse the results.

DNA amplicon library preparation for NGS analysis

RT-PCR with Taq pol variants was conducted similarly to the RT-PCR screening approach using purified enzymes. 10 µL reaction mix contained 250 µM dNTPs (each), 670 nM forward and reverse primer (each, giving the 100 bp transcript from the nCov_N1 gene, Supplementary Table S1), 0.5 M betaine, 100 nM Taq pol aptamer, 30 nM respective Taq pol variant, 10.000 copies of SARS-CoV-2 RNA template in 1x Taq reaction buffer (50 mM Tris-HCl (pH 9.2), 16 mM (NH4)2SO4, 2.5 mM MgCl2, 0.1% (v/v) Tween 20). After preincubation at 61 °C for 150 s, 10 cycles with 95 °C for 5 s and 61 °C for 150 s were carried out. Then, 35 cycles with 95 °C for 10 s and 57 °C for 35 s followed.

To determine the DNA product concentration, each reaction mix was analysed by using the 4150 TapeStation System (Agilent), High Sensitivity D1000 ScreenTapes (Agilent) and High Sensitivity D1000 Reagents (Agilent). Sample preparation and measurement was carried out according to the guidelines of the company. Each reaction was loaded twice and 1 ng of a 100 bp reference DNA with the same sequence was applied as a control. Afterwards, remaining reaction mix was purified by agarose gel electrophoresis (2.5% agarose gel in 1x TAE buffer), excision of the respective band and applying the NucleoSpin Gel and PCR clean-up kit XS (Macherey-Nagel). Elution was achieved by adding 7 µL elution buffer to the column (for better recovery, elution was loaded again). DNA concentration was determined by using the 4150 TapeStation System (Agilent) as described above. In the next step, DNA was repaired by using 0.35 nM purified DNA, 0.1 µL PreCR Repair mix (NEB) with 100 µM of dNTP (each), 1x NAD+ (provided by the manufacturer) and 1x ThermoPol reaction buffer (NEB) to reach 20 µL reaction volume. Repair was performed at 37 °C for 20 min. Unique molecular identifier (UMI) introduction was obtained by applying Q5 Hot Start High-Fidelity DNA Polymerase (NEB) in a three-step PCR reaction. The reaction mixture contained 20 µL repair mix, 0.02 U/µL DNA polymerase, 200 nM forward and reverse primer (each) and finally 0.2x Q5 reaction buffer to reach 25 µL reaction volume. After initial denaturation at 98 °C for 2 min, two cycles with 98 °C for 10 s, 62 °C for 30 s and 72 °C for 30 s were carried out followed by a final elongation step at 72 °C for 2 min. Reaction was purified by agarose gel electrophoresis as described above. This time, buffer NTI was diluted with Milli-Q water in a ratio of 2:1 and then used for dissolving gel pieces at 50 °C. DNA concentration was determined with a qPCR approach, where an DNA oligonucleotide with a similar length was applied as reference in different template concentrations (200 pM–0.01 pM). 1x NEBNext Ultra II Q5 Mastermix (NEB) was used for PCR with 1.6 µL DNA, 400 nM forward and reverse primer (each), 1x SYBR green I in 5 µL reaction volume. After a final denaturation step at 98 °C for 1 min, a three-step cycling protocol with 35 PCR cycles followed with 98 °C for 10 s, 60 °C for 30 s and 72 °C for 30 s and a final elongation step with 72 °C for 2 min was conducted. Fluorescent intensity was measured after each of the 35 cycles (LightCycler® 96 System (Roche)) and melting curves were determined at the end of the reaction. The Cq values were determined using the Light Cycler® 96 SW 1.1 software. For analysis, Cq values of reactions with reference DNA were plotted against the decadic logarithm of the template concentrations and linear regression was employed. DNA concentrations of samples to be analysed were calculated using the linear predictor function. Results were multiplied by two. DNA was repaired once again by using 0.05 µL PreCR mix (NEB), 1x NAD+, 100 µM of dNTP (each), 48.9 fM DNA and 1x ThermoPol reaction buffer (NEB) to reach 11.25 µL end volume. After reaction for 20 min at 37 °C, DNA mix was directly used for Amplicon PCR. Here, 400 nM forward and reverse primer (equipped with Illumina adapter sequence and indices), the entire repair mix (final DNA concentration 22 fM) and 1x NEBNext Ultra II Q5 Mastermix (NEB) was used for reaction in 25 µL end volume. After an initial denaturation step at 98 °C for 5 min, a three-step cycling protocol with 35 PCR cycles followed with 98 °C for 10 s, 70 °C for 30 s and 72 °C for 30 s and a final elongation step at 72 °C for 2 min was conducted. Reaction was analysed and purified by agarose gel electrophoresis as described above. To dissolve gel pieces, buffer NTI (Macherey Nagel) was diluted with Milli-Q water in ration 2:1 and 450 µL NTI mix was added to the gel pieces. After complete dissolving at 50 °C, the mixture was diluted once again with MilliQ water to reach a final volume of 1750 µL. DNA was loaded on the column and eluted with 20 µL elution buffer (load twice). After performing the repair reaction on the whole eluate as described above (25 µL reaction volume), DNA was purified by using the QIAEX II System (Qiagen). Here, 2 µL particle slurry was used for each sample and elution of DNA was achieved by adding 18 µL Milli-Q water to the dried beads, incubation for 5 min and taking 16 µL after centrifugation. The concentration of the final DNA library was determined by Quantus™ Fluorometer (Promega) and ranged between 19 and 64 nM. After quality control with Bioanalyzer 2100 (Agilent), DNA libraries were pooled and sequenced in paired-end mode (2*75 bp) on an IIlumina NextSeq2000 instrument. For RT-Taq, Mut_RT and Taq pol V2, library preparation was performed twice. Each DNA library was sequenced once and the resulting coverage was used for error calculation (Supplementary Table S5).

NGS data processing of amplicon libraries

Sequencing data were processed by using the software KNIME (Supplementary Table S6). Quality and sequence are read from the FastQ files. Phred quality scores (Q scores) were transformed into base calling error probabilities (P) by using: (text{P}={text{10}text{}}^{frac{text{-Q}}{text{10}}})

Read1 and Read 2 were merged giving the expected size of transcript. The highest quality data was used in the overlapping regions of Read1 and Read2. High quality data were filtered and aligned to the RNA reference of the SARS-CoV-2 RNA sequences. A frameshift correction was performed e.g. caused by phasing or pre-phasing during sequencing74,75. Those reads with a misalignment higher than 24% were removed from the data set (removal of only 0.5% of the reads such as adapter dimers). Reads were sorted into UMI families and family read numbers were counted. Substitution error calculation was performed in each UMI family having more than 3 reads. If 90% of the reads within each family carry the misincorporation, the error is set to 1 (otherwise 0). Afterwards, the substitution error was calculated over all UMI families at each position of the transcript. To normalize the error rates of each Taq pol variant according to its amplification activity, the individual amplification and doubling numbers of each enzyme were included in the calculation. The amplification number (A) and the doubling number (D) were determined by using the following equations (n = amount of substance [mol]):

$$text{A}=text{}frac{text{n(after PCR)}}{text{n(input RNA)}}qquad 2^text{D}=text{A}rightarrowtext{D}=frac{text{Ln(A)}}{text{Ln(2)}}$$

Each error rate for each position in the template was divided by the individual doubling number and plotted using Microsoft excel software. For the average substitution error, the region of the amplicon without primer binding sites is included (area depicted in Supplementary Fig. S5). The last T read in this region was excluded since values on this position differ very clearly from the rest of the data. The coverage, the number of UMI families and the doubling numbers used for error calculation are listed individually for each sequencing library in Supplementary Table S5.

Data availability

HTS data are available in the NCBI GEO database, records GSE280134, (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE280134, private secure token: ibytqmygljivrkz). All data needed to evaluate the conclusions in the paper are present in the paper and/or in the Supplementary data. Crystal structure information is available at the RCSB PDB Protein Data Bank under the accession number 4BWM (https://www.rcsb.org/structure/4BWM) and 1TAQ (https://www.rcsb.org/structure/1TAQ). Additional data may be requested from the authors (A) M. and L. (B) H.

References

  1. Rappolee, D. A., Mark, D., Banda, M. J. & Werb, Z. Wound macrophages express TGF-alpha and other growth factors in vivo: analysis by mRNA phenotyping. Science 241, 708–712. https://doi.org/10.1126/science.3041594 (1988).

    Article  ADS  CAS  PubMed  Google Scholar 

  2. Bustin, S. A. & Mueller, R. Real-time reverse transcription PCR (qRT-PCR) and its potential use in clinical diagnosis. Clin. Sci. (Lond). 109, 365–379. https://doi.org/10.1042/cs20050086 (2005).

    Article  CAS  PubMed  Google Scholar 

  3. Zhu, Y. Y., Machleder, E. M., Chenchik, A., Li, R. & Siebert, P. D. Reverse transcriptase template switching: a SMART approach for full-length cDNA library construction. Biotechniques 30, 892–897. https://doi.org/10.2144/01304pf02 (2001).

    Article  CAS  PubMed  Google Scholar 

  4. VanGuilder, H. D., Vrana, K. E. & Freeman, W. M. Twenty-five years of quantitative PCR for gene expression analysis. Biotechniques 44, 619–626. https://doi.org/10.2144/000112776 (2008).

    Article  CAS  PubMed  Google Scholar 

  5. Johnson, G., Nolan, T. & Bustin, S. A. Real-time quantitative PCR, pathogen detection and MIQE. Methods Mol. Biol. 943, 1–16. https://doi.org/10.1007/978-1-60327-353-4_1 (2013).

    Article  CAS  PubMed  Google Scholar 

  6. Zhou, B. et al. Multiplex reverse Transcription-PCR for simultaneous surveillance of influenza A and B viruses. J. Clin. Microbiol. 55, 3492–3501. https://doi.org/10.1128/jcm.00957-17 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Mocellin, S., Rossi, C. R., Pilati, P., Nitti, D. & Marincola, F. M. Quantitative real-time PCR: a powerful ally in cancer research. Trends Mol. Med. 9, 189–195. https://doi.org/10.1016/s1471-4914(03)00047-9 (2003).

    Article  CAS  PubMed  Google Scholar 

  8. Vaughn, C. P. et al. Simultaneous detection of lung fusions using a multiplex RT-PCR next generation sequencing-based approach: a multi-institutional research study. BMC Cancer. 18, 828. https://doi.org/10.1186/s12885-018-4736-4 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Yelagandula, R. et al. Multiplexed detection of SARS-CoV-2 and other respiratory infections in high throughput by SARSeq. Nat. Commun. 12, 3132. https://doi.org/10.1038/s41467-021-22664-5 (2021).

    Article  ADS  CAS  PubMed  PubMed Central  Google Scholar 

  10. Kuiper, J. W. P. et al. Detection of SARS-CoV-2 from Raw patient samples by coupled high temperature reverse transcription and amplification. PLoS One. 15, e0241740. https://doi.org/10.1371/journal.pone.0241740 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Eftekhari, A. et al. A comprehensive review of detection methods for SARS-CoV-2. Microorganisms 9 https://doi.org/10.3390/microorganisms9020232 (2021).

  12. Yoo, H. M., Kim, I. H. & Kim, S. Nucleic acid testing of SARS-CoV-2. Int. J. Mol. Sci. 22 https://doi.org/10.3390/ijms22116150 (2021).

  13. Martín-Alonso, S., Frutos-Beltrán, E. & Menéndez-Arias, L. Reverse transcriptase: from transcriptomics to genome editing. Trends Biotechnol. 39, 194–210. https://doi.org/10.1016/j.tibtech.2020.06.008 (2021).

    Article  CAS  PubMed  Google Scholar 

  14. Huber, L. B., Betz, K. & Marx, A. Reverse transcriptases: from discovery and applications to xenobiology. Chembiochem 24, e202200521. https://doi.org/10.1002/cbic.202200521 (2023).

    Article  CAS  PubMed  Google Scholar 

  15. Holland, P. M., Abramson, R. D., Watson, R. & Gelfand, D. H. Detection of specific polymerase chain reaction product by utilizing the 5’—-3’ exonuclease activity of Thermus aquaticus DNA polymerase. Proc. Natl. Acad. Sci. U S A. 88, 7276–7280. https://doi.org/10.1073/pnas.88.16.7276 (1991).

    Article  ADS  CAS  PubMed  PubMed Central  Google Scholar 

  16. Kubista, M. et al. The real-time polymerase chain reaction. Mol. Aspects Med. 27, 95–125. https://doi.org/10.1016/j.mam.2005.12.007 (2006).

    Article  CAS  PubMed  Google Scholar 

  17. Navarro, E., Serrano-Heras, G., Castaño, M. J. & Solera, J. Real-time PCR detection chemistry. Clin. Chim. Acta. 439, 231–250. https://doi.org/10.1016/j.cca.2014.10.017 (2015).

    Article  CAS  PubMed  Google Scholar 

  18. Mallet, F., Oriol, G., Mary, C., Verrier, B. & Mandrand, B. Continuous RT-PCR using AMV-RT and Taq DNA polymerase: characterization and comparison to uncoupled procedures. Biotechniques 18, 678–687 (1995).

    CAS  PubMed  Google Scholar 

  19. Bustin, S. A. Absolute quantification of mRNA using real-time reverse transcription polymerase chain reaction assays. J. Mol. Endocrinol. 25, 169–193. https://doi.org/10.1677/jme.0.0250169 (2000).

    Article  CAS  PubMed  Google Scholar 

  20. Rejali, N. A., Zuiter, A. M., Quackenbush, J. F. & Wittwer, C. T. Reverse transcriptase kinetics for one-step RT-PCR. Anal. Biochem. 601, 113768. https://doi.org/10.1016/j.ab.2020.113768 (2020).

    Article  CAS  PubMed  Google Scholar 

  21. Huang, H. S. et al. Multiplex PCR system for the rapid diagnosis of respiratory virus infection: systematic review and meta-analysis. Clin. Microbiol. Infect. 24, 1055–1063. https://doi.org/10.1016/j.cmi.2017.11.018 (2018).

    Article  CAS  PubMed  Google Scholar 

  22. Waggoner, J. J. et al. Triplex Real-Time RT-PCR for severe acute respiratory syndrome coronavirus 2. Emerg. Infect. Dis. 26, 1633–1635. https://doi.org/10.3201/eid2607.201285 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Ishige, T. et al. Highly sensitive detection of SARS-CoV-2 RNA by multiplex rRT-PCR for molecular diagnosis of COVID-19 by clinical laboratories. Clin. Chim. Acta. 507, 139–142. https://doi.org/10.1016/j.cca.2020.04.023 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Shu, B. et al. Multiplex Real-Time reverse transcription PCR for influenza A virus, influenza B virus, and severe acute respiratory syndrome coronavirus 2. Emerg. Infect. Dis. 27, 1821–1830. https://doi.org/10.3201/eid2707.210462 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Yang, J., Li, D., Wang, J., Zhang, R. & Li, J. Design, optimization, and application of multiplex rRT-PCR in the detection of respiratory viruses. Crit. Rev. Clin. Lab. Sci. 59, 555–572. https://doi.org/10.1080/10408363.2022.2072467 (2022).

    Article  CAS  PubMed  Google Scholar 

  26. Giulietti, A. et al. An overview of real-time quantitative PCR: applications to quantify cytokine gene expression. Methods 25, 386–401. https://doi.org/10.1006/meth.2001.1261 (2001).

    Article  CAS  PubMed  Google Scholar 

  27. Maley, D., Mei, J., Lu, H., Johnson, D. L. & Ilyin, S. E. Multiplexed RT- PCR for high throughput screening applications. Comb. Chem. High. Throughput Screen. 7, 727–732. https://doi.org/10.2174/1386207043328300 (2004).

    Article  CAS  PubMed  Google Scholar 

  28. Gupta, D. G. et al. Identification and validation of suitable housekeeping genes for gene expression studies in BCR-ABL1 positive B-lineage acute lymphoblastic leukemia. Mol. Biol. Rep. 49, 4841–4848. https://doi.org/10.1007/s11033-022-07337-w (2022).

    Article  CAS  PubMed  Google Scholar 

  29. Chruscinski, A. et al. Evaluation of a gene expression biomarker to identify operationally tolerant liver transplant recipients: the LITMUS trial. Clin. Exp. Immunol. 207, 123–139. https://doi.org/10.1093/cei/uxab011 (2022).

    Article  PubMed  Google Scholar 

  30. Růzek, D., Stastná, H., Kopecký, J., Golovljova, I. & Grubhoffer, L. Rapid subtyping of tick-borne encephalitis virus isolates using multiplex RT-PCR. J. Virol. Methods. 144, 133–137. https://doi.org/10.1016/j.jviromet.2007.04.010 (2007).

    Article  CAS  PubMed  Google Scholar 

  31. Alotaibi, B. S. et al. Simultaneous detection of influenza A/B, respiratory syncytial virus, and SARS-CoV-2 in nasopharyngeal swabs by One-Tube multiplex reverse transcription polymerase chain reaction. Trop. Med. Infect. Dis. 8 https://doi.org/10.3390/tropicalmed8060326 (2023).

  32. Clark-Langone, K. M. et al. Biomarker discovery for colon cancer using a 761 gene RT-PCR assay. BMC Genom. 8, 279. https://doi.org/10.1186/1471-2164-8-279 (2007).

    Article  CAS  Google Scholar 

  33. Montero-Macías, R. et al. HPV molecular genotyping as a differential diagnosis tool in cervical Cancer metastasis. J. Pers. Med. 13 https://doi.org/10.3390/jpm13020177 (2023).

  34. Oscorbin, I. P. & Filipenko, M. L. M-MuLV reverse transcriptase: selected properties and improved mutants. Comput. Struct. Biotechnol. J. 19, 6315–6327. https://doi.org/10.1016/j.csbj.2021.11.030 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Álvarez, M. & Menéndez-Arias, L. Temperature effects on the fidelity of a thermostable HIV-1 reverse transcriptase. Febs J. 281, 342–351. https://doi.org/10.1111/febs.12605 (2014).

    Article  CAS  PubMed  Google Scholar 

  36. Vellore, J., Moretz, S. E. & Lampson, B. C. A group II intron-type open reading frame from the thermophile Bacillus (Geobacillus) stearothermophilus encodes a heat-stable reverse transcriptase. Appl. Environ. Microbiol. 70, 7140–7147. https://doi.org/10.1128/aem.70.12.7140-7147.2004 (2004).

    Article  ADS  CAS  PubMed  PubMed Central  Google Scholar 

  37. Yasukawa, K., Nemoto, D. & Inouye, K. Comparison of the thermal stabilities of reverse transcriptases from avian myeloblastosis virus and Moloney murine leukaemia virus. J. Biochem. 143, 261–268. https://doi.org/10.1093/jb/mvm217 (2008).

    Article  CAS  PubMed  Google Scholar 

  38. Higgs, P. G. RNA secondary structure: physical and computational aspects. Q. Rev. Biophys. 33, 199–253. https://doi.org/10.1017/s0033583500003620 (2000).

    Article  CAS  PubMed  Google Scholar 

  39. Malboeuf, C. M., Isaacs, S. J., Tran, N. H. & Kim, B. Thermal effects on reverse transcription: improvement of accuracy and processivity in cDNA synthesis. Biotechniques 30, 1074–1078. https://doi.org/10.2144/01305rr06 (2001). 1080, 1082, passim.

    Article  CAS  PubMed  Google Scholar 

  40. Baba, M. et al. Further increase in thermostability of Moloney murine leukemia virus reverse transcriptase by mutational combination. Protein Eng. Des. Sel. 30, 551–557. https://doi.org/10.1093/protein/gzx046 (2017).

    Article  CAS  PubMed  Google Scholar 

  41. Martínez, D. et al. Charge engineering of the nucleic acid binding cleft of a thermostable HIV-1 reverse transcriptase reveals key interactions and a novel mechanism of RNase H inactivation. J. Mol. Biol. 435, 168219. https://doi.org/10.1016/j.jmb.2023.168219 (2023).

    Article  CAS  Google Scholar 

  42. Sellner, L. N., Coelen, R. J. & Mackenzie, J. S. Reverse transcriptase inhibits Taq polymerase activity. Nucleic Acids Res. 20, 1487–1490. https://doi.org/10.1093/nar/20.7.1487 (1992).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Myers, T. W. & Gelfand, D. H. Reverse transcription and DNA amplification by a Thermus thermophilus DNA polymerase. Biochemistry 30, 7661–7666. https://doi.org/10.1021/bi00245a001 (1991).

    Article  CAS  PubMed  Google Scholar 

  44. Shandilya, H. et al. Thermophilic bacterial DNA polymerases with reverse-transcriptase activity. Extremophiles 8, 243–251. https://doi.org/10.1007/s00792-004-0384-5 (2004).

    Article  CAS  PubMed  Google Scholar 

  45. Vichier-Guerre, S., Ferris, S., Auberger, N., Mahiddine, K. & Jestin, J. L. A population of thermostable reverse transcriptases evolved from Thermus aquaticus DNA polymerase I by phage display. Angew Chem. Int. Ed. Engl. 45, 6133–6137. https://doi.org/10.1002/anie.200601217 (2006).

    Article  CAS  PubMed  Google Scholar 

  46. Schönbrunner, N. J. et al. Chimeric thermostable DNA polymerases with reverse transcriptase and attenuated 3’-5’ exonuclease activity. Biochemistry 45, 12786–12795. https://doi.org/10.1021/bi0609117 (2006).

    Article  CAS  PubMed  Google Scholar 

  47. Ong, J. L., Loakes, D., Jaroslawski, S., Too, K. & Holliger, P. Directed evolution of DNA polymerase, RNA polymerase and reverse transcriptase activity in a single polypeptide. J. Mol. Biol. 361, 537–550. https://doi.org/10.1016/j.jmb.2006.06.050 (2006).

    Article  CAS  PubMed  Google Scholar 

  48. Jozwiakowski, S. K. & Connolly, B. A. A modified family-B archaeal DNA polymerase with reverse transcriptase activity. Chembiochem 12, 35–37. https://doi.org/10.1002/cbic.201000640 (2011).

    Article  CAS  PubMed  Google Scholar 

  49. Bhadra, S., Maranhao, A. C., Paik, I. & Ellington, A. D. One-Enzyme reverse transcription qPCR using Taq DNA polymerase. Biochemistry 59, 4638–4645. https://doi.org/10.1021/acs.biochem.0c00778 (2020).

    Article  CAS  PubMed  Google Scholar 

  50. Jones, M. D. & Foulkes, N. S. Reverse transcription of mRNA by Thermus aquaticus DNA polymerase. Nucleic Acids Res. 17, 8387–8388. https://doi.org/10.1093/nar/17.20.8387 (1989).

    Article  ADS  CAS  PubMed  PubMed Central  Google Scholar 

  51. Tse, W. T. & Forget, B. G. Reverse transcription and direct amplification of cellular RNA transcripts by Taq polymerase. Gene 88, 293–296. https://doi.org/10.1016/0378-1119(90)90047-u (1990).

    Article  CAS  PubMed  Google Scholar 

  52. Cadwell, R. C. & Joyce, G. F. Randomization of genes by PCR mutagenesis. PCR Methods Appl. 2, 28–33. https://doi.org/10.1101/gr.2.1.28 (1992).

    Article  CAS  PubMed  Google Scholar 

  53. Grabko, V. I., Chistyakova, L. G., Lyapustin, V. N., Korobko, V. G. & Miroshnikov, A. I. Reverse transcription, amplification and sequencing of poliovirus RNA by Taq DNA polymerase. FEBS Lett. 387, 189–192. https://doi.org/10.1016/0014-5793(96)00491-7 (1996).

    Article  CAS  PubMed  Google Scholar 

  54. Moser, M. J. et al. Thermostable DNA polymerase from a viral metagenome is a potent RT-PCR enzyme. PLoS One. 7, e38371. https://doi.org/10.1371/journal.pone.0038371 (2012).

    Article  ADS  CAS  PubMed  PubMed Central  Google Scholar 

  55. Sauter, K. B. & Marx, A. Evolving thermostable reverse transcriptase activity in a DNA polymerase scaffold. Angew Chem. Int. Ed. Engl. 45, 7633–7635. https://doi.org/10.1002/anie.200602772 (2006).

    Article  CAS  PubMed  Google Scholar 

  56. Kranaster, R. et al. One-step RNA pathogen detection with reverse transcriptase activity of a mutated thermostable Thermus aquaticus DNA polymerase. Biotechnol. J. 5, 224–231. https://doi.org/10.1002/biot.200900200 (2010).

    Article  CAS  PubMed  Google Scholar 

  57. Blatter, N. et al. Structure and function of an RNA-reading thermostable DNA polymerase. Angew. Chem. Int. Ed. Engl. 52, 11935–11939. https://doi.org/10.1002/anie.201306655 (2013).

    Article  CAS  PubMed  Google Scholar 

  58. Shi, C., Shen, X., Niu, S. & Ma, C. Innate reverse transcriptase activity of DNA polymerase for isothermal RNA direct detection. J. Am. Chem. Soc. 137, 13804–13806. https://doi.org/10.1021/jacs.5b08144 (2015).

    Article  CAS  PubMed  Google Scholar 

  59. Ellefson, J. W. et al. Synthetic evolutionary origin of a proofreading reverse transcriptase. Science 352, 1590–1593. https://doi.org/10.1126/science.aaf5409 (2016).

    Article  ADS  CAS  PubMed  Google Scholar 

  60. Heller, R. C. et al. Engineering of a thermostable viral polymerase using metagenome-derived diversity for highly sensitive and specific RT-PCR. Nucleic Acids Res. 47, 3619–3630. https://doi.org/10.1093/nar/gkz104 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  61. Raghunathan, G. & Marx, A. Identification of Thermus aquaticus DNA polymerase variants with increased mismatch discrimination and reverse transcriptase activity from a smart enzyme mutant library. Sci. Rep. 9, 590. https://doi.org/10.1038/s41598-018-37233-y (2019).

    Article  ADS  CAS  PubMed  PubMed Central  Google Scholar 

  62. Chien, A., Edgar, D. B. & Trela, J. M. Deoxyribonucleic acid polymerase from the extreme thermophile Thermus aquaticus. J. Bacteriol. 127, 1550–1557. https://doi.org/10.1128/jb.127.3.1550-1557.1976 (1976).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  63. Innis, M. A., Myambo, K. B., Gelfand, D. H. & Brow, M. A. DNA sequencing with Thermus aquaticus DNA polymerase and direct sequencing of polymerase chain reaction-amplified DNA. Proc. Natl. Acad. Sci. U S A. 85, 9436–9440. https://doi.org/10.1073/pnas.85.24.9436 (1988).

    Article  ADS  CAS  PubMed  PubMed Central  Google Scholar 

  64. Barnes, W. M. The fidelity of Taq polymerase catalyzing PCR is improved by an N-terminal deletion. Gene 112, 29–35. https://doi.org/10.1016/0378-1119(92)90299-5 (1992).

    Article  CAS  PubMed  Google Scholar 

  65. Gloeckner, C., Sauter, K. B. & Marx, A. Evolving a thermostable DNA polymerase that amplifies from highly damaged templates. Angew Chem. Int. Ed. Engl. 46, 3115–3117. https://doi.org/10.1002/anie.200603987 (2007).

    Article  CAS  PubMed  Google Scholar 

  66. Merkens, L. S., Bryan, S. K. & Moses, R. E. Inactivation of the 5’-3’ exonuclease of Thermus aquaticus DNA polymerase. Biochim. Biophys. Acta. 1264, 243–248. https://doi.org/10.1016/0167-4781(95)00153-8 (1995).

    Article  PubMed  Google Scholar 

  67. Bosley, A. D. & Ostermeier, M. Mathematical expressions useful in the construction, description and evaluation of protein libraries. Biomol. Eng. 22, 57–61. https://doi.org/10.1016/j.bioeng.2004.11.002 (2005).

    Article  CAS  PubMed  Google Scholar 

  68. in EMERGENCY USE AUTHORIZATION (EUA) SUMMARY Global Direct RT-PCR Test (2023). https://www.fda.gov/media/167870/download.

  69. Kermekchiev, M. B., Tzekov, A. & Barnes, W. M. Cold-sensitive mutants of Taq DNA polymerase provide a hot start for PCR. Nucleic Acids Res. 31, 6139–6147. https://doi.org/10.1093/nar/gkg813 (2003).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  70. Dang, C. & Jayasena, S. D. Oligonucleotide inhibitors of Taq DNA polymerase facilitate detection of low copy number targets by PCR. J. Mol. Biol. 264, 268–278. https://doi.org/10.1006/jmbi.1996.0640 (1996).

    Article  CAS  PubMed  Google Scholar 

  71. Eom, S. H., Wang, J. & Steitz, T. A. Structure of Taq polymerase with DNA at the polymerase active site. Nature 382, 278–281. https://doi.org/10.1038/382278a0 (1996).

    Article  ADS  CAS  PubMed  Google Scholar 

  72. Arezi, B. et al. Compartmentalized self-replication under fast PCR cycling conditions yields Taq DNA polymerase mutants with increased DNA-binding affinity and blood resistance. Front. Microbiol. 5, 408. https://doi.org/10.3389/fmicb.2014.00408 (2014).

    Article  PubMed  PubMed Central  Google Scholar 

  73. Patel, P. H., Kawate, H., Adman, E., Ashbach, M. & Loeb, L. A. A single highly mutable catalytic site amino acid is critical for DNA polymerase fidelity. J. Biol. Chem. 276, 5044–5051. https://doi.org/10.1074/jbc.M008701200 (2001).

    Article  CAS  PubMed  Google Scholar 

  74. Buermans, H. P. & den Dunnen, J. T. Next generation sequencing technology: advances and applications. Biochim. Biophys. Acta. 1842, 1932–1941. https://doi.org/10.1016/j.bbadis.2014.06.015 (2014).

    Article  CAS  PubMed  Google Scholar 

  75. Potapov, V. & Ong, J. L. Examining sources of error in PCR by Single-Molecule sequencing. PLoS One. 12, e0169774. https://doi.org/10.1371/journal.pone.0169774 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

L. B. H gratefully acknowledge support by the Konstanz Research School Chemical Biology. We thank Prof. Dr. Yuri Motorin for his help with the NGS measurements.

Funding

Open Access funding enabled and organized by Projekt DEAL. This research was funded by the Deutsche Forschungsgemeinschaft in part by the priority program SPP 1784 “Chemical Biology of native Nucleic Acid Modifications” (MA 2288/17 − 1, 17 − 2) and grant MA 2288/22 − 1.

Author information

Authors and Affiliations

  1. Department of Chemistry, Konstanz Research School Chemical Biology, University of Konstanz, Universitätsstraße 10, 78464, Konstanz, Germany

    Luisa B. Huber & Andreas Marx

  2. SMP IBSLor, Epitranscriptomics and RNA Sequencing Core Facility, Université de Lorraine, Nancy, 54000, France

    Virginie Marchand

  3. Department of Chemistry, University of Konstanz, Universitätsstraße 10, 78464, Konstanz, Germany

    Melike Somtürk

  4. Department of Biology, Screening Centre, University of Konstanz, Universitätsstraße 10, 78464, Konstanz, Germany

    Silke Müller

Authors

  1. Luisa B. Huber
  2. Virginie Marchand
  3. Melike Somtürk
  4. Silke Müller
  5. Andreas Marx

Contributions

A. M. and L. B. H. conceived and designed the experiments. L. B. H. performed, analysed, optimized and visualized the experiments. V. M. conducted the NGS measurements. M. S. cloned and purified the Taq pol V2 IL. S. M. supervised and programmed the pipetting robot ASSIST plus from Integra. A. M. and L. B. H. prepared the manuscript. All authors reviewed the manuscript.

Corresponding author

Correspondence to Andreas Marx.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Huber, L.B., Marchand, V., Somtürk, M. et al. Engineering of novel DNA polymerase variants for single enzyme quantitative multiplex reverse transcription-PCR. Sci Rep 15, 26147 (2025). https://doi.org/10.1038/s41598-025-10211-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1038/s41598-025-10211-x