Prokaryotic and eukaryotic DNA helicases: Essential molecular motor proteins for cellular machinery (original) (raw)

Abstract

DNA helicases are ubiquitous molecular motor proteins which harness the chemical free energy of ATP hydrolysis to catalyze the unwinding of energetically stable duplex DNA, and thus play important roles in nearly all aspects of nucleic acid metabolism, including replication, repair, recombination, and transcription. They break the hydrogen bonds between the duplex helix and move unidirectionally along the bound strand. All helicases are also translocases and DNA‐dependent ATPases. Most contain conserved helicase motifs that act as an engine to power DNA unwinding. All DNA helicases share some common properties, including nucleic acid binding, NTP binding and hydrolysis, and unwinding of duplex DNA in the 3′ to 5′ or 5′ to 3′ direction. The minichromosome maintenance (Mcm) protein complex (Mcm4/6/7) provides a DNA‐unwinding function at the origin of replication in all eukaryotes and may act as a licensing factor for DNA replication. The RecQ family of helicases is highly conserved from bacteria to humans and is required for the maintenance of genome integrity. They have also been implicated in a variety of human genetic disorders. Since the discovery of the first DNA helicase in Escherichia coli in 1976, and the first eukaryotic one in the lily in 1978, a large number of these enzymes have been isolated from both prokaryotic and eukaryotic systems, and the number is still growing. In this review we cover the historical background of DNA helicases, helicase assays, biochemical properties, prokaryotic and eukaryotic DNA helicases including Mcm proteins and the RecQ family of helicases. The properties of most of the known DNA helicases from prokaryotic and eukaryotic systems, including viruses and bacteriophages, are summarized in tables.

Keywords: DNA helicase, helicase assay, recombination, repair, replication


Abbreviations

Cdc

cell division cycle

CDK

cyclin‐dependent kinase

DDK

Cdc7–dumb bell former 4 (Cdc7–Dbf4)

Mcm

minichromosome maintenance

PCNA

proliferating cell nuclear antigen

PDH

pea DNA helicase

RPA

replication protein A

SV40

simian virus 40

Genetic information is locked in the duplex DNA of the genome. To access this information for many important biological processes, the duplex DNA has to be transiently unwound. For this purpose, a diverse class of enzymes has evolved, known as ‘DNA helicases’. They catalyze the unwinding of duplex DNA and thus play an essential role in many cellular processes, including DNA replication, repair, recombination, and transcription [1, 2, 3]. They are also thought to be motor proteins translocating along DNA using nucleoside triphosphate hydrolysis as the source of energy [4]. Multiple DNA helicases have been isolated from different organisms [1, 2, 3, 5]. Most contain short conserved amino‐acid fingerprints called helicase motifs. They are named Q, Ia, Ib, II, III, IV, V and VI. The helicases of this family are also called DEAD/H helicases [6, 7, 8]. This review focuses on the general aspects of DNA helicases including historical background, biochemical assays and biochemical properties. In addition, we have also covered the minichromosome maintenance (Mcm) proteins and RecQ family of helicases and summarized the characteristics of most of the known DNA helicases from prokaryotic and eukaryotic systems.

Historical perspective

DNA helicase was first discovered in E. coli in 1976 and classified as a ‘DNA unwinding enzyme’[9]. In 1978 the existence of the first eukaryotic DNA helicase was reported in the lily [10]. Since then, several different DNA helicases have been isolated from many organisms from both prokaryotic and eukaryotic systems. Recently, a new helicase motif, named the ‘Q motif’, has been identified in the DEAD‐box family of helicases [8]. The major milestones in the discovery of DNA helicases are summarized in Table 1.

Table 1.

Historical background of DNA helicases.

Year Discovery
1953–76 Prehelicase years
1953 Duplex structure of DNA was solved [55]
1967 Rep protein was the first helicase in E. coli to be identified by genetic criteria [56]
1976 Hoffman‐Berling isolated first DNA helicase (helicase I, a traI gene product) from _E. coli_[9]
1978 Existence of the first eukaryotic DNA helicase was reported from the lily [10]
1979 E. coli Rep protein was shown to contain DNA helicase activity [57]
1982 First bacteriophage protein reported as DNA helicase was T4 gene 41protein [11]
1982–83 A direct biochemical assay (strand displacement assay) for measuring helicase activity was developed [11, 12]
1985 First mammalian DNA helicase was reported from calf thymus [58]
1986 First viral encoded protein reported as DNA helicase was SV40 large tumor antigen [59]
First yeast protein reported as DNA helicase was ATPase III [60]
1988 Hodgeman and Gorbalenya discovered helicase motifs (seven conserved amino acid domains) [9, 61]
1989 Two helicase superfamilies (SF1 and SF2) were reported [62]
DEAD box helicase family was identified [7]
1990 First human DNA helicase (HDH) reported in the purified form [63]
E. coli RecQ protein was reported as DNA helicase [43]
1992 First mitochondrial DNA helicase was isolated from bovine brain [64]
1996 First chloroplast DNA helicase was reported in the purified form from the pea [65]
Crystallization of First DNA helicase (PcrA from thermophilic bacterium) was reported [66]
1997 Mcm4/6/7 protein complex from HeLa cells was reported as a DNA helicase [37]
Werner syndrome protein (WRN) was reported as a DNA helicase [67]
Bloom's syndrome (BLM) gene product was reported as a DNA helicase [68]
1998 Sgs1 (slow growth suppressor), a RecQ helicase from yeast, was reported as a DNA helicase [69]
2000 First plant DNA helicase gene (PDH45) encoding the biochemically active enzyme was cloned [30]
First eIF‐4A (from pea) was reported as a DNA helicase [30]
2002 First biochemically active malarial parasite DNA helicase was reported [34]
2003 A new helicase motif (Q motif) was identified in DEAD box helicases [8]
Presence of Helitron insertion, a DNA helicase bearing transposable element, reported from maize genome [70]

Biochemical assay

The most widely accepted and direct assay for measuring helicase activity in vitro was developed almost simultaneously by the Nossal and Richardson groups [11, 12]. Helicase partially unwinds duplex DNA substrate (32P‐labeled oligonucleotide annealed to a longer ssDNA molecule) yielding two ssDNA molecules of different sizes which are resolved from the starting duplex by electrophoresis followed by autoradiography (Fig. 1). The radioactive label in the DNA permits direct visualization (Fig. 1B) and quantitation of the results. Several other assays have also been developed but are not commonly used: for example, a rapid quench‐flow method [13], fluorescence‐based assays [14], filtration assay [15], a scintillation proximity assay [16], a time resolved fluorescence resonance energy transfer assay [17], an assay based on flashplate technology [18], homogeneous time‐resolved fluorescence quenching assay and electrochemiluminescence‐based helicase assay [19]. Xu et al. [20] recently developed a new method for simultaneous monitoring of DNA binding and helicase‐catalyzed DNA unwinding by fluorescence polarization.

Figure 1.

Figure 1

Scheme of biochemical assay for measuring unwinding activity of ATP/Mg2+‐dependent DNA helicase (A) and autoradiogram of the gel (B). (A) Asterisks denote the 32P‐labeled end of the DNA. The partial duplex DNA helicase substrate was prepared by annealing the radiolabeled DNA oligo to M13 ssDNA (circular) as described previously [65]. (B) Lane 1, reaction without enzyme; lane 2, heat‐denatured substrate; lane 3, reaction in presence of DNA helicase enzyme. S, Substrate; UD, unwound DNA.

Biochemical properties

The basic unwinding reaction catalysed by all of the known DNA helicases is similar. They all share the following common biochemical properties: nucleic acid binding; NTP binding; nucleic acid‐stimulated hydrolysis of NTP; NTP/dNTP hydrolysis‐dependent unwinding of duplex nucleic acids with specific polarity.

Binding to nucleic acid

Helicases usually need ssDNA as a loading zone where they bind in a sequence‐independent manner and translocate unidirectionally. In general, they bind to ssDNA with higher affinity than to dsDNA. However, RecBCD, simian virus 40 (SV40) large antigen, and RuvB helicases preferentially bind to dsDNA. Many helicases need a replication fork‐like structure on the substrate for optimum unwinding. Some DNA helicases such as RecBCD, UvrD, Rep and RecQ from E. coli and SV40 large T antigen can also initiate unwinding from the ends of blunt‐ended duplex DNA [2]. RecG, RuvA and RuvB helicases of E. coli specifically recognize Holliday junctions [21, 22]. Usually oligomeric helicases carry multiple ssDNA‐binding sites, although all the sites may not simultaneously bind to the DNA. For example, E. coli DnaB [23] and bacteriophage T7gp4 [24] DNA helicases contain six DNA‐binding sites even though only one or two subunits interact with ssDNA at a time.

NTP binding and nucleic acid‐stimulated hydrolysis of NTP

All DNA helicases bind NTP and exhibit nucleic acid‐dependent intrinsic NTPase activity necessary for duplex unwinding. In general, replicative hexameric helicases bind ssDNA tightly in the presence of NTP, and weakly in the presence of NDP. This binding results in a conformational change in homo‐oligomeric helicases and thereby affects the assembly state of the protein. NTP does not bind with equal affinity to all the sites in hexameric helicases. In general, the DNA helicases are poor NTPases in the absence of ssDNA and their NTPase activities are stimulated by the presence of ssDNA.

Polarity of DNA unwinding

DNA helicases exhibit specific polarity, which is defined as the direction of DNA helicase movement on initially bound ssDNA template (i.e. 3′ to 5′ or 5′ to 3′) with respect to the polarity of the sugar phosphate backbone. The polarity of unwinding by DNA helicase can usually be determined by using a DNA substrate consisting of a linear ssDNA template with duplex regions near the end(s). Possible structures of substrates that can be used for directional study in vitro are shown in Fig. 2. For helicases involved in DNA replication, the polarity of the reaction is strongly indicative of helicase placement on the leading (3′ to 5′ polarity) or lagging (5′ to 3′ polarity) strands. It has been reported that RecBCD contains bipolar enzyme activitiy, where RecB and RecD components of the complex unwind DNA from 3′ to 5′ and 5′ to 3′ directions, respectively [25]. Interestingly, PcrA helicase from Bacillus anthracis showed robust 3′ to 5′ as well as 5′ to 3′ helicase activities, with substrates containing a duplex region and a 3′ or 5′ ss poly(dT) tail [26]. Recently, Constantinesco et al. [27] have shown that the HerA DNA helicases from thermophilic archaea is able to utilize either 3′ or 5′ ssDNA extensions for loading and subsequent DNA duplex unwinding.

Figure 2.

Figure 2

Structures of the linear partial duplex substrates commonly used to determine the direction of translocation of the helicase. The 3′ to 5′ directional substrates are on the left and 5′ to 3′ directional substrates are on the right. Asterisks denote the 32P‐labeled end.

Monomeric and oligomeric nature of helicases

The fact that many helicases function as hexamers, as well as the necessity for multiple DNA‐binding sites, has led to the suggestion that oligomerization may be necessary for helicase function. The property of an oligomeric helicase is that it possesses multiple DNA‐binding sites, a feature that is required for any ‘active’ mechanism of DNA unwinding, because it enables a helicase to bind both ss and duplex DNA or two strands of ssDNA simultaneously at an unwinding fork. On the basis of active assembly of DNA helicases, they can be grouped as monomeric or multimeric helicases. Figure 3 shows the interaction of different forms of DNA helicases with a DNA‐unwinding fork. In all these models, at least one subunit always binds to the ssDNA track along which it moves [28]. The other subunit may bind to the dsDNA. On the other hand the monomeric helicases may contain two different domains, one for ssDNA and the other for dsDNA binding. The bacterial helicases II, IV and PriA ([1] and references cited therein), bacteriophage T4 Dda helicase [29], human DNA helicase IV, V and VI [5], and pea DNA helicase 45 (PDH45) and PDH65 [30, 31] are the few examples of monomeric forms. It has been shown that UvrD from E. coli is active as a monomer [32].

Figure 3.

Figure 3

Interaction of monomeric or oligomeric DNA helicases with the DNA forked substrate. (A) Monomeric helicase binds to both ssDNA and dsDNA. (B) In homodimeric helicases, one subunit always binds to the ssDNA track along which it moves. (C) Heterodimeric helicase contains two separate domains: one subunit binds/interacts with dsDNA and anchors the helicase to the DNA lattice and the other subunit interacts with ssDNA and translocates along it. (D) Hexameric or oligomeric helicases contain a ring‐like structure that enables the proteins to encircle the DNA and thus prevent local reannealing. In this case one or more subunits bind to ssDNA at the ss/dsDNA junction.

The functionally active forms of many DNA helicases are oligomeric: E. coli DnaB, RuvB, RecBCD and Rho proteins, phage T7 gene 4 helicase/primase, phage T4 gene 41 helicase, SV40 large T antigen. All can assemble to form ring‐like toroidal hexamers [1, 2, 22, 23, 25, 28]. Electron microscopy studies have confirmed that the ssDNA transverses in the center of the hexameric ring. These results show that the hexameric G40P DNA helicase encircles the 5′ tail, interacts with the duplex DNA at the ss/dsDNA junction, and excludes the 3′ tail of the forked DNA [33]. Oligomerization of many DNA helicases can be modulated by interactions with other ligands. For example, the E. coli Rep helicase is a monomer up to concentrations of 12 mm in the absence of DNA but forms a dimer on binding to DNA [1].

Prokaryotic and eukaryotic DNA helicases

DNA helicases have been isolated from many sources and accordingly named as prokaryotic, eukaryotic, bacteriophage, and viral helicases. More than one helicase is present in each system because of a variety of different needs for the duplex DNA to unwind in different DNA metabolisms. For example, at least 14 different DNA helicases have been isolated from a simple single cell organism such as E. coli(Table 2), six from bacteriophages (Table 3), 12 from viruses (Table 4), 15 from yeast (Table 5), eight from plants (Table 6), 11 from calf thymus (Table 7), and as many as 25 from human cells (Table 8). The properties of these DNA helicases are summarized in their respective tables. The first malarial parasite helicase protein has recently been purified which contains ATP/Mg2+‐dependent DNA unwinding and ssDNA‐dependent ATPase activities [34] and prefers to unwind DNA containing replicative fork‐like structures [35].

Table 2.

E. coli DNA helicases.

S. No. Name of helicase Mol. mass (kDa) Gene Polarity Remarks
1. DnaB protein a 52 dnaB 5′−3′ Replicative helicase. Moves on lagging strand of replication fork.
2. PriA proein b 81.7 priA 3′−5′ Replicative helicase. Formerly called n′‐protein. Binds to ssDNA at primosome assembly sites.
3. Rep protein c 72.8 rep 3′−5′ Replicative helicase, unwinds the phage DNA in a highly processive and catalytic manner.
4. UvrAB 103 uvrA 5′−3′ Repair helicase. Involved in nucleotide excision repair.
complex d 76 uvrB UvrB is helicase component
5. Helicase II (UvrD) e 82 uvrD 3′−5′ Repair helicase. Involved in nucleotide excision repair.
6. Helicase IV f 78 helD 3′−5′ Originally called as 75 kDa helicase. Helicase activity stimulated by SSB.
7. RecQ g 80 recQ 3′−5′ Recombination helicase.
8. RecBCD 13.4 recB 3′−5′ Catalyzes the first step in the recombinational repair of dsDNA breaks.
complex h 129 recC Highly processive helicase with bipolar polarity.
(exo V) 66 recD 5′−3′
9. RuvAB i 22 37 ruvA ruvB 5′−3′ Recombination helicase. It is an ATP‐driven translocase (pump) that promotes branch migration.
10. Helicase I j 192 tral 5′−3′ First helicase identified. May be involved in site‐specific nicking reaction.
11. RecG k 76 recG 3′−5′ A junction‐specific DNA helicase that acts postsynap‐tically to drive branch migration of holliday junction.
12. Rho l 46 rho 5′−3′ RNA·DNA helicase, can also unwind RNA·RNA but not DNA·DNA.
13. Helicase III m 20 ? 5′−3′ Smallest prokaryotic helicase. SSB protein inhibits the ATPase activity of the protein.
14. DinG n ? DinG 5′−3′ DNA damage inducible helicase. It is a monomer in solution.

Table 3.

Bacteriophage DNA helicases. nd, Not determined; PNA, peptide nucleic acid.

S. No. Name of helicase Mol. mass (kDa) Nucleotide cofactor Polarity Remarks
1. T4 gene 41 a 58 dGTP > ATP = dATP > GTP 5′−3′ Essential for both the priming and helicase activities. Stimulated by T4 gene 59 protein and forked 3′ tail substrate.
2. T4 dda b 56 ATP, dATP 5′−3′ Can also unwind DNA‐PNA substrate. Inhibited by T4 gene 32 protein involved in the DNA replication. Oligomerization of dda is not required for DNA unwinding.
3. T4 UvsW c 65 ATP Nd Catalyzes branch migration and is involved in recombination, repair and the regulation of DNA replication origin.
4. T7 gene 4 d 56 and 63 ATP, dATP dGTP,dTTP 5′−3′ 56 kDa protein contains DNA helicase activity, while 63 kDa (with 63 amino acids more) contains both the helicase and primase activities.
5. P4 gene α e 84.9 ATP, dATP GTP, dGTP CTP, dCTP 3′−5′ Stimulated by forked substrate, contains pri‐mase and sequence specific (5′‐TGTTCAC C −3′) binding activity of ori and crr DNA.
6. G40P f 300 ATP, GTP, CTP, UTP 5′−3′ Essential for B. subtilis bacteriophage SPP1 replication.

Table 4.

Viral DNA helicases. AAV, Adeno‐associated virus; ACNP, Autographa californica nuclear polyhedrosis; BPV, bovine papilloma virus; HSV, herpes simplex virus; MVM NS1, minute virus of mice–nonstructural protein; nd, not determined; OBP, origin binding protein; SV40 T‐antigen, Simian virus 40 large T antigen; SARS‐CoV, severe acute respiratory syndrome coronavirus (Coronoviridae helicase).

S. No. Name of helicase Mol. mass (kDa) Nucleotide cofactors Polarity Remarks
1 SV40 T‐antigen a 94 ATP > dATP > dTTP = UTP 3′−5′ Interacts with DNA pol. α; essential for DNA replication; contains both DNA and RNA helicase activities.
2. Polyoma T‐antigen b 100 ATP = dATP > CTP = UTP 3′−5′ Contains Polyoma ori binding and unwinding activities.
3. HSV‐1. UL5/UL8/UL52 Complex c 120 97 70 ATP > GTP > CTP = UTP 5′−3′ UL5 and UL52 required for helicase‐primase activity.
4. HSV‐1, UL9 protein d 68 ATP = dATP > CTP > dCTP 3′−5′ OBP involved during the initiation of HSV replication.
5. BPV‐1, E1 protein e 68 ATP, dATP, CTP, dCTP, UTP, dTTP, GTP, dGTP 3′−5′ OBP, which is stimulated by E2 protein of BPV‐1.
6. AAV, Rep68, Rep78 f 68 71 ATP > CTP > dATP > GTP > UTP 3′−5′ Contains site‐and strand‐specific endonuclease activity.
7. AAV‐Rep52 g 52 ATP, dATP, CTP, dCTP, UTP, dTTP, GTP, dGTP 3′−5′ Lysine to histidine substitution within motif I was deficient for both DNA helicase and ATPase activities.
8. AAV‐Rep40 h 40 ATP 3′−5′ Lysine to histidine mutation in the purine nucleotide‐binding site results in a protein that inhibits helicase activity.
9. MVM NS‐1 i 83 ATP > dATP nd Appears to have site‐specific endonuclease activity.
10. ACNP virus P143 j 143 ATP nd Stimulated by LEF3/SSB; essential for virus DNA replication.
11. Vaccinia virus A18R k 57.5 ATP 3′−5′ It is a DExH box protein and is involved in transcription.
12 SARS‐CoV helicase l 70 ATP, dATP, CTP > all others 5′−3′ Attractive target for anti‐SARS therapy.

Table 5.

Yeast DNA helicases. nd, Not determined.

S. No. Name of helicase Source Mol. mass (kDa) Nucleotide cofactors Polarity Remarks
1. ATPase III a S. cerevisiae 63 ATP > dATP 3′−5′ Stimulated by yeast Pol I.
2. Rad3 b (XPD) S. cerevisiae 89 ATP > dATP ≫ CTP 5′−3′ Active at acidic pH; involved in DNA excision repair; homologous to XPD gene.
3. Rad25 c (SSl2) S. cerevisiae 95 ATP, dATP 3′−5′ Functions in nucleotide excision repair; homo logous to XPB; required for Pol II transcription.
4. Srs2d S. cerevisiae 134 ATP, dATP 3′−5′ Involved in error‐prone repair; negatively modulates recombination.
5. PIF1 e S. cerevisiae 97 ATP, dATP 5′−3′ Functions in mitochondrial DNA repair and recombination.
6. DNA helicase A f S. cerevisiae 90 ATP, dATP 5′−3′ Copurifies with DNA Pol‐α‐ primase; helicase activity stimulated by the yeast RPA.
7. DNA helicase B g S. cerevisiae 127 ATP, dATP > CTPdCTP, UTP 5′−3′ Copurifies with DNA Pol‐δ; stimulated by scRPA; encoded by the yORF61 gene
8. DNA helicase Ch S. cerevisiae 32 60 ATP, dATP CTP,dCTP > UTP,GTP > dGTP 5′−3′ Copurifies with DNA Pol δ?
9. DNA helicase D i S. cerevisiae 60 ATP,dATP > CTP,dCTP UTP 5′−3′ Copurifies with RF‐C.
10. ScHel I j S. cerevisiae 135 ATP, dATP 5′−3′ Stimulated by E. coli SSB.
11. DNA helicase III k S. cerevisiae 120 ATP, dATP 5′−3′ Encoded by a gene different from Rad3 and RadH.
12. Sgs1 l S. cerevisiae ATP, dATP 3′−5′ Binds more tightly to a forked DNA substrate than to ss and ds DNA.
13. Dna2 m S. pombe 65 ATP, dATP 5′−3′ Involved in DNA replication.
14. MER3 n S. cerevisiae 130 ATP 3′−5′ Meiosis‐specific helicase: required for crossing over at time of first meiotic division.
15. Hmi1p helicase o S. cerevisiae 80 ATP nd Mitochondrial helicase; required for the maintenance of mitochondrial genome.

Table 6.

Biochemically active DNA helicases from plant cells. CDH, Chloroplast DNA helicase; DSBs, double‐strand breaks; HDH, human DNA helicase; nd, not determined; PDH, pea DNA helicase.

S. No. Name of helicase Mol. mass (kDa) Nucleotide cofactors Polarity Remarks
1. Lily U‐protein a 130 (native) ATP nd Partially purified. Unwinds DNA from ends, gaps and nicks. Its abundance increases during meiosis.
2. Soybean chloroplast DNA helicase b nd ATP ≫ GTP > dATP > NTPs nd Partially purified. Shows unwinding activity at higher conc. of Mg2+ (10 mm).
3. Pea chloroplast DNA helicase I (CDH I) c 68 138 (native) ATP > dATP ≫ NTPs 3′−5′ First plant helicase reported in purified form. It is a homodimer.
4. Pea chloroplast DNA helicase II (CDH II) d 78 ATP = dATP 3′−5′ Purified to homogeneity. Stimulated by fork‐like structures.
5. Pea nuclear DNA helicase I (PDH45) e 45 ATP > dATP ≫ NTPs 3′−5′ First plant nuclear DNA helicase; homologous to eIF‐4 A; stimulates topo I activity.
6. Pea nuclear DNA helicase II (PDH65) f 65 ATP > dATP 3′−5′ A plant homolog of HDHI. localized in nucleo lus. Phosphorylated and upregulated by CK2 and cdc2 protein kinases.
7. Arabidopsis Ku DNA helicase (AtKu70/80) g 70.3 76.7 ATP nd Functions as heterodimer that binds to dsDNA. Gene expression is induced by DSBs, may be involved in DSB repair.
8. Pea nuclear DNA helicase III (PDH120) h 54 66 120 (native) ATP > NTPs 3′−5′ Purified to homogeneity. Present at extremely low abundance and contain the high specific activity.

Table 7.

Calf thymus/bovine DNA helicases.

S. No. Name of helicase Mol. mass (kDa) Nucleotide cofactors Polarity Remarks
1. DNA helicase A a 47 ATP = dATP > CTP > dCTP 3′−5′ Copurifies with calf thymus DNA Pol‐α/?Primase; stimulated 20‐fold by RPA.
2. DNA helicase B b 100 dATP = ATP ≫ all other 5′−3′ Binds to dsDNA also.
3. DNA helicase C c 40 dATP = ATP ≫ all other 5′−3′ Stimulated by 100 mm KCl on short substrates.
4. DNA helicase D d 100 45 dATP = ATP 5′−3′ Stimulated 10‐fold by RPA and forms large aggregates in low salt.
5. DNA helicase E e 104 dATP = ATP 3′−5′ Copurifies with DNA Pol‐ε and dependent on unspecific SSB; probably involved in DNA repair.
6. DNA helicase F f 72 ATP, dATP, dCTP, UTP, CTP, GTP, dGTP, dTTP 5′−3′ Copurifies with RPA. In presence of RPA, this helicase can unwind longer duplexes
7. DNA helicase I g 200 ATP = dATP 3′−5′ Stimulated by 150 mm NaCl.
8. DNA helicase II h 130 ATP = dATP ≫ 1 other 3′−5′ Can unwind dsRNA also.
9. δ helicase i 57 ATP = dATP > CTP = UTP 5′−3′ Copurifies with DNA Pol. δ and acts as strand‐ displacement factor for Pol.δ
10. Cytosolic DNA helicase j 110 ATP = dATP > CTP, dCTP 3′−5′ All the three subunits bind to ATP.
65
34
11. Bovine mitochondrial helicase k 57 ATP = dATP 3′−5′ Possible role in mitochondrial DNA replication.

Table 8.

Human DNA helicases. nd, Not determined.

S. No. Name of helicase Mol. mass (kDa) Nucleotide cofactors Polarity Remarks
1. HDH I a 65 ATP, dATP 3′−5′ Can also unwind DNA/RNA and RNA/RNA hybrids; may be involved in rDNA transcription.
2. HDH II/ 87 ATP, dATP 3′−5′ Functions in dsDNA break repair and V(D)J recombination; regulator of DNA‐dependent
Ku b 72 protein kinase
3. HDH III c 46 ATP, dATP 3′−5′ Prefers replication fork‐like structure of substrates
4. HDH IV d 100 ATP, dATP 5′−3′ Can unwind DNA/RNA hybrids
5. HDH V e 92 ATP, dATP 3′−5′ Has highest turnover rate
6. HDH VI f 128 ATP, dATP 3′−5′ Prefers replication fork‐like structure of substrates
7. HDH VII g 36 nd nd Trimer of one molecule of hnRNP A1 and two molecules of annexin II
8. HDH VIII h (G3GP) 68 ATP 5′−3′ A DNA and RNA helicase corresponding to G3 bp protein, an element of the RAS transduction pathway, similar to E. coli RecBCD
9. HDH IX g (RNP DO) 45 nd nd A Gly‐Arg rich protein identified as ribonuclear protein DO
10. XPD/ERCC2 i 87 ATP, dATP 5′−3′ Functions in nucleotide excision repair; component of BTF2‐TFIIH transcription factor.
11. XPB/ ERCC3 j 89 ATP 3′−5′ Functions in nucleotide excision repair; component of BTF2‐TFIIH transcription factor.
12. Helicase ε k 72 ATP, dATP, CTP 3′−5′ Helicase activity is dependent on HRP‐A
13. Helicase µ l 110 ATP, dATP, 3′−5′ Stimulated by 5′‐tailed fork and SSB.
90 CTP, dCTP
14. RIP 100 m 100 ATP, dATP 3′−5′ Associated with RIP60; RIP60 binds to replication origin region of DHFR h
15. Helicase Q1 n 73 ATP, dATP 3′−5′ Gene homologous to E. coli RecQ gene; identical to human DNA helicase I
16. Helicase Q2 o 100 ATP 5′−3′ Identical with DNA helicase IV
17. HchlR1 p 112 ATP 5′−3′ Can unwind RNA/DNA substrates. Unlike others it can translocate along ssDNA in both directions when substrate have a very long ssDNA region.
18. HHcsA q 116 ATP, dATP 5′−3′ Hexameric protein.
19. WRN helicase r 163 ATP, dATP ≫ DCTP, CTP 3′−5′ Mutated in Werner syndrome, homologous to RecQ and contains 3′‐5′ exonuclease activity
20. BLM helicase s ≈ 160 ATP 3′−5′ Mutated in cells of Bloom's syndrome patient and belongs to RecQ family.
21. Mcm4/6/7 complex t ≈ 600 ATP, dATP 3′−5′ The DNA unwinding activity is stimulated by SSB and forked DNA structures; can function as a replication helicase.
22. HEL308 u 124.5 ATP, dATP 3′−5′ Homologous to DNA crosslink sensitivity protein Mus308 of D. melanogaster. Stimulated by RPA.
23. HFDH1 v ≈ 120 ATP 3′−5′ First F‐box protein that possesses enzyme activity.
24. Human RECQ1 w 75 ATP 3′−5′ Needs 3′ tail of 10 nt on the substrate to open the duplex; can unwinds blunt end substrate with bubble of 25 nt; stimulated by hRPA.
25. BACH1 x 130 ATP 5′−3′ A nuclear phosphoprotein interacts with tumor suppressor, BRCA1. Involved in DSB repair and contain tumor suppression activity.

Mcm proteins

Mcm proteins (named because they were discovered as the products of genes essential for minichromosome maintenance in yeast) were identified initially for their role in plasmid replication or cell cycle progression. In eukaryotes, Mcm proteins 2–7 are required for initiation and elongation steps of chromosomal DNA replication [36]. The Mcm complex is now known to serve as a replicative helicase. A heterotrimeric complex of human Mcm4/6/7 forms a dimeric structure, contains ATP‐dependent DNA helicase activity, binds to ssDNA, and possesses DNA‐dependent ATPase activity [37], whereas Mcm2/3/7 serve as regulatory subunits. In the presence of forked DNA structures and single stranded DNA binding protein (SSBP), the Mcm4/6/7 complex possesses processive DNA helicase activity [38]. The six proteins (Mcm2–7) interact with each other to form multiple complexes; however, the predominant form is a hexamer containing all six Mcms, which is relatively stable. Electron microscopy indicates that this complex has a globular structure. In vivo studies suggest that all six Mcms are recruited to replication origins during the G1 phase [36]. During G1 phase human Mcm proteins first assemble at or adjacent to bound origin recognition complex along with cell division cycle (Cdc) 6 and Cdc10‐dependent transcript 1 (Cdt1) proteins (Fig. 4) and move to other sites during genome replication [39]. These proteins together form a prereplication complex at the origin of DNA replication at the beginning of the S phase. After assembly, the complex is activated by cyclin‐dependent kinases (CDKs) and the Cdc7–dumb bell former 4 (Cdc7–Dbf4, DDK) complex in the S phase to promote the initiation of DNA replication (Fig. 4).

Figure 4.

Figure 4

Recruitment to activation of Mcm complex during initiation of DNA replication at the origin. (A) The origin is ‘marked’ by the origin recognition complex (orc). (B) Assembly of the prereplication complex (pre‐RC) begins during the G1 phase, when the ‘loading factors’ Cdc6 and Cdt1 are recruited to the replication origin to which orc (and Mcm10) bind. (C) Two Mcm complexes (ring‐shaped hexamers) load on to the origin, which is facilitated by Cdc6 and Cdt1. The Cdc7‐Dbf4 kinase (DDK) is also recruited to the origin during the G1 phase and phosphorylates the Mcm complex during the S phase. (D) The loading factors have been displaced from the DNA; the phosphorylated two Mcm complexes (enzymatically active helicase) have moved apart along the template, generating a replication ‘bubble’ by unwinding and displacing the orc. At each ‘fork’ the Cdc45 protein binds. The loading of other DNA replication factors, such as DNA polymerase α, RPA, and primase, etc., start at the time of initial DNA melting, which leads to the initiation of DNA replication.

A growing list of proteins, including Mcm10 and Cdt1, are involved in the recruitment process. Actually the two protein kinases (CDK and DDK) trigger a chain reaction that results in the phosphorylation of the Mcm complex and finally in the initiation of DNA synthesis [36, 39, 40]. Recruitment of DDK to the replication origin occurs during the G1 phase. However, it must function sometimes during the S phase, and this decision is controlled locally at individual origins. Phosphorylation of Mcm2 by DDK results in a conformational change in the Mcm complex. Recruitment of Cdc45 depends on phosphorylation of the Mcm complex and the activity of CDK. The Mcm component of the complex provides the DNA‐unwinding function (Fig. 4) to start the DNA replication, which finally requires the concerted action of many enzymes/factors such as replication protein A (RPA), polymerase α, replication factor C, polymerase δ, proliferating cell nuclear antigen (PCNA), flap endonuclease 1, endonuclease DNA2, and DNA ligase I [35, 39]. The unwinding of origin DNA is detected only at a later step when the Mcm complex is phosphorylated by DDK at the G1 to S phase transition [40]. These studies suggest an anchoring mechanism other than direct contact with DNA for the Mcm complex immediately after its recruitment to the replication origins. Recently it has been reported that the Mcm8 protein from HeLa cells, a new member of the Mcm family, forms a complex with Mcm4, Mcm6 and Mcm7 proteins and this complex is involved in initiation of DNA replication [41].

RecQ family of DNA helicases

RecQ is a recombination‐specific DNA helicase from the SF2 family. Members of the RecQ family of DNA helicases are involved in processes linked to DNA replication, DNA recombination, and gene silencing. It was discovered and named RecQ by H. Nakayama of Kyushu University, Japan (‘Q’ came from Kyushu) [42]. This gene was discovered during a search for genes that control the loss of viability in thymine‐starved bacteria, a classic phenomenon known as thymineless death. RecQ1 mutant was resistant to thymineless death, and, in a recBCsbcB background, it exhibited enhanced UV sensitivity and a deficiency in conjugational recombination, suggesting a role for RecQ in the RecF recombination pathway [42]. RecQ protein was purified to homogeneity and shown to be a helicase that unwinds duplex DNA in the 3′ to 5′ direction with respect to the single strand to which it binds [43].

The family name ‘RecQ’ is derived from the E. coli RecQ helicase. In general, unicellular organisms express a single RecQ enzyme, whereas more complex organisms express two or more. All helicases of this family share a central seven helicase motifs. RecQ DNA helicases are proposed to function at the interface between DNA replication and recombination to ‘repair’ the damaged replication forks [44]. This family includes at least five members in humans, and the following three are defective in genetic disorders associated with cancer and/or premature aging: WRN, BLM and RECQ4 helicases defective in Werner's syndrome, Bloom's syndrome, and Rothmud–Thomson syndrome, respectively [44, 45]. RecQ helicases are considered to be ‘caretaker’ tumor suppressors which suppress neoplastic transformation through control of chromosomal stability [44]. Figure 5 depicts the members of the RecQ family of DNA helicases from humans, Xenopus, Drosophila, yeast and E. coli. The family contains a highly conserved helicase domain which comprises about 400 amino acids. Two signature motifs are present in most RecQ helicases: the RQC domain and HRDC (helicase and RNaseD C‐terminal) domain (Fig. 5). The RQC domain is unique to RecQ family helicases and probably has a role in mediating interactions with other proteins. The HRDC domain is found in a number of nucleases that digest RNA, and may be involved in DNA binding in RecQ proteins [45]. The members of this family of helicases can unwind many different kinds of structurally diverse DNA substrates, including G4‐DNA (G‐quadruplex DNA) and synthetic Holliday junctions. The BLM and WRN proteins have been shown to promote branch migration of bona fide Holliday junctions generated by the RecA protein [46, 47]. Significantly, this RecQ‐helicase‐catalysed branch migration occurs over several kilobases of DNA, whereas the regular unwinding that is catalysed by these enzymes is only moderately processive and limited to 75–150 bp in the absence of cofactors. This ability to process recombination intermediates formed during DNA replication is proposed to be a key function of the RecQ helicases.

Figure 5.

Figure 5

The RecQ helicase family. Schematic representation of the RecQ family of DNA helicases from human (WRN, BLM, RECQ4, RECQ5, RECQL), Xenopus (FFA‐1, xBLM), Drosophila (DmBLM), Saccharomyces cerevisiae (Sgs1), Saccharomyces pombe (Rqh1) and E. coli (RecQ). Proteins are aligned by their conserved helicase domains. The size of each protein (in amino acids) is indicated in parentheses under the respective member. The key to various domains is indicated in the box at the bottom. Note: The Drosophila melanogaster RECQ5 and RECQE, Caenorhabditis elegans RecQ5, and RecQ homologs from Arabidopsis thaliana are not shown.

Except for E. coli RecQ, human RECQ5 and RECQL, members of this family are large with extended amino and carboxy ends (Fig. 5). These flanking ends are poorly conserved and are important in functionally distinguishing the roles of the different ‘isoforms’ in humans. WRN helicase differs from the other family of RecQ helicases in containing exonuclease activity as well. The BLM helicase belongs to the family of ring helicases, which includes SV40 large T antigen and E. coli DnaB and RuvB proteins [47]. BLM makes numerous contacts with other proteins. It is a component of ATM kinase, BASC (BRCA1‐associated genome surveillance complex), a multienzyme complex that contains BRCA1 and MLH1, the MRE11–RAD50–NBS1 complex [44]. It has also been shown in a number of studies that WRN interacts physically and functionally with proteins required for DNA replication, including DNA polymerase δ, flap endonuclease 1, PCNA, and RPA [44]. A physical and functional interaction has also been reported for WRN protein and the Ku heterodimer complex, suggesting that WRN protein is also involved in double‐strand break repair [48]. The connection between RecQ homologs and Ku appears to be relevant for telomere maintenance as well. It has been shown that primary fibroblasts from patients with Werner's syndrome, like Ku‐deficient murine cells, display excessive telomere shortening and premature replicative senescence, which is thought to contribute to the early onset of aging seen in Werner's syndrome [49]. However, yeast Sgs1p is also required for a telomere maintenance pathway that is independent of telomerase and dependent on recombination [50]. Thus the RecQ helicases help to define two mechanically distinct telomere maintenance pathways that are both telomerase‐independent and recombination‐dependent.

In a recent study it has been shown that only short DNA duplexes (< 30 bp) can be unwound by RECQ1 alone, but the addition of human replication protein A (hRPA) increases the processivity of the enzyme (> 100 bp). These findings suggest that RECQ1 and hRPA may also interact in vivo and function together in DNA metabolism [51]. RECQ5 is one of the five RecQ helicase homologs identified in humans. Drosophila RECQ5 helicase is capable of unwinding 3′ Flap, three‐way junction, fork and three‐strand junction substrates at lower protein concentrations than 5′ Flap, 12 nucleotide bubble and synthetic Holliday junction structures, which can be unwound efficiently by WRN and BLM [52]. Taken together these findings provide evidence that RecQ helicases facilitate smooth ‘replisome’ progression through the genome. Six cDNAs of RecQ‐like proteins (AtRecQI1, AtRecQI2, AtRecQI3, AtRecQI4A, AtRecQI4B and AtRecQsim) and one gene (AtWRNexo) homologous to the exonuclease domain of human Werner protein have also been isolated from plants [53].

Concluding remarks

The DNA helicases are known to play essential roles in unwinding of duplex strands in almost every aspect of nucleic acid metabolism, which makes them very important molecules. Functional helicases always work as an integrated component of a large macromolecular complex which is ‘designed’ to carry out a particular function on its genomic DNA target. The functions of all reported DNA helicases have not yet been investigated. It will be important to identify and characterize their specific substrates, interacting proteins, and cofactors in order to precisely elucidate their specific functions in nucleic acid metabolism. Despite the diversity of their functions and the large range of organisms in which these proteins have been identified, high sequence conservation is maintained, suggesting that all helicase genes evolved from a common ancestor. The family of DEAD‐box helicases differ mainly in the N‐terminal and C‐terminal sequences, which contain different targeting signals. The different regulatory mechanisms at both the level of expression and post‐transcription may explain the wide spectrum of functions involving DEAD‐box helicases. The crystal structures of DNA helicases suggest that the helicase motifs are clustered together for the overall unwinding function. The motifs, crystal structure, mechanism of unwinding, and various functions of DNA helicases are described in detail in the following review.

The Mcm proteins play an important role in replication by acting as licensing factors, ensuring that replication is not re‐initiated until the first round is completed in one cell cycle. These proteins initiate unwinding at the origin, leading to the formation of a replication bubble. They create a replication fork and move along the fork, at the same time bringing the origin to the unlicensed state. The conservation of Mcms from Archaea to humans suggests that several of these proteins may have evolved from a single progenitor. Helicases of the RecQ family catalyse critical genome maintenance reactions in bacterial and eukaryotic cells, playing key roles in several DNA metabolic processes. Mutations in RecQ genes are linked to genome instability and human disease. As mutants in RecQ family genes have unstable chromosomes, it was proposed that members of the RecQ helicase family play a central role in the maintenance of genomic stability and thereby the prevention of tumorigenesis. A connection between the action of RecQ homologs and gene expression was demonstrated by the involvement of QDE3 of Neurospora crassa in post‐transcriptional gene silencing [54]. However, only one RecQ homolog was found in E. coli and yeast genomes, but multiple homologs are found in the animal and plant kingdom. These homologs of RecQ seem to have evolved first by duplication and then by consecutive shuffling or fusion with other protein domains [54]. Careful genomic studies and further biochemical analysis are needed to understand fully how RecQ DNA helicases influence the processing and/or prevention of recombination intermediates.

Despite considerable progress overall in the helicase field in the last two decades, a large gap in our knowledge persists which prevents complete understanding of the complex genetic processes. Considering the multiplicity of helicases, their tissue specificity, functional diversity, and control of activity by phosphorylation/dephosphorylation, it is reasonable to assume that they also play an important role in cell division, growth, and development. Progress in plant helicases is slow. A better understanding of these proteins will have to await the breakthrough that hopefully will be provided by the high‐resolution structure to be solved by X‐ray crystallography. Furthermore, the RNAi approach and/or transgenic antisense plant technology should help in understanding the detailed role of helicases in plant development.

The number of DNA helicases isolated from all systems is continuously growing. This has created the problem of a bewildering array of different names and classification. Therefore it is important that a clear, scientific system for nomenclature and classification of helicases is formulated. However, this class of proteins does not, at this point, lend itself to a simple method of classification.

Acknowledgements

We are grateful to Dr N. S. Mishra and Mr T. Q. Ngoc for their help in preparation of illustrations.

References