Tài liệu hạn chế xem trước, để xem đầy đủ mời bạn chọn Tải xuống
1
/ 13 trang
THÔNG TIN TÀI LIỆU
Thông tin cơ bản
Định dạng
Số trang
13
Dung lượng
480,77 KB
Nội dung
Site-directedmutagenesisandfootprinting analysis
of theinteractionofthesunflowerKNOXprotein HAKN1
with DNA
Mariana F. Tioni, Ivana L. Viola, Raquel L. Chan and Daniel H. Gonzalez
Ca
´
tedra de Biologı
´
a Celular y Molecular, Facultad de Bioquı
´
mica y Ciencias Biolo
´
gicas, Universidad Nacional del Litoral, Santa Fe, Argentina
Homeobox genes encode a group of eukaryotic tran-
scription factors generally involved in the regulation of
developmental processes [1]. These genes contain a
region coding for the homeodomain, a 60 amino acid
protein motif that interacts specifically withDNA [2].
The homeodomain folds into a characteristic three-
helix structure. Helices I and II are connected by a
loop, while helices II and III are separated by a turn,
resembling prokaryotic helix-turn-helix transcription
factors. However, unlike helix-turn-helix-containing
proteins, most homeodomains are able to bind DNA
as monomers with high affinity, through interactions
made by helix III (the so-called recognition helix) and
a disordered N-terminal arm located beyond helix I
[3–6].
In plants, the first homeobox was identified in the
maize gene Knotted1 (kn1; [7]). Dominant mutations in
kn1, which is normally active only in meristematic
cells, affect leaf development due to its aberrant
expression in these organs [8]. Additional kn1-like
genes (also termed knox genes) have been isolated
from maize and other monocot and dicot species
Keywords
DNA-binding specificity; footprinting;
homeodomain; KNOX protein; recognition
code
Correspondence
D. H. Gonzalez, Ca
´
tedra de Biologı
´
a Celular
y Molecular, Facultad de Bioquı
´
mica y
Ciencias Biolo
´
gicas (UNL), CC 242 Paraje El
Pozo, 3000 Santa Fe, Argentina
Fax ⁄ Tel: +54 342 4575219
E-mail: dhgonza@fbcb.unl.edu.ar
(Received 13 July 2004, revised 31 August
2004, accepted 21 September 2004)
doi:10.1111/j.1432-1033.2004.04402.x
The interactionofthe homeodomain ofthesunflowerKNOX protein
HAKN1 withDNA was studied by site-directed mutagenesis, hydroxyl
radical footprintingand missing nucleoside experiments. Binding of
HAKN1 to different oligonucleotides indicated that HAKN1 prefers the
sequence TGACA (TGTCA), with changes within the GAC core more pro-
foundly affecting the interaction. Footprintingand missing nucleoside
experiments using hydroxyl radical cleavage ofDNA showed that HAKN1
interacts with a 6-bp region ofthe strand carrying the GAC core, covering
the core and nucleotides towards the 3¢ end. On the other strand, protec-
tion was observed along an 8-bp region, comprising two additional nucleo-
tides complementary to those preceding the core. Changes in the residue
present at position 50 produced proteins with different specificities. An
I50S mutant showed a preference for TGACT, while the presence of lysine
shifted the preference to TGACC, suggesting that residue 50 interacts with
nucleotide(s) 3¢ to GAC. Mutation of Lys54 fi Val produced a protein
with reduced affinity and relaxed specificity, able to recognize the sequence
TGAAA, while the conservative change of Arg55 fi Lys completely abol-
ished binding to DNA. Based on these results, we propose a model for the
interaction ofHAKN1withDNA in which helix III ofthe homeodomain
accommodates along the major groove with Arg55, Asn51, Lys54 and
Ile50, establishing specific contacts with bases ofthe GACA sequence or
their complements. This model can be extended to other KNOX proteins
given the conservation of these amino acids in all members ofthe family.
Abbreviations
TALE, three-amino-acid loop extension.
190 FEBS Journal 272 (2005) 190–202 ª 2004 FEBS
(reviewed in [9]), indicating that this class of genes
constitutes a family present throughout the plant king-
dom. Theknox family of genes can be subdivided into
two classes, I and II, by sequence relatedness and
expression patterns [10]. Based on the expression pat-
terns [11–13], analysisof mutants [14–17] and over-
expression studies [18–21] it was proposed that class
I knox genes are involved in the maintenance of meris-
tematic cells in an undifferentiated state. Indeed, over-
expression of some class I genes in Arabidopsis and
tobacco produces the proliferation of meristems on the
surface of leaves.
The proteins encoded by knox genes belong to the
three-amino-acid loop extension (TALE) superclass.
Members of this superclass contain three extra amino
acids within the loop connecting helices I and II [22]
and are present in several eukaryotic kingdoms, sug-
gesting that they represent an early evolutionary acqui-
sition.
Concerning their interactionwith DNA, studies with
proteins from barley [23], tobacco [24], rice [25] and
maize [26] indicate that they bind related sequences
containing a TGAC core (GTCA in the complement-
ary strand), considerably different from the sequence
TAAT recognized by most homeodomains [27]. Eluci-
dating the structural basis for this difference would
help to understand at the molecular level how KNOX
transcription factors recognize their DNA target site.
In this study, we analysed theinteractionof the
homeodomain of HAKN1, a sunflower class I KNOX
protein [28], with DNA. Based on studies of wild-type
and mutant forms ofthe homeodomain, we propose a
model for the complex between HAKN1and its target
site. This model must be applicable to all KNOX
homeodomains, as important amino acids are con-
served within this family.
Results
Expression andDNA binding analysisof the
HAKN1 homeodomain
The homeodomain oftheKNOX transcription factor
HAKN1 was expressed in Escherichia coli as a fusion
with the maltose binding protein using vector
pMALc2. The fusion protein was purified by affinity
chromatography in amylose resin and used for DNA–
protein interaction studies. A 24-bp oligonucleotide
(HAKN1 binding site; BS1) containing the sequence
TGT(G ⁄ C)ACA was used as DNA target. This seq-
uence was designed against a compilation of sequences
bound by KNOX transcription factors from different
species, and contains the TGAC (GTCA) core that is
present in all of them.
Figure 1A shows an electrophoretic mobility shift
assay performed withHAKN1and oligonucleotide
BS1 or variants containing changes at single positions
(sequences shown in the right panel). We have arbi-
trarily numbered from 1 to 7 those positions present in
the strand that contains the central G. Two shifted
B
A
C
Fig. 1. Binding ofHAKN1 to different oligo-
nucleotides. (A) Electrophoretic mobility
shift assay performed with 30 ng of HAKN1
and oligonucleotides containing different
variants ofthe sequence TGT(G ⁄ C)ACA
(numbers indicated above each lane). (B)
Competition assay ofHAKN1 binding to BS1
using a 15-fold molar excess of different
oligonucleotides (numbers indicated above
each lane) as competitors. The sequence of
the 7-bp core present in each oligonucleo-
tide is shown in (C) for reference. Modifica-
tions with respect to BS1 are shown within
black boxes.
M. F. Tioni et al. KNOX homeodomain–DNA interactions
FEBS Journal 272 (2005) 190–202 ª 2004 FEBS 191
bands of similar intensity were observed in this experi-
ment. The relative intensity ofthe low mobility com-
plex varied when different protein preparations were
used. We speculate that this behavior may arise from
aggregation ofthe protein. Nevertheless, different pro-
tein preparations showed the same specificity and affin-
ity when considering the amount of bound protein as
the sum of both shifted bands. These bands displayed
similar footprinting patterns (see below), suggesting
that a single HAKN1 homeodomain is bound to DNA
in both complexes. This is strengthened by the fact
that only monomer–DNA complexes were observed in
crosslinking experiments (data not shown).
Analysis oftheinteractionofHAKN1with different
oligonucleotides indicates that modifications in the
outermost positions (1 and ⁄ or 7) do not significantly
affect binding (Fig. 1A, lanes BS1, 1,7, 7T, 1 and 7C),
while certain inner nucleotides, notably those located
at positions 4–6, are critical for binding (Fig. 1A, lanes
4, 6A and 5). Regarding position 7, the change of A
for T does not seem to affect binding, while the intro-
duction of C partially reduces the amount of complex
formed. Mutations at positions 2 (not shown) and 3
(lane 3) have only a moderate effect. Similar obser-
vations could be made in experiments in which the
binding to oligonucleotide BS1 was competed with a
15-fold molar excess of different oligonucleotides
(Fig. 1B). These results indicate that HAKN1 mainly
recognizes the GAC (GTC) trinucleotide and displays
lower specificity at outer positions. The GAC triplet is
contained within the TGAC sequence, found to be
part ofthe binding sites ofthe barley KNOX protein
Hooded [23] andof maize Knotted1 [26]. This element
is also present in the sequence GTNAC, postulated to
be important for the binding ofthe tobacco protein
NTH15 to DNA [24], provided that N is G or C.
Analysis ofDNA binding by hydroxyl radical
footprinting and interference assays
A more detailed picture ofthe binding ofHAKN1 to
its target site was obtained by theanalysisof footprint-
ing patterns after cleavage of free and protein-bound
DNA with hydroxyl radicals generated by Fe–EDTA
complexes. For this purpose, a dimer ofthe corres-
ponding oligonucleotide ligated through its EcoRI
cohesive site was cloned into the BamHI site of pBlue-
script SK
–
. Cleavage with HindIII and XbaI produces
a 94-bp fragment that contains two HAKN1 binding
sites in opposite orientations. After HAKN1 binding
to the 94-bp oligonucleotide, labeled specifically at one
of its 3¢ ends by filling-in the HindIII site, the complex
was subjected to hydroxyl radical attack, and free and
bound DNA were separated, recovered from the gel
and analysed by denaturing polyacrylamide gel electro-
phoresis (Fig. 2A). Because the oligonucleotide con-
tains two sites in opposite orientation, both strands of
the binding site can be observed in a single footprint-
ing assay. Analysisofthe cleavage patterns indicates
that HAKN1 protects six nucleotides from the strand
carrying the sequence TGTGACA (hereafter named
the top strand). The protected area includes GACA
and two adjacent nucleotides (GA) towards the 3¢ end
(Fig. 2A). On the bottom strand, the protected region
covers two additional nucleotides, AC complementary
to GT in TGTGACA (Fig. 2A). For both strands, the
highest protection is observed within the GAC core,
suggesting that theprotein makes closer contacts in
this region. This agrees withthe important role of
these nucleotide positions in determining the binding
strength ofHAKN1 to DNA shown by electrophoretic
mobility shift assays. When the oligonucleotide labeled
at its XbaI site (at the opposite 3¢ end) was used, foot-
printing patterns were identical to those described
above, indicating that HAKN1 makes equivalent
contacts with both binding sites present in the 94-bp
fragment.
Footprinting analysis was also performed with a
similar oligonucleotide containing two mutated sites
[BS(mut1,7);
AGTGACT instead of TGTGACA,
mutations underlined). The results obtained were
essentially the same (not shown), suggesting that
HAKN1 contacts the nucleotide adjacent to the GAC
core and its complement on the other strand whether
they are A or T.
Information about the nucleotide positions that
influence binding ofHAKN1 to DNA was obtained
from missing nucleoside (interference) experiments.
Here, DNA is treated with hydroxyl radical-generating
agents before protein binding, thus producing a popu-
lation of molecules with single cleavages along the
phosphodiester backbone. This population is incubated
with theproteinof interest and subjected to an elec-
trophoretic mobility shift assay from which the free
and bound fractions are recovered. Molecules with
cleavages at positions important for binding are then
under-represented in the bound fraction and, depend-
ing on the binding conditions, over-represented in the
free fraction. Figure 2B shows a missing nucleoside
experiment using HAKN1andthe 94-bp DNA frag-
ment containing two binding sites previously labeled in
one of its 3¢ ends (HindIII or XbaI sites) and treated
with Fe–EDTA. It is noteworthy that there is a good
correlation between the region protected by HAKN1
and the nucleotide positions important for binding.
This means that all nucleotides in the protected area
KNOX homeodomain–DNA interactions M. F. Tioni et al.
192 FEBS Journal 272 (2005) 190–202 ª 2004 FEBS
establish contacts that contribute to binding efficiency.
Again, the GAC core seems to be particularly import-
ant, but outside positions are also required (Fig. 2B).
Within the core, modifications to G and A or their
complements influence binding more markedly. These
results agree withthe fact that mutations of these two
nucleotides abolish binding ofHAKN1 to DNA. On
the other hand, because nucleotides at outside posi-
tions can be mutated without significant loss in bind-
ing efficiency, it can be assumed that they mainly
participate in nonspecific contacts, such as those estab-
lished withthe sugar–phosphate backbone.
The results offootprintingand missing nucleoside
experiments also indicate that HAKN1 does not make
AB
Fig. 2. Hydroxyl radical footprintingand interference analysisofHAKN1 binding to DNA. An oligonucleotide containing two HAKN1 binding
sites (BS1) in opposite orientations was labeled in the 3¢ end of either strand (HindIII or XbaI sites) and subjected to hydroxyl radical attack
either after (A) or before (B) HAKN1 binding. Free (F) and bound (B) DNA were separated and analysed. A portion ofthe same fragment
digested with defined restriction enzymes was used as a standard (S) to calculate the position ofthe footprint. Letters to the right of each
panel indicate theDNA sequence (5¢ end in the upper part) ofthe corresponding strand in this region. In the lower part, the sequence of the
binding site is shown andthe protected positions are indicated in bold and underlined. The GAC (GTC) core that shows the highest protec-
tion is shaded.
M. F. Tioni et al. KNOX homeodomain–DNA interactions
FEBS Journal 272 (2005) 190–202 ª 2004 FEBS 193
symmetrical contacts with its target site. The protein
establishes contacts with both strands at the right side
of the GAC core, while only one strand seems to be
contacted at the left side. This lack of symmetry and
the extension ofthe contacts most probably indicate
that only one molecule ofHAKN1 is bound at each
target site.
Binding ofHAKN1 single-site mutants to DNA
The picture that emerges from our results is that
HAKN1 binds an 8-bp region ofDNAwith a tGACa
(tGTCa) specificity core. An interesting question is
how theHAKN1 homeodomain interacts with this
sequence and which amino acids are involved in
sequence-specific contacts. To answer this, we have
analysed the effect of single-site mutations on HAKN1
binding to TGACA and variants of this sequence. It is
logical to assume that changes in amino acids involved
in theinteraction must influence binding efficiency. In
addition, some substitutions may alter binding specific-
ity, indicating the existence of contacts between a given
residue and defined positions within the DNA.
Residue 50 (53 in TALE homeodomains) is usually
involved in determining the different specificities
among related homeodomains [27,29–31]. In homeo-
domains that bind the canonical TAAT sequence,
residue 50 interacts with nucleotides located 3¢ to this
site [27,31]. We reasoned, then, that changing Ile50,
present in HAKN1and all KNOX proteins, may influ-
ence sequence preferences at external positions of the
core. As a first approach, we mutated Ile50 to Ser, pre-
sent in the yeast TALE protein MATa2 [32]. The ana-
lysis of binding of I50S–HAKN1 to variants of the
HAKN1 binding site indicates a preference for an
oligonucleotide containing the sequence TGACT, while
the wild-type HAKN1 homeodomain binds TGACA
and TGACT with similar efficiency (Fig. 3A). This
suggests that residue 50 interacts withthe 3¢ region of
the top strand (and ⁄ or the 5¢ region ofthe bottom
strand), outside the GAC core. This is also evident in
competition experiments (Fig. 3B), where oligonucleo-
tides BS(mut1,7) and BS(mut7T) compete more effi-
ciently than variants with A [BS1 and BS(mut1)] or C
[BS(mut7C)] at this position. Changes at other posi-
tions within the target DNA sequence produced sim-
ilar effects on binding than withthe wild-type protein
(Fig. 3).
To further explore the hypothesis that residue 50
is oriented towards the 3¢ end ofthe top strand, we
also mutated Ile50 to Lys, present in Drosophila bicoid
[33]. I50K–HAKN1 shows a net preference for an
oligonucleotide containing the sequence TGACCC
[BS(mut7C)] over the original TGACAG, present in
AB
CD
Fig. 3. DNA binding preferences of HAKN1
mutants at position 50. (A) Electrophoretic
mobility shift assay of I50S–HAKN1 (30 ng)
binding to BS1 and BS(mut1,7). (B) Binding
of I50S–HAKN1 to BS(mut1,7) was com-
peted with a 100-fold molar excess of oligo-
nucleotides with different sequences
(depicted in Fig. 1). (C) Binding of I50K–
HAKN1 (30 ng) to different oligonucleotides
was analysed by an electrophoretic mobility
shift assay. In (D), the binding of different
amounts (50, 100 and 250 ng) of either
HAKN1 or I50K-HAKN1 to oligonucleotides
BS1 and BS(mut7C) is shown. Oligonucleo-
tide sequences are shown in Fig. 1.
KNOX homeodomain–DNA interactions M. F. Tioni et al.
194 FEBS Journal 272 (2005) 190–202 ª 2004 FEBS
BS1 and BS(mut1) (Fig. 3C). This result confirms that
residue 50 interacts with nucleotides adjacent to the
TGAC core. Binding analysiswith different oligonuc-
leotides indicated that I50K–HAKN1 is also able to
interact with oligonucleotide BS(mut6G), that contains
a TGAG core (Fig. 3C). In fact, when higher protein
concentrations were used in the assays, binding to
TGAGAG was considerably better than to TGACAG
(not shown), suggesting that Lys50 may also be able to
contact the fourth position ofthe core, thus changing
the preference for G. The inclusion of Lys at position
50, in addition to promoting a change in specificity,
resulted in a proteinwith increased affinity towards its
preferred binding site (Fig. 3D). An additional, fast-
migrating band observed in this experiment is present
in free DNAand may represent noncovalent oligo-
nucleotide dimers interacting through their cohesive
ends. We have observed that the presence of this spe-
cies does not affect the intensity ofthe shifted band.
The increased affinity dispalyed by I50K–HAKN1
may arise from the fact that lysine is able to establish
hydrogen bonds with DNA, which are more stable
than the van der Waals contacts established by Ile.
The interactionof mutants at position 50 with their
preferred binding sites was also analysed by footprint-
ing experiments. I50S–HAKN1 protects a region cov-
ering five nucleotides ofthe top strand and six
nucleotides ofthe bottom strand (Fig. 4A). This region
is coincident withthe one more strongly protected by
wild-type HAKN1, but is shorter towards the 3¢ end
of the top strand. This result further suggests that
Ile50 contacts the nucleotides located 3¢ to the TGAC
core, as its replacement by a smaller residue such as
Ser allows better access of this region to the modifying
agent. Conversely, I50K–HAKN1 shows an extended
footprinting pattern towards the 3¢ end ofthe top
strand andthe 5¢ end ofthe bottom strand (Fig. 4B).
This agrees withthe presence of a larger residue that
makes stable contacts with this region of DNA.
The interactionof mutants at position 50, and par-
ticularly of I50K–HAKN1 with DNA, provides a
framework to build a model of HAKN1–DNA inter-
actions, taking into account experiments performed with
other homeodomains. Theprotein bicoid, for example,
is able to bind the sequence TAATCC that contains
the canonical TAAT box [31]. Lys50 of bicoid puta-
tively interacts withthe CC dinucleotide, as its muta-
tion to Gln changes its preference to TAATTG [29]. A
reciprocal change, Gln50 to Lys, in engrailed or fushi
tarazu shifts sequence preferences from TAATTA or
TAATTG to TAATCC [30,34]. We postulate, then,
that positioning oftheHAKN1 homeodomain along
the TGAC core in DNA must be equivalent to that
adopted by other homeodomains along the TAAT
sequence. The third position of both sequences con-
tains an adenine, known to interact with Asn51,
AB
Fig. 4. Hydroxyl radical footprintingof I50S–HAKN1 (A) and I50K–HAKN1 (B) bound to their preferred binding sites. After binding and hydro-
xyl radical attack, free (F) and bound (B) DNA were separated and analysed. The left and right panels in (A) and (B) represent the top and
bottom strands ofthe binding site, respectively. A portion ofthe same fragment digested with defined restriction enzymes was used as a
standard to calculate the position ofthe footprint. Letters to the right of each panel indicate theDNA sequence (5¢ end in the upper part) of
the corresponding strand in this region. Below the footprints, the sequence ofthe corresponding binding site is shown andthe protected
positions are indicated in bold and underlined.
M. F. Tioni et al. KNOX homeodomain–DNA interactions
FEBS Journal 272 (2005) 190–202 ª 2004 FEBS 195
universally conserved among homeodomains [2,31].
The importance of this interaction is reflected by the
fact that this nucleotide cannot be mutated without a
complete loss ofHAKN1 binding. The fourth base in
TAAT is usually recognized by a nonpolar amino acid
(mostly Ile or Val) present at position 47 [2,31].
HAKN1 contains Asn at this position, which may be
too small to establish specific contacts with bases.
Asn47 does not make specific contacts in the homeo-
domain–DNA complexes of MATa2 and extradenticle
[5,35]. Here, we favour the hypothesis that the fourth
position ofthe core is contacted by Lys54, because the
nucleotide next to that contacted by Asn51 is recog-
nized by residue 54 in other homeodomains (see
below). In support of a prominent role of Lys54, its
mutation to Val produces a significant decrease in
DNA binding (not shown). In addition, K54V–
HAKN1 binds with similar efficiency to sequence vari-
ants containing either A [BS(mut6A)] or C (BS1) at
the fourth position of TGAC, suggesting that it has a
decreased discrimination capacity with respect to wild-
type HAKN1 (Fig. 5). An oligonucleotide containing
TGAG [BS(mut6G)], however, is bound with reduced
efficiency, suggesting that the mutant homeodomain
retains partial specificity. Discrimination at other posi-
tions ofthe bound region is similar to those displayed
by the wild-type protein. Although the results obtained
do not necessarily indicate a direct role of Lys54 in
establishing contacts with DNA, a plausible explan-
ation is that this residue interacts with at least one of
the members ofthe CÆG pair at the fourth position of
TGAC in the HAKN1–DNA complex.
The two leftmost positions ofthe core interact
through the minor groove withthe N-terminal arm in
most homeodomains [2,31]. In yeast MATa2, for
example, the N-terminal arm makes base-specific con-
tacts withthe first two nucleotides of a TTAC core [5].
Hence, we replaced the N-terminal arm (residues 1–9)
of HAKN1withthe same portion of MATa2, to
determine if a change in specificity was observed. The
resulting protein, Na–HAKN1, showed an overall
reduced affinity but the same sequence preferences as
HAKN1 (Fig. 6). It is noteworthy that it did not bind
oligonucleotide BS(mut4), which contains a TTAC
core on the complementary strand. This indicates that
the N-terminal arm of MATa2 is not able to interact
with DNA within the context oftheHAKN1 homeo-
domain as it does within MATa2. Poor binding may
arise from incorrect folding ofthe chimeric protein or
from the fact that important contacts withDNA are
lost upon replacement oftheHAKN1 N-terminal arm.
In addition to a role ofthe N-terminal arm in contact-
ing the first two amino acids ofthe core, examination
Fig. 6. Effect of changes within the N-terminal arm and position 55
on the binding ofHAKN1 to oligonucleotides BS1 and BS(mut4).
Binding to oligonucleotides containing the sequences TGACA (BS1)
or TTACA [BS(mut4)] was analysed using 30 ng of proteins HAKN1,
R55K–HAKN1, Na–HAKN1 (a protein containing the N-terminal arm
of MATa2) or R55K–Na–HAKN1 (a proteinwith both modifications).
A
B
Fig. 5. K54V–HAKN1 shows relaxed specificity. Binding of K54V–
HAKN1 (150 ng) to different oligonucleotides was analysed in an
electrophoretic mobility shift assay (A). (B) Competition of K54V–
HAKN1 binding to BS1 with a 25-fold molar excess of different
oligonucleotides (depicted in Fig. 1).
KNOX homeodomain–DNA interactions M. F. Tioni et al.
196 FEBS Journal 272 (2005) 190–202 ª 2004 FEBS
of other homeodomain–DNA complexes suggests the
possibility that Arg55 recognizes the second position
of TGAC. Arg55 participates in binding to G residues
in other homeodomains, such as yeast MATa1
(GATG; [36]) or Drosophila extradenticle (TGAT;
[35]). Consistent with a role in DNA binding, an
Arg55 to Ala mutation completely disrupts the inter-
action ofHAKN1withDNA (not shown). To further
analyse its involvement in base-specific contacts, we
reasoned that a conservative substitution for Lys
would not affect nonspecific interactions (i.e. electro-
static interactions withthe phosphate backbone), but
would preclude the establishment of hydrogen bonds
with the guanine base of G. The results shown in
Fig. 6 indicate that R55K–HAKN1 is unable to bind
DNA, supporting the hypothesis that Arg55 is
involved in base-specific contacts, which are disrupted
upon mutation to Lys. Another explanation would be
that this change disturbs the overall folding of the
homeodomain, but this seems unlikely because several
homeodomains, notably MATa2, contain Lys at posi-
tion 55. Assuming that the N-terminal arm of MATa2
and Arg55 may be incompatible as both portions may
interact withthe same positions ofthe target site, we
also constructed a mutant in which the N-terminal
arm of MATa2 was inserted into the R55K mutant of
HAKN1. This protein was also ineffective in binding
to theHAKN1 target site or its variants (Fig. 6).
A model for the HAKN1–DNA interaction
Based on theanalysisofthe binding of wild-type and
mutant HAKN1 homeodomains to different DNA tar-
get sites, we propose a model for theinteraction of
HAKN1 with DNA. A set of four amino acids,
located within helix III ofthe homeodomain, would
make base-specific contacts with defined nucleotides
within the tGACAg sequence. Arg55 would establish a
pair of hydrogen bonds with positions O6 and N7 of
guanine in GACA. As mentioned above, similar inter-
actions have been observed in complexes of other
homeodomains withDNA [35,36]. Asn51 would inter-
act withthe first adenine in GACA, also establishing
a pair of hydrogen bonds, as in most homeodomain–
DNA complexes. The next position (C, or G in the
opposite strand) would be contacted by Lys54.
Although there is no evidence in the literature about a
specific contact made by a lysine at this position, resi-
due 54 interacts withthe nucleotide adjacent to that
bound by Asn51 in several homeodomains, for exam-
ple MATa2 (Arg54, TTAC; [5]), TTF1 (Tyr54, CAAG;
[37]), bicoid (Arg54, TAAT or TAAG; [38]) and Hahr1
(Thr54, TAAA, in this case in combination with
Phe47; [39]). Additionally, lysine determines a prefer-
ence for C at an adjacent position when present at
position 50 in bicoid and other mutant homeodomains
(including HAKN1, see above), presumably by inter-
acting with guanine bases through hydrogen bonds as
observed in the Lys50–engrailed crystal structure [40].
Finally, our results also indicate that Ile50 is
involved in establishing a preference for A or T at the
3¢ side ofthe core. Mutations of this residue to Ser or
Lys were able to confer a new binding specificity to
HAKN1, changing to a net preference for T or C,
respectively. Ile50 is present in MATa1, where it inter-
acts with a TA dinucleotide adjacent to the position
contacted by Met54 [36]. Accordingly, Ile50 may also
be involved in contacts with an adjacent position,
which is protected by HAKN1 in footprinting experi-
ments and interferes with binding when modified by
hydroxyl radical attack.
To examine the consistence ofthe interactions des-
cribed above, we have constructed a theoretical model
of the HAKN1–DNA complex using the program
swiss-model [41] available in the ExPASy web server.
Different models for wild-type and mutant HAKN1
were obtained using the homeodomain–DNA com-
plexes of extradenticle [35], MATa1 [36] and MATa2
[5] as templates. Figure 7 shows the alignment of helix
III oftheHAKN1 homeodomain along the major
groove ofDNA (the MATa1 binding site in this case).
Amino acids in red are those present in wild-type
HAKN1 that putatively contact the GAC core. Note
that Arg55 and Asn51 establish hydrogen bonds with
adjacent G and A, respectively. Interestingly, Lys54
also appears making a hydrogen bond withthe N7 of
an adjacent purine (adenine in this case) present in the
complementary strand. A similar contact could be
made with a guanine complementary to C in GAC,
further suggesting that Lys54 is likely to contact this
position. The position of Lys55 in the corresponding
mutant is shown in yellow. Clearly, the specific con-
tacts made by Arg55 are lost and are replaced by an
interaction withthe phosphate backbone. Val54 is
shown in pink. The shorter side chain andthe loss of
a hydrogen bond explain the decrease in affinity and
relaxed specificity. Finally, the variants at position 50
(Ile in orange, Ser in green and Lys in blue) are also
represented. All these residues are oriented towards the
3¢ end ofthe core and probably establish contacts with
the complementary strand. It should be emphasized
that themutagenesis experiments described here do
not prove that certain amino acids make base-specific
contacts, especially when a new specificity at a defined
position was not achieved. The combination of these
experiments withthefootprinting results and previous
M. F. Tioni et al. KNOX homeodomain–DNA interactions
FEBS Journal 272 (2005) 190–202 ª 2004 FEBS 197
knowledge, however, are highly indicative that this is
the case. Determination ofthe three-dimensional struc-
ture ofthe complex will be required to evaluate the
accuracy ofthe DNA–protein contacts proposed by
this model.
Discussion
In this study, we investigated theinteractionof the
homeodomain oftheKNOXproteinHAKN1 with
DNA. As no structural studies on theinteraction of
any KNOXproteinwithDNA have been reported,
ours constitutes a first approach to understand these
interactions at the molecular level. Electrophoretic
mobility shift assays, footprinting analyses and missing
nucleoside experiments using different binding sites
and mutated proteins allowed us to establish a model
for HAKN1–DNA interaction. This model postulates
that HAKN1 binds to a TGACNN core primarily
through interactions of certain helix III amino acids
(Ile50, Asn51, Lys54 and Arg55) with DNA. This
particular combination of amino acids is present only
in KNOX proteins, indicating that they may have been
selected through evolution to generate a defined specif-
icity. Among them, the incorporation of Ile50 and
Arg55 must have been particularly important. Other
homeodomains that contain Ile50 and Arg55 are those
of the TGIF, Meis and Bell families [22] and yeast
MATa1 [36]. TGIF and Meis proteins bind the
sequence TGTCA (TGACA on the complementary
strand [42,43]), which is identical to that recognized by
HAKN1. They possess Arg at position 54, suggesting
that Lys54 in HAKN1 may not be the only means of
recognizing the TGAC core. Accordingly, the Bell
protein ATH1, which contains Val54, also selects
the sequence TGACA from a random population
(I. Viola, unpublished results). The presence of Val54
produces, however, a relaxed specificity at the fourth
position and reduced affinity within the context of
both theHAKN1 (this study) andthe ATH1 homeo-
domain (I. Viola, unpublished results). MATa1, in
turn, binds a completely different sequence (GATGT ⁄
ACATC [44]), indicating that other factors apart from
these residues also influence specificity. The GA dinu-
cleotide in GATGT is recognized by Arg55 and Asn51
of MATa1, as proposed here for the GA dinucleotide
in TGAC. The GT dinucleotide is, in turn, contacted
by Met54 and Ile50 through interactions withthe com-
plementary strand [36]. This means that, in MATa1,
positions contacted by residues 55 ⁄ 51 and 54 ⁄ 50 are
separated by one additional base pair. This may be
originated by the presence of Val47, which binds the
nucleotide adjacent to the A recognized by Asn51 in
many homeodomains. The above-mentioned TGAC
binding proteins (including KNOX proteins) contain
Asn47, which may not establish specific contacts with
DNA. These differences may also originate changes in
the relative orientation ofDNA contacting amino
acids.
The model presented here can also be compared
with the structures determined for the TALE proteins
extradenticle and PBX1 bound to DNA [35,45]. These
proteins bind the sequence TGAT, with Arg55 and
Asn51 establishing hydrogen bonds withthe GA dinu-
cleotide, as proposed here for HAKN1. The initial T is
also contacted by Arg55 through van der Waals inter-
actions in the PBX1 complex [45]. The second T makes
van der Waals contacts with Asn47. As PBX1 and
extradenticle contain Ile54, this situation may resemble
the binding behaviour of K54V–HAKN1, which shows
relaxed specificity at this position.
Our results withHAKN1 clearly support the idea
that there is a general recognition code for homeo-
domains. Accordingly, recognition at the left side of
A
B
Fig. 7. A model for theinteractionofHAKN1with DNA. (A) Dia-
gram oftheHAKN1DNA binding site withthe residues putatively
involved in binding each position. (B) Spatial model ofthe interac-
tion of helix III of wild-type and mutant HAKN1 homeodomains
with DNA. The model was constructed withthe program
SWISS-
MODEL [41] using the structures oftheDNA complexes of MATa1
(1YRN), extradenticle (1B8I) and MATa2 (1APL) as templates.
Amino acids in red are those present in wild-type HAKN1 that puta-
tively contact the GAC core. Residues at position 50 are: Ile in
orange, Ser in green and Lys in blue. Val54 and Lys55, present in
the mutants, are shown in pink and yellow, respectively.
KNOX homeodomain–DNA interactions M. F. Tioni et al.
198 FEBS Journal 272 (2005) 190–202 ª 2004 FEBS
the conserved A that is contacted by the universally
present Asn51 is determined by the N-terminal arm
and ⁄ or Arg55. The presence of Arg55 determines a G
5¢ to the conserved A, while the N-terminal arm seems
to determine a preference for A ⁄ T base pairs. The set
of residues present at positions 47, 50 and 54 influence
binding preferences at the right side.
The putative DNA-contacting amino acids of
HAKN1 are also present in all described KNOX pro-
teins, indicating that they may all recognize identical
or similar sequences. This raises the question of how
the specificity ofinteraction is achieved in vivo, because
different KNOX proteins have different functions.
A similar paradox has been noted for animal homeo-
domains, for which current evidence suggests that spe-
cificity arises from theinteractionof homeodomain
proteins with other factors that somehow influence
their DNA binding properties [46]. Plant KNOX pro-
teins interact with proteins from the Bell family, which
also belong to the TALE superclass [47,48] and bind
similar sequences [49]. Chen et al. [49] have shown that
potato KNOXand Bell proteins bind two tandem cop-
ies of a TGAC motif separated by one additional nuc-
leotide. As both types of homedomains seem to
establish similar contacts with their target sites
(I. Viola, unpublished results), this indicates that the
respective recognition helices must lie in an antiparallel
orientation within the major groove at opposite sides
of the DNA. According to thefootprinting data pre-
sented here, the central nucleotide pair andthe first
two pairs ofthe second TGAC would be contacted by
both proteins. This may indicate that some rearrange-
ments may occur upon complex formation by KNOX
and Bell proteins, either before or after binding to
DNA. In the complexes formed by PBX1 and extra-
denticle, the presence of Gly50, which does not contact
DNA, may allow the binding of an additional homeo-
domain in tandem immediately following TGAT
[35,43,45]. The presence of Ile50, that interacts with
nucleotides located at the 3¢ side ofthe core, may
explain the requirement of a larger distance between
both binding sites in the complexes formed by KNOX
and Bell proteins.
Sequences outside the homeodomain may also influ-
ence the binding properties ofthe protein. Indeed, a
stretch of 16 amino acids located immediately C-ter-
minal to the homeodomain forms an a-helix that has
been shown to influence theDNA binding affinity of
the PBX1 homeodomain [50,51]. As theprotein used
in our assays includes a C-terminal portion, we have
analysed the structure ofthe region immediately fol-
lowing theHAKN1 homeodomain using several secon-
dary structure prediction programs. We have only
observed a short region (five to eight amino acids
depending on the program) that has a propensity to
form an a-helix. Therefore, we consider it unlikely that
an effect ofthe C-terminal tail, similar to that
observed with PBX1, occurs in HAKN1 or other
KNOX proteins.
In summary, the results presented here constitute a
framework to understand at the molecular level how
KNOX proteins interact withDNAand how these
interactions contribute to the establishment of active
transcription complexes that influence defined develop-
mental pathways within plant cells.
Experimental procedures
Cloning, expression and purification of
recombinant proteins
HAKN1 homeodomain and C-terminal sequences were
amplified and cloned in-frame into the EcoRI and PstI sites
of the expression vector pMAL-c2 (New England Biolabs,
Beverly, MA, USA). Amplifications were performed using
Pfu DNA polymerase and oligonucleotides MALN1: 5¢-GC
GGAATTCAAAAAGAGAAAGAAAGGG-3¢ and MALC:
5¢-GGCCTGCAGCTAGAGAAGTGAAACATC-3¢ with
HAKN1 cDNA [28] as the template.
An I50S mutant was constructed using complementary
oligonucleotides I50SF and I50SR (5¢-CAACTGGTTC
A
GCAACCAAAGGAA-3¢ and 5¢-TTCCTTTGGTTGC
TGAACCAGTTG-3¢; introduced mutations underlined)
together with primers MALC and MALN1, respectively, to
amplify partially overlapping N-terminal and C-terminal
HAKN1 fragments. The resulting products were mixed in
buffer containing 50 mm Tris ⁄ HCl (pH 7.2), 10 mm
MgSO
4
, and 0.1 mm dithiothreitol, incubated at 95 °C dur-
ing 5 min, and annealed by allowing the solution to cool to
24 °C in approximately 1 h. After this, 0.5 mm of each
dNTP and 5 units ofthe Klenow fragment of E. coli DNA
polymerase I were added, and incubation was followed for
1 h at 37 ° C. An aliquot of this reaction was used directly
to amplify the annealed fragments using primers MALN1
and MALC. Mutants I50K, K54V, R55A and R55K were
constructed in a similar way, using oligonucleotides I50KF
(5¢-CAACTGGTTCA
AAAACCAAAGGAA-3¢), I50KR
(5¢-TTCCTTTGGTTT
TTGAACCAGTTG-3¢), K54VF (5¢-
TAAACCARAGG
GTGCGGCAYTGGA-3¢), K54VR (5¢-
TCCARTGCCGC
ACCCTYTGGTTTA-3¢), R55AF (5¢-CA
AAGGAAG
GCGCACTGGAA-3¢), R55AR (5¢-TTCCAG
TGC
GCCTTCCTTTG-3¢), R55KF (5¢-CAAAGGAAGAA
GCACTGGAA-3¢) and R55KR (5¢-TTCCAGTGC
TTCT
TCCTTTG-3¢) to introduce the mutations. The N-terminal
arm ofthe MATa2 homeodomain (amino acids 1–9) was
introduced into theHAKN1 homeodomain using two
successive rounds of amplification with oligonucleotides
M. F. Tioni et al. KNOX homeodomain–DNA interactions
FEBS Journal 272 (2005) 190–202 ª 2004 FEBS 199
[...]... [53] The corresponding bands were excised from the gel, eluted and analysed on denaturing polyacrylamide gels To determine the position ofthe footprint within the sequence, a portion ofthe same fragment was digested with defined restriction enzymes and loaded in the same gel A DNA fragment containing two copies of oligonucleotide BS(mut1,7) was obtained and analysed in a similar way A single copy of. .. R.L.C and D.H.G are members of CONICET; M.F.T and I.L.V are fellows of CONICET Footprintinganalysis For theanalysisof hydroxyl radical footprinting patterns, a double-stranded oligonucleotide containing theHAKN1 binding site (BS1) with BamHI and EcoRI compatible cohesive ends was self-ligated through its EcoRI site to obtain a dimer and then cloned into the BamHI site of pBluescript SK– From this... manufacturers ofthe pMAL-c2 system The amount of purified protein (> 95% as judged by SDS ⁄ PAGE) was determined as described by Sedmak and Grossberg [52] DNA binding assays For electrophoretic mobility shift assays, aliquots of purified proteins were incubated with double stranded DNA (0.3–0.6 ng, 10 000 c.p.m., labeled with [32P]dATP[aP] by filling-in the 3¢-ends using the Klenow fragment ofDNA polymerase)... 94-bp fragment containing two HAKN1 binding sites in opposite orientations was obtained and labeled at one of its 3¢ ends This was accomplished by PCR using reverse and universal primers, followed by cleavage with either HindIII or XbaI (from the pBluescript polylinker), incubation withthe Klenow fragment ofDNA polymerase and [32P]dATP[aP], cleavage withthe other enzyme and purification by nondenaturing... (2001) In vitro interactions between barley TALE homeodomain proteins suggest a role for proteinprotein associations in the regulation ofKnox gene function Plant J 27, 13–23 49 Chen H, Banerjee AK & Hannapel DJ (2004) The tandem complex of BEL andKNOX partners is required for transcriptional repression of Ga20ox1 Plant J 38, 276–284 50 Lu Q & Kamps MP (1996) Structural determinants within Pbx1 that... bicoid RNA in organizing the anterior pattern ofthe Drosophila embryo EMBO J 7, 1749–1756 34 Percival-Smith A, Muller M, Affolter M & Gehring WJ ¨ (1990) TheinteractionwithDNAof wild-type and mutant fushi tarazu homeodomains EMBO J 9, 3967–3974 201 KNOX homeodomain DNA interactions 35 Passner JM, Ryoo HD, Shen L, Mann RS & Aggarwal AK (1999) Structure of a DNA- bound UltrabithoraxExtradenticle homeodomain.. .KNOX homeodomain DNA interactions MAT1 (5¢-AGGGGACATAGATTTACAAAAGAAGCTC GTCAACAA-3¢; first round, MATa2 sequences underlined) or MAT2 (5¢-CGCGAATTCAAGCCGTACAGGGGAC ATAGATTTACA-3¢; second round) together with oligonucleotide MALC (both rounds) All constructions were checked by DNA sequence analysis Expression and purification ofthe recombinant proteins were carried out as indicated by the manufacturers... within Pbx1 that mediate cooperative DNA binding with pentapeptide-containing Hox proteins: proposal for a model of a Pbx1-Hox -DNA complex Mol Cell Biol 16, 1632–1640 51 Green NC, Rambaldi I, Teakles J & Featherstone MS (1998) A conserved C-terminal domain in PBX increases DNA binding by the PBX homeodomain and is not a primary site of contact for the YPWM motif of HOXA1 J Biol Chem 273, 13273–13279... Binding ofHAKN1 to this oligonucleotide (200 000 c.p.m.) was performed at 20 °C in 15 lL of 50 mm Tris ⁄ HCl (pH 7.5), 100 mm NaCl, 10 mm 2-mercaptoethanol, 0.1 mm EDTA, 22 ngÆlL)1 BSA, 10 ngÆlL)1 poly(dI-dC) and 800 ng HAKN1 After 30 min, the binding reaction was subjected to hydroxyl radical cleavage by the addition of 10.5 lL of 200 References 1 Gehring WJ (1987) Homeo boxes in the study of development... homeodomain protein encoded by the hooded gene, k, in barley (Hordeum vulgare) FEBS Lett 408, 25–29 24 Sakamoto T, Kamiya N, Ueguchi-Tanaka M, Iwahori S & Matsuoka M (2001) KNOX homeodomain protein directly suppresses the expression of a gibberellin biosynthetic gene in the tobacco shoot apical meristem Genes Dev 15, 581–590 25 Nagasaki H, Sakamoto T, Sato Y & Matsuoka M (2001) Functional analysisofthe conserved . investigated the interaction of the
homeodomain of the KNOX protein HAKN1 with
DNA. As no structural studies on the interaction of
any KNOX protein with DNA have. Site-directed mutagenesis and footprinting analysis
of the interaction of the sunflower KNOX protein HAKN1
with DNA
Mariana F. Tioni,