1. Trang chủ
  2. » Luận Văn - Báo Cáo

Structure determination of midkine a

90 153 0

Đang tải... (xem toàn văn)

Tài liệu hạn chế xem trước, để xem đầy đủ mời bạn chọn Tải xuống

THÔNG TIN TÀI LIỆU

Thông tin cơ bản

Định dạng
Số trang 90
Dung lượng 2,52 MB

Nội dung

STRUCTURE DETERMINATION OF MIDKINE-A MENG DAN NATIONAL UNIVERSITY OF SINGAPORE 2008 STRUCTURE DETERMINATION OF MIDKINE-A MENG DAN A THESIS SUBMITTED FOR THE DEGREE OF MASTER OF SCIENCE DEPARTMENT OF BIOLOGICAL SCIENCES NATIONAL UNIVERSITY OF SINGAPORE 2008 Acknowledgement I would like to express my sincere appreciation to my supervisor, Associate Professor Yang Daiwen, for his kind guidance and trust during the two years in the course of my research project. Also thank Dr Mok Yu-Keung for his advice and critical suggestions. I would like to give special thanks to Dr Lin Zhi for his helpful advices on my project, and thank Lim Jack Wee and Dr Fang Jingsong for their help in the experiments. I also want to thank A/P Christoph Winkler and his student Yao Sheng for the functional study of Mdka. I had a pleasant learning experience here thanks to the friendship of my fellow graduates, lab mates and friends at NUS. Particularly, I am thankful to Professor Wong Sek Man, Dr Zhang Jingfeng, Yang Shuai, BC Karthik, Long Dong, Yong Yee Heng, Zheng Yu, Qin Haina, Zhang Xin, and Zhang Xiaolu. I am deeply grateful to my parents in China and my boyfriend in USA for their spiritual support and love to overcome the difficult times. Finally, I am thankful to NUS for offering me the research scholarship and the valuable opportunity here for postgraduate study. I Table of Contents Table of Contents ........................................................................................................ II Abbreviations .............................................................................................................. V List of Figures........................................................................................................... VII List of Tables............................................................................................................ VIII Summary.....................................................................................................................IX Chapter 1 Introduction................................................................................................1 1.1 Studies on Mdka and Mdkb of zebrafish: heparin binding growth factors .................................................................................................................................1 1.1.1 Biological background of zebrafish embryonic and neural development....................................................................................................1 1.1.2 Characterization of Mdka and Mdkb of zebrafish............................3 1.1.3 Biological activities of Mdka and Mdkb .............................................5 1.1.4 Zebrafish PTN: another member of Mikine family in zebrafish....10 1.2 Structural and functional studies on Midkine and PTN of human ..........10 1.2.1 Introduction.........................................................................................10 1.2.2 Protein structures................................................................................ 11 1.2.3 Receptors of MK and PTN.................................................................13 1.2.4 Biological activities..............................................................................14 1.2.5 Medical significance............................................................................15 1.3 Protein structure determination by NMR ..................................................16 1.3.1 Introduction to NMR spectroscopy ...................................................16 1.3.1.1 Development of NMR ...............................................................16 1.3.1.2 Basic theories of NMR ..............................................................17 1.3.1.2.1 Chemical shift .................................................................................17 1.3.1.2.2 J coupling ........................................................................................17 1.3.1.2.3 NOE .................................................................................................18 1.3.2 General strategies of protein structure determination by NMR ....18 1.3.2.1 Sample Preparation ..................................................................18 1.3.2.2 Spectrum collection and Resonance Assignments..................20 1.3.2.3 Collection of NMR Restraints..................................................20 1.3.2.4 Structure Calculations and Refinement..................................20 1.3.2.5 Evaluation of protein NMR structure .....................................21 1.3.3 Advantages of structure study by NMR............................................22 1.4 Objectives of this project..............................................................................22 II Chapter 2 Materials and Methods............................................................................24 2.1 NMR sample preparation.............................................................................24 2.1.1 Media....................................................................................................24 2.1.2 Preparation of DNA plasmid..............................................................24 2.1.3 Transformation of E .coli competent cells ........................................25 2.1.4 Protein expression and purification ..................................................25 2.1.4.1 Expression of unlabeled Mdka.................................................25 2.1.4.2 Expression of labeled Mdka.....................................................25 2.1.4.3 Purification of Mdka.................................................................26 2.1.5 General protein assays........................................................................26 2.1.5.1 Sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) ..........................................................................................26 2.1.5.2 Protein quantitative assay ........................................................27 2.1.5.3 Protein circular dichroism (CD) ..............................................27 2.1.5.4 Protein dynamic light scattering (DLS) ..................................28 2.2 NMR experiments .........................................................................................28 2.2.1 1D NMR experiment...........................................................................28 1 15 1 13 2.2.2 2D H- N HSQC experiment.............................................................29 2.2.3 2D H- C HSQC experiment.............................................................29 2.2.4 H/D exchange experiment ..................................................................29 2.2.5 3D HNCA and 3D HNCOCA experiments .......................................29 2.2.6 3D MQ-CCH-TOCSY experiment ....................................................30 2.2.7 4D time shared NOESY experiment..................................................30 2.3 Data processing .............................................................................................30 2.4 Resonance assignment ..................................................................................31 2.4.1 Backbone assignment..........................................................................31 2.4.2 Sidechain assignment..........................................................................32 2.4.3 NOE assignment, structure calculation and refinement .................32 Chapter 3 Results and Discussion.............................................................................34 3.1 Mdka construct .............................................................................................34 3.2 Expression and purification of Mdka..........................................................34 3.2.1 Expression of GST-Mdka in E. coli ...................................................34 3.2.2 Purification of Mdka...........................................................................34 3.3 General properties of Mdka.........................................................................37 3.4 Mdka in vivo assay: zebrafish embryonic development activity ..............37 3.5 NMR assignment of Mdka ...........................................................................43 3.5.1 Backbone assignment of Mdka ..........................................................43 3.5.2 Sidechain assignment of Mdka ..........................................................43 3.5.3 Secondary structure characterization from backbone assignment 47 3.5.4 NOE assignment..................................................................................47 3.5.5 Structure calculation and refinement................................................48 3.5.6 Disulfide restraints determination.....................................................52 3.6 NMR structure of Mdka...............................................................................55 3.7 Structural comparison to other Midkine family members........................60 3.7.1 Structure comparison of Mdka and human MK .............................60 3.7.2 Structure comparison of Mdka with human PTN ...........................64 III Chapter 4 Conclusion and Future Work .................................................................69 Chapter 5 References.................................................................................................73 IV Abbreviations 1D One-dimensional 2D Two-dimensional 3D Three-dimensional 4D Four-dimensional a.a. Amino acid E. coli Escherichia coli EDTA Ethylenediamine tetraacetic acid HSQC Heteronuclear single quantum correlation spectrum IPTG Isopropyl β-D-thiogalactoside MQ Multiple-quantum NMR Nuclear magnetic resonance NOE Nuclear Overhauser effect NOESY Nuclear Overhauser enhanced spectroscopy Ppm Parts per million RMSD/rmsd Root mean square deviation SDS Sodium dodecyl sulphate Tris 2-amino-2-(hydroxymethyl-1,3-propanediol TOCSY Total Correlation Spectroscopy Ala, A Alanine V Arg, R Arginine Asn, N Asparagine Asp, D Aspartic acid Cys, C Cysteine Gly, G Glycine Glu, Q Glutamic acid Gln, E Glutamine His, H Histidine Ile, I Isoleucine Leu, L Leucine Lys, K Lysine Met, M Methionine Phe, F Phenylalanine Pro, P Proline Ser, S Serine Thr, T Threonine Trp, W Tryptophan Tyr, Y Tyrosine Val, V Valine VI List of Figures Fig1.1 Process of neurulation .................................................................................2 Fig1.2 Organization of the neural tube ...................................................................4 Fig1.3 Sequence of Mdka. ......................................................................................6 Fig1.4 Sequence of Mdkb. ......................................................................................7 Fig1.5 Sequence alignment of MK family..............................................................8 Fig1.6 Human Midkine family sequence alignment and structure. ......................12 Fig1.7 General strategy for structure determination by NMR..............................19 Fig3.1 Final construct of Mdka.............................................................................35 Fig3.2 Mdka expression and purification by GST beads......................................36 Fig3.3 Heparin affinity column purification of Mdka. .........................................38 Fig3.4 Q-TOF mass spectrum of Mdka. ...............................................................39 Fig3.5 CD spectrum of Mdka ...............................................................................40 Fig3.6 Apparent hydrodynamic MW of Mdka from DLS measurement..............41 Fig3.7 1D and 2D NMR experiments of Mdka. ...................................................42 Fig3.8 Backbone connectivity of Mdka................................................................44 Fig3.9 Backbone assignment of Mdka..................................................................45 Fig3.10 Assigned 1H-15N HSQC spectrum of Mdka backbone peaks. .................46 Fig3.11 Chemical shift index of Mdka. ................................................................49 Fig3.12 NOE assignment. .....................................................................................50 Fig3.13 Sequential- and medium-range NOE pattern of Mdka............................51 Fig3.14 Comparison of Mdka domain structures calculated with and without disulfide constraints. .....................................................................................53 Fig3.15 Solution structure of Mdka. .....................................................................57 Fig3.16 Ramachandran plot of Mdka. ..................................................................58 Fig3.17 Configuration of proline residues. ...........................................................59 Fig3.18 Comparison of Mdka with human MK....................................................61 Fig3.19 Sequence comparison of Mdka and human MK. ....................................62 Fig3.20 Sequence alignment of Mdka and PTN. ..................................................65 Fig3.21 Secondary structure of PTN. ...................................................................66 Fig3.22 Sequence comparison of Mdka and Mdkb. .............................................68 VII List of Tables Table 1 Distances between the Cβ atoms of nearby Cys residues in the 10 conformers. ..................................................................................................54 Table 2 Structural statistics of Mdkaa ...............................................................56 VIII Summary Midkine-a (Mdka) is a heparin-binding growth factor from zebrafish, which regulates the fish brain formation during embryogenesis. Its homologs in zebrafish, Midkine-b, and in human, named Midkine, have 58% and 68% amino acid identity with Mdka respectively, but show distinctively different expression and functional patterns from Mdka. In order to understand more about how this growth factor functions, and why similar molecules can function differently, we sought to investigate their structural and mechanistic characteristics. In this study, the solution structure of Mdka is solved based on the multi-dimensional NMR spectroscopy using a novel assignment method developed by our group, which shows that this method works as well for small proteins as larger ones. Mdka contains two domains, and each of them consists of three anti-parallel β-sheets, resembling that of human Midkine. The N domain backbone architectures of Mdka and human Midkine are essentially the same, but C domains have major differences, which could be the reason for the differential functions of the two proteins. Our results provide new insights into the functional mechanism of the Midkine family; and directions for future structural study of this protein include: 1) comparing the structures of Mdka and Mdkb to understand why they have different functions; and 2) mapping the heparin binding site of Mdka to identify the specific residues important for the function of the protein. IX Chapter 1 Introduction 1.1 Studies on Mdka and Mdkb of zebrafish: heparin binding growth factors 1.1.1 Biological background of zebrafish embryonic and neural development Zebrafish Danio rerio is a common and useful model organism for studies of vertebrate development and gene function (Mayden et al., 2007), since zebrafish embryos are large, robust, and transparent, and have the properties of developing rapidly and externally to the mother, characteristics which all facilitate experimental manipulation and observation (Dahm, 2006). Like other vertebrates’ embryo development, zebrafish development consists of stages of fertilization, cleavage, gastrulation, organogenesis and hatching. During gastrulation, three germ layers are formed in the early embryo: endoderm, mesoderm and ectoderm, which develop into different groups of the tissues. Among them, the ectoderm layer is supposed to develop into future neural systems by neurulation. Neurulation begins with the signals emitted from one discrete stripe of mesoderm to the ectoderm instructing the cells of the ectodermal tissue to change shape (Fig1.1). This process is coordinated as a whole, where the cells migrate to form a neural tube. The neural tube is the embryo's precursor to the central nervous system, which comprises the brain and spinal cord. During its formation, cells at the edges of the dorsal midline will sort themselves out by sticking to the non-neural ectoderm or sticking with the neural ectoderm. The former would develop into the epidermal epithelium while the latter would become part of the neural tube epithelium. 1 A Fig1.1 Process of neurulation. (Kristjan R. Jessen & Rhona Mirsky, 2005) 2 However, of unusual significance are some in-between cells that form a loose aggregate in the space when the two epitheliums separate. These are neural crest cells which quickly migrate during or shortly after neurulation. Following their migration the neural crest cells differentiate into a wide variety of cell types throughout the body (Bronner-Fraser, 1995). The junction between neural and non-neural ectoderm, also called neural plate border, also gives rise to another kind of cells called Rohon-Beard (RB) sensory neurons. Different from neural crest cells, RB sensory neurons do not migrate away from the neural tube upon its closure, but remain within the central nervous system instead, and eventually comprise bilateral stripes in the developing dorsal spinal cord (Roberts, 2000). In the neural tube, a signaling center called the roof plate is formed when the Bone morphogenetic protein (BMP) from the epidermis permeated in. Meanwhile, in the ventral floor of the neural tube, a floor plate is formed due to Sonic hedgehog (Shh) protein from the notochord. These two molecules form a double gradient by spreading through the neural tube, and the identity of neurons in the tube can be determined by the relative concentrations of the two factors (Fig1.2). Therefore, the two signaling centers, the floor plate and the roof plate, affect the cell differentiation according to the position in the neural tube, thus determine the pattern formation in the embryonic spinal cord. At the dorsal mesoderm, cells that adjacent to the notochord during vertebrate organogenesis form transient structures called somite, which define the segmental pattern of the embryo, and subsequently give rise to vertebrae and ribs, dermis of the back, and skeletal muscles of the back, body wall and limbs (Wikipedia). 1.1.2 Characterization of Mdka and Mdkb of zebrafish 3 Fig1.2 Organization of the neural tube. Mdka regulates floor plate formation. (Schäfer et al., 2005; Schäfer et al., 2007, Squire et al., 2003). 4 Mdka and Mdkb are the two heparin binding growth factors in zebrafish, while in mammals there is only one mdk gene encoding one secreted heparin binding growth factor. Mdkb was identified using an expression-cloning screen for neural inducing factors (Winkler and Moon, 2001). Using Mdkb as a query, Mdka was identified from searching the EST database, showing only ~70% nucleotide identity with Mdkb, 68% amino acid identity, and 72% similarity with Mdkb (Winkler et al., 2003). Both Mdka and Mdkb contain more basic residues than acid ones, being positive charged proteins with theoretic isoelectric point (PI) of 9.59 and 9.53 respectively. Both proteins are soluble proteins according to the hydropathy profiles (Fig1.3 & Fig1.4). The N terminals of both proteins are highly hydrophobic and the first 23 amino acids of each protein are supposed to form a signal peptide. The rest of the amino acids are mainly hydrophilic although there are two short hydrophobic segments at the amino acid positions of 43-47 and 110-116 in Mdka (Fig1.3). Mdka shows higher sequence identity (58%) with human Midkine compared to Mdkb (51%). Sequence comparison between the zebrafish Midkines and homologs from other species revealed the presence of 10 conserved cysteine residues, two clusters of basic residues (cluster I and cluster II) that are important for heparin binding, and a highly conserved hinge region that separates the amino-and carboxyterminal domains (Iwasaki et al., 1997; Winkler, 2003; Fig1.5). In all fish Midkines, a highly conserved Arg residue that is essential for binding of Midkine to its receptor PTPζ is also present (Maeda et al., 1999). Two glutamine residues that are important for the formation of covalently linked dimers by tissue-type 2 transglutaminases are conserved in zebrafish (Kojima et al., 1997). 1.1.3 Biological activities of Mdka and Mdkb 5 A B Fig1.3 Sequence of Mdka. (A) The nucleotide sequence of the Mdka cDNA and its deduced amino acid sequences. (B) Hydropathy profile of Mdka (Software from ExPSAy, ProtScale). 6 A B Fig1.4 Sequence of Mdkb. (A) The nucleotide sequence of the Mdkb cDNA and its deduced amino acid sequences. (B) Hydropathy profile of Mdkb (Software from ExPSAy, ProtScale). 7 Fig1.5 Sequence alignment of MK family. (Chang et al., 2004) heparin binding; cluster I a.a. for cluster II a.a. for heparin binding. 8 Winkler (2003) first reported the expression patterns and biological activities of Mdka and Mdkb, revealing their important roles in the early neural development of zebrafish. Both Mdka and Mdkb are expressed in very dynamic and regionally restricted patterns during embryogenesis. Spatially, Mdkb expression is restricted to the dorsal regions of the developing nervous system, but Mdka is strongly expressed in a broad area in the central neural tube. At the beginning, Mdka expression is found in the paraxial mesoderm, which indicates the role of Mdka in somite formation. At 16-h post fertilization (hpf), Mdka expression is found in epithelialized somites and also in a very dynamic fashion in the head region and in the spinal cord. Its expression is excluded from dorsal neural tube and the floor plate, which is the ventral most structure in the neural tube. In addition to spatial differences, the onset of transcription also differs for both genes. Mdkb expression starts during early gastrulation, about 4 h prior to Mdka that is first detectable at 10 hpf. Onset of Mdka expression is at tailbud stage in two lateral clusters of cells of the paraxial mesoderm (Fig 1.2B). Since Mdka and Mdkb show an overall 68% amino acid identity and the heparin binding as well as the hinge region are highly conserved, it opens to the question whether these two proteins share similar activities. To answer this question, an over-expression approach was used, which was to ectopically express mdka and mdkb genes after microinjection of in vitro-synthesized RNA into early zebrafish embryos. The resulting embryonic development was significantly different. Over-expression of Mdka affected the formation of somites and neural tube (Winkler et al., 2003) while injection of mdkb RNA resulted in generally posteriorized embryos lacking head structures rostral to the mid-hindbrain boundary (Winkler and Moon, 2001). Therefore, both secreted growth factors, although structurally related, show different functions, and it would be interesting to investigate the functions of each 9 protein and why such structurally related proteins would have functional divergence. Although the exact functional mechanisms are not known, Mdka is reported to be required for the formation of the medial floor plate (Schäfer et al., 2005). On the contrary, Mdkb is the downstream signaling factors of several pathways, most notably retinoic acid, although the processes are still unclear. Other possible pathways include FGF signaling pathway, Wnt signaling pathway, and BMP signaling pathway. In addition, Mdkb controls the cell determination at the neural plate border and affects the formation of neural crest cells and Rohon-Beard sensory neurons (Liedtke et al., 2008). As secreted growth factors, both proteins might bind to the transmembrane heparan sulfate proteoglycans. These receptors have the heparin-like carbohydrates; thus the growth factors for these kinds of receptors are classified as heparin-binding growth factors. 1.1.4 Zebrafish PTN: another member of Mikine family in zebrafish Zebrafish heparin-binding neurotrophic factor (HBNF or PTN) is another secreted heparin-binding protein which is highly basic and contains ten cysteine residues (Chang et al., 2004). The amino acid sequence comparison shows that it displays 60% identity to human PTN, while only 39%-40% identity to human midkine, Mdka, and Mdkb (Fig1.5). During zebrafish embryogenesis, the expression of zebrafish PTN was observed upon fertilization and onwards. In adult fish, it is highly expressed in brain and intestine. The in vivo biological function of zebrafish PTN is found to promote neurite outgrowth. 1.2 Structural and functional studies on Midkine and PTN of human 1.2.1 Introduction 10 Midkine protein (MK) and Pleiotrophin (PTN; also called HB-GAM) are the two members of the MK family in human. Both proteins were discovered two decades ago (Kadomatsu et al., 1988; Rauvala, 1989), and have been studied extensively in vitro since then. As extracellular signaling molecules, MK and PTN play pivotal roles in neural development, pathogenesis of neurodegenerative diseases, and cancer development (Kadomatsu et al., 2004). 1.2.2 Protein structures MK precursor protein with an intact signal peptide (22 amino acids) has a molecular mass of 15.6 kDa whereas the matured form of MK is calculated to be 13.2 kDa. Mature PTN protein, which shares 45% amino acid sequence identity with MK (Fig1.6A), has a molecular weight of 19 kDa. The individual domain structures of MK have been solved by NMR (Iwasaki et al., 1997; Fig1.6B), and the NMR analysis has demonstrated that the three dimensional structures of MK and PTN are very similar (Kilpeläinen et al., 2000). Both proteins are essentially composed of two domains, the N-terminally located domain and the C-terminally located domain. These two domains are held by five disulfide bridges from ten conserved cysteine residues. Each domain contains three anti-parallel β-sheets. It is noteworthy that the heparin binding site of MK is mainly located in the C-domain which is responsible for most of the Midkine activities (Muramatsu et al., 1994 ), while that of PTN is equally distributed in both N- and C-domains (Kilpelainen et al., 2000). In the C domain of MK, two heparin binding clusters have been identified (Fig1.6B). Cluster I contains residues K79 (βC2), R81 (βC2), and K102 (βC3), all of which are located on the two β-sheets. All these three residues are conserved in the MK family from invertebrates to vertebrates. Residues, K86, K87, and R89, from the long loop between βC2 and 11 A B N domain C domain Fig1.6 Human Midkine family sequence alignment and structure. (A) Sequence alignment of MK and PTN (Kadomatsu and Muramatsu, 2004). (B) NMR solution structure of human MK domains (Muramatsu, 2002). 12 βC3 form cluster II (Iwasaki et al., 1997; Fig1.6B). Upon binding to heparin ligand, MK could form a dimer, and this dimerization is important for the activity of MK. Three glutamine residues in human MK have been identified to be responsible for the transglutaminase-mediated dimerization (Kojima et al., 1997). 1.2.3 Receptors of MK and PTN Heparin binding growth factors, like MK and PTN, could bind to a variety of transmembrane receptors on neurons and osteoblasts. Four kinds of receptors have been identified to have interactions with the MK family so far, including N-syndecan (Raulo et al., 1994; Mitsiadis et al., 1995; Kojima et al., 1996; Nakanishi et al., 1997), receptor-type protein tyrosine phophafase ζ (PTPζ) (Maeda et al., 1996; Maeda et al., 1998; Maeda et al., 1999), anaplastic leukemia kinase (ALK) (Stoica et al., 2001; Stoica et al., 2002), and low density lipoprotein receptor-related protein (LRP) (Herz et al., 2000). N-syndecan belongs to the syndecan family that comprises four transmembrane heparan sulfate proteoglycans. Binding of MK and PTN to syndecans is mediated by the heparin sulfate chain, which is the heparin-like domain. This domain consists of a variably sulfated repeating disaccharide unit. PTPζ is transmembrane protein with chondroitin sulfate chains, where MK and PTN bind, and an intracellular tyrosine phosphatase domain. LRP and ALK can bind MK and PTN via extracellular potion of the receptor, but how exactly they interact is not clear. These receptors might be differentially utilized for specific biological activities such as neurite outgrowth (Li et al., 1990; Rauvala et al., 1994; Kaneda et al., 1996a), nerve cell migration (Maeda et al., 1998; Maeda et al., 1999), neurodegenerative diseases (Wisniewski et al., 1996; Yasuhara et al., 1993), and cancer (Kadomatsu et al., 2004). Many of the down-regulation pathways are still unclear, but it is pointed 13 out that the intracellular signaling of N-syndecan family receptors involves cortactin-src pathway, PTPζ and ALK receptors’ signaling pathways involve PI3-kinase and Erk, and the LRP receptor is responsible for the anti-apoptotic activity of MK. 1.2.4 Biological activities MK and PTN are strongly expressed during embryogenesis, but their expression in adults are only restricted to kidney and tumors, and very low levels in brain (Kato et al., 2000; Mishima et al., 1997). Both MK and PTN have diverse biological functions, among which their roles in neural development and pathological aspects are of significant importance (Kadomatsu and Muramatsu, 2004). Both MK and PTN have been reported to be involved in the neurite outgrowth and nerve cell migration. MK also shows neuroprotective activity by preventing cell degeneration (Unoki et al., 1994). As to neurodegenerative diseases, both MK and PTN are reported to deposit at the senile plaques in Alzheimer’s patients. MK is also found at the neurofibrillary tangles in Alzheimer’s patients. Since MK and PTN can bind to LRP, and LRP has featured roles in Alzheimer’s disease, thus MK and PTN might be related to the pathogenesis of Alzheimer’s disease. Besides, MK knockout mice show that MK is involved in the pathogenesis of interstitial nephritis and vascular restenosis. The pathogenesis of both diseases might result from the recruitment of inflammatory cells by MK. But no receptors and signaling pathways have been discussed on this subject. Moreover, both MK and PTN are reported to be involved in cancer-related activities. For normal tissues of human adults, MK shows restricted expression, but for carcinoma specimens, MK expresses at a high level in a tissue type-independent manner, and its expression is more intensely and in a wider range of human 14 carcinomas than PTN. It is also reported that MK can promote cell growth, cell survival, and cell migration. And MK is shown to be highly expressed in all stages of neuroblastomas (malignant tumor that usually metastasizes quickly), and weakly in ganglioneuromas (benign tumors), while PTN expression can be high in both. Hitherto the exact signaling pathways involved in all these activities are still unknown, but it is possible that MK and PTN have different receptors or signal transduction pathways. 1.2.5 Medical significance Since MK and PTN are closely involved in several diseases, especially cancer, their medical significances were proposed by Kadomatsu and Muramatsu (2004). One possible application is that they could be used as tumor markers, since there are elevated blood levels of both proteins in cancer patients, and the protein level would be decreased when the tumors are removed. Besides, the blood level of MK is significantly correlated with prognostic factors of neuroblastomas, while PTN level is reported to be correlated with the prognosis in pancreatic carcinoma patients. Another possible application is that they might be used as gene therapy targets since a suicide gene under the control of the mdk promoter is identified as a highly potential strategy for curing carcinomas. The third application is that they could be the molecular targets of therapy for carcinomas. Since both in vitro and in vivo assays using mice have demonstrated that antisense MK oligodeoxyribonucleotides suppressed cell growth, anchorage-independent growth, tumor growth of mouse colorectal carcinoma cells, and pre-grown tumors in nude mice via atelocollagen-mediated gene transfer; disruption of MK or its signaling pathway could be a strong means of curing human carcinomas. Similar situations applies to PTN, except that it is PTN-targeted 15 ribozymes (enzymatically active RNAs, i.e. ribozymes, that were designed to specifically cleave the PTN mRNA thus leading to significantly reduced PTN levels) that can suppress the growth of tumors, and a part of the N-terminal of PTN shows a dominant-negative form in the growth of human breast cancer cells. Therefore, PTN and its signaling molecules could also be a molecular targeting candidate for cancer therapy. 1.3 Protein structure determination by NMR 1.3.1 Introduction to NMR spectroscopy 1.3.1.1 Development of NMR NMR development began in the area of physics in 1938 when Isidor Rabi (Nobel Prize 1944) firstly described and measured the resonance in molecular beams. Eight years later, Purcell et al., and Bloch independently refined the technique and reported the phenomenon of NMR, for which they shared the Noble Prize in Physics in 1952. In the 1940s and 1950s, many basic parameters and theories of NMR were discovered such as chemical shift, spin-spin coupling, spin relaxation, and Overhauser effect, which laid the foundation of NMR method development. In 1966, Fourier Transform (FT) techniques were introduced by Ernst, and later he developed the multi-dimensional NMR, thus began the era of FT NMR (Nobel Prize 1991). Its application in biological sciences was marked by Kurt Wüthrich when he developed the NMR spectroscopy for determining 3D structures of biological macromolecules in solution (Nobel Prize 2002). Today, with the accessibility of improved NMR hardware and better developed NMR methodology, NMR has gained wide application in the areas of chemistry, biology and medicine. In biological sciences, with the development of molecular biology and 16 biochemical methods for preparing and isolating isotope labeled biomolecules, NMR has become a powerful tool for biomolecule structure characterization, protein dynamics, and identification of bioactive compounds. 1.3.1.2 Basic theories of NMR 1.3.1.2.1 Chemical shift Neutrons and protons that composing an atomic nucleus have an intrinsic quantum mechanical property called spin. Spins within different chemical environments would have different resonance frequencies. This is because of the electrons that circulating about the direction of the applied external magnetic field. The circulation would cause a small magnetic field at the nucleus; thus the actual magnetic field at the nucleus is different from the applied one. As in an NMR spectrum, each nucleus gives rise to a resonance which is characterized by chemical shift reflecting its unique chemical environment. Chemical shift is an important parameter for identifying individual nucleus and assigning the resonances in the spectrum to their corresponding atoms in a molecule. Moreover, characteristic chemical shifts in an amino acid residue can be helpful for identifying individual residues in a protein sequence and determining protein secondary structure (Wishart et al., 1991). Chemical shift is measured in parts per million (ppm) in order that the value is independent of the static magnetic field strength. 1.3.1.2.2 J coupling Scalar couplings or J-couplings are mediated through chemical bonds between two spins. The energy levels of each spin are slightly altered depending on the spin state of a scalar coupled spin (α or β). This gives rise to a splitting of the resonance 17 lines. Scalar couplings are used in multidimensional (2D, 3D, 4D) correlation experiments to transfer magnetization from one spin to another in order to identify spin systems, e.g. spins which are connected by not more than three chemical bonds (Sattler, 2004). 1.3.1.2.3 NOE The nuclear Overhauser effect (NOE) is a result of cross-relaxation between dipolar coupled spins as a result of spin/spin interactions through space. The NOE allows to transfer magnetization from one spin to another through space, and scales 6 with the distance r between the two spins (NOE ~ 1/r ), e.g. two protons in a protein. When interproton distances are larger than 5 Å, the NOE may be too small to be observed. Therefore NOEs are related to the three-dimensional structure of a molecule (Sattler, 2004). 1.3.2 General strategies of protein structure determination by NMR 1.3.2.1 Sample Preparation The protein sample for NMR experiments is often prepared by overexpressing proteins in a host, like E. coli or yeast, and the media used to incubate the host is enriched with 13 C and 15 N isotopes. About 0.5 ml protein solution with a concentration of ~1 mM protein is usually required in structure elucidation. Different buffer conditions (pH, salt concentrations, and types of buffer) need to be tested to optimize the quality of the NMR spectra and to avoid protein aggregation. Sometimes, the native protein needs to be modified, such as cutting the protein into separate domains, or making constructs with various sizes and domain boundaries, in order to obtain samples which give good NMR spectra. Before labeling the protein with 13 C 18 Sample preparation Cloning, expression, purification, isotope labeling Resonance assignment Backbone, sidechain, NOE Distance and torsion angle constraints NOE, dihedral angles, hydrogen bond, disulfide bond Structure calculation and refinement DYANA, CNS, structure energy minimization Evaluation of structure quality Fig1.7 General strategy for structure determination by NMR. 19 and 15 N isotopes, unlabeled sample can be used to determine if a protein sample 1 prepared under a certain condition is folded or not using 1D H NMR experiment. 1.3.2.2 Spectrum collection and Resonance Assignments A set of heteronuclear multidimensional NMR experiments and NOESY experiments (From 2D to 4D) are recorded for the assignment of the chemical shifts of all spins and NOE peaks. The total time for NMR data acquisition varies from ten days to four weeks depending on the experiments adopted. The chemical shift assignments of backbone and sidechain atoms are typically obtained from the triple-resonance experiments, which can be done manually or automatically. The chemical shifts alone would allow the determination of protein secondary structure. H-H NOE peaks observed in the NOESY spectrum can be used to assign the inter-atomic distances. 1.3.2.3 Collection of NMR Restraints Besides the distance constraints derived from NOE, other structural 3 information can be obtained for structure calculation. These include J-coupling constants that provide dihedral angles information, residual dipolar couplings that show information about bond projection angles, and solvent exchange that measures α β α hydrogen bonds. Torsion angles can also be predicted based on C , C , H chemical shifts with the program TALOS (Cornilescu et al., 1999). 1.3.2.4 Structure Calculations and Refinement A restrained molecular dynamics/simulated annealing (MD/SA) or torsion angel dynamics (TAD) is chosen for structure calculation, which would use all the 20 available structural parameters mentioned above. Structure calculation with NOE assignment is an iterative process. The result of an NMR structure calculation is an ensemble of structures, all of which are consistent with the experimental NMR data. The calculated structures should converge to the same fold. Poor convergence may indicate problems of the experimental constraints and/or the MD/SA/TAD protocol (Sattler, 2004). 1.3.2.5 Evaluation of protein NMR structure Factors for evaluation of protein NMR structures include the root-mean-square deviation (RMSD), Distance constraints violation, dihedral angle violation, electrostatic potential term, van der Waals force, and the Ramachandran plot. The RMSD indicates the coordinate precision of the structure ensembles calculated. The smaller the value of RMSD between the calculated conformers and the mean structures (usually less than 1 Å), the better the quality of the structures calculated. Distance constrains violation should be within 0.5 Å, while dihedral angle violation should be within 5°. As for the electrostatic potential term, negative results indicate good structure quality. Besides, the number of bad contacts, e.g. the number of restraints with too short atom-atom distances in a molecule should be small, otherwise the experimental constraints might be applied too strong and cause van der Waals clashes. Furthermore, the Ramachandran plot, which specifies the fraction of backbone φ,ψ angles which correspond to favored, additionally allowed, generally allowed and disallowed conformations based on statistical analysis of high-resolution crystal structures, is a good quality measure. Most φ,ψ angles should be within the allowed regions, while poor quality structures usually have a large number of φ,ψ angles in forbidden regions in the Ramachandran plot. 21 1.3.3 Advantages of structure study by NMR Nuclear Magnetic Resonance, X-ray Crystallography, and Cryo-electron Microscopy are the only three methods to solve protein structures so far. Only Nuclear Magnetic Resonance and X-ray Crystallography can be used to solve high resolution structures of proteins, nuclear acids and their complexes. The uniqueness of NMR methods is that the samples are in solution. The pH, salt concentration and temperature can all be adjusted so that the biomoleucles’ physiological condition can be mimicked. Besides, NMR can be used to study the dynamics of biomoleucles, to investigate structural, thermodynamic and kinetic aspects of interactions between proteins and other solution components. 1.4 Objectives of this project Mdka and Mdkb are new heparin binding growth factors found in zebrafish seven years ago, and many functions of the proteins are still unclear. These two proteins share high sequence identity, but perform different expression patterns and functions. On the one hand, Mdka, expressed in the paraxial mesoderm, is involved in the medial floor plate formation. On the other hand, the role of Mdkb is proposed to be specifying the formation of neural crest cells and Rohon-Beard sensory neuron by acting downstream of several signaling pathways at the neural plate border near the roof plate. Therefore it is interesting to understand why two growth factors with high sequence similarity would have different functions in the embryogenesis of zebrafish. In this study, the structure of Mdka is characterized. The major objective of this study is to elucidate the structural information of this protein, and compare the structure of Mdka with those of other homologs to understand the functional prosperities of Mdka. 22 The structural study of Mdka would lay foundation for the functional study of this protein family, such as the binding properties of the protein with heparin ligand and receptors. By comparing the structures of Mdka and Mdkb, we might gain new insights into the functions of the two proteins. 23 Chapter 2 Materials and Methods 2.1 NMR sample preparation 2.1.1 Media LB Broth: 1 L culture contains 10 g tryptone, 5 g of yeast extract, and 10 g of NaCl and 100 μg/ml ampicilin LB Agar: 1 L culture contains 10 g tryptone, 5 g yeast extract, 10 g NaCl, 15 g agar and 100 μg/ml ampicilin M9 minimal medium: 1 L culture contains 1 g NH4Cl, 2 g Glucose, 0.0147 g CaCl2(H2O), 0.493 g MgSO4(7H2O), 7.52 g Na2HPO4, 3 g KH2PO4, 0.5 g NaCl and 100 μg/ml ampicilin 2.1.2 Preparation of DNA plasmid The gene encoding Mdka protein was obtained from our collaborator, Prof Winkler (Dept. of Biological Sciences, NUS), in the template plasmid pCS2 Zmk-2. Oligonucleotide primers were used to amplify the mdka gene. The sense primer is 5’-CGCGGATCCAAAAACAAGAAAGAGAAGAA-3’ and antisense primer is 3’-GGGGATATCACTCAGCATAAT-5’. DNA gel electrophoresis (Agarose from Bio-Rad) was subsequently performed to separate the template plasmid from the desired PCR products. The recovered PCR products were digested with BamHI and XholI restriction enzymes (New England Biolabs). After purification using the QIAquick Gel Extraction Kit 250 (QIAGEN), the mdka DNA fractions were ligated to the pGEX-4T-1 vector, which has a GST tag. The obtained Plasmids were sequenced (reaction Kit from Perkin Elmer) using the automated DNA sequencer for verification. 24 2.1.3 Transformation of E .coli competent cells The mdka plasmids were transformed into E. coli BL21 (DE3) competent cells and were spread onto LB agar plates containing the ampicilin antibiotic selective for the plasmid. The plates were incubated for 12~16 hours at 37 °C to enlarge the bacterial colony. 2.1.4 Protein expression and purification 2.1.4.1 Expression of unlabeled Mdka The construct containing the mdka gene was expressed in the E. coli Strain BL21 (DE3) using LB medium. Procedure begins with inoculating one single colony containing mdka plasmid in 5 ml LB culture with 100 μg/ml ampicilin at 37 °C shaking overnight at 200 rmp. Then the overnight culture was transferred to 1 L fresh LB broth to inoculate at the same condition until the OD600 was around 0.4. To have an expression control, 1 ml of the culture was taken out, centrifuged, and the pellet was re-suspended with 50 μl SDS-PAGE sample buffer. The rest of the culture was induced with 0.05 mM IPTG at 16 °C for 12 hours. 1 ml of the induced culture was taken out, centrifuged, and the pellet was re-suspended with 50 μl SDS-PAGE sample buffer to test the protein expression level. The rest of the culture was centrifuged and pellets were collected for protein purification. 2.1.4.2 Expression of labeled Mdka The expression protocol was quite similar as that of unlabeled Mdka except 15 that the culture used was M9 medium. The culture used for N labeled Mdka was M9 15 15 13 medium with N labeled NH4Cl, and that for N and C labeled Mdka was medium with 15 N labeled NH4Cl and 13 C labeled glucose. After the cell density reached 25 OD600 around 0.6, 0.3 mM IPTG was added to induce protein expression at 20 °C for 20 hours. 2.1.4.3 Purification of Mdka The cell pellets collected by centrifuge were re-suspended with lysis buffer (20 mM Tris buffer, pH 8.0, 150 mM NaCl). After sonication thoroughly, the cell lysate was centrifuged to remove the unsonicated pellets. The supernatant was mixed with GST beads which have been wash with water and equilibrated with lysis buffer. The mixture was shaken at 4 °C for one hour followed by separating the resin from the flow-through. The resin was washed with lysis buffer four times to wash out the unspecific bindings. Then 10 U thrombin was added to the resin with 5 ml thrombin cleavage buffer (20 mM Tris buffer, pH 8.0, 150 mM NaCl, 2.5 mM CaCl2) to cleave GST tag from Mdka protein. Finally, the untagged protein was eluted with lysis buffer. After GST beads purification, there were still a few impurities. Therefore, heparin affinity column (HiTrap Heparin HP Column, GE Healthcare) was used with linear elution to further purify Mdka. The starting buffer was 20 mM Tris buffer, pH 8.0, 150 mM NaCl, 2 mM EDTA, while the elution buffer was 20 mM Tris buffer, pH 8.0, 2 M NaCl, 2 mM EDTA. The peak fractions collected from the FPLC machine were combined and concentrated for further protein assays. 2.1.5 General protein assays 2.1.5.1 Sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) 12% and 15% SDS-PAGE were used to analyze the protein samples in 1 × sample buffer with electrophoresis apparatus (Bio-Rad). The 5 × sample buffer used 26 was 0.2 M Tris-HCl, pH 6.8, 25% (v/v) glycerol, 25% (v/v) SDS, 12.5% (v/v) 2-mercaptoethanol, 0.005% (w/v) bromophenol blue. The running buffer was 25 mM Tris, 192 mM glycine and 0.1% (v/v) SDS. After mixing the protein sample with the sample buffer, the mixture was heated for 5 min before loading into the well of the SDS-PAGE. When the gel finished running, coomassie-blue solution [0.05% (w/v) Coomassie brilliant blue R, 25% (v/v) isopropanol, 10% (v/v) acetic acid] was used to stain the gel for 30 min followed by distaining with 10% acetic acid. 2.1.5.2 Protein quantitative assay The protein concentration could be determined by UV absorbance at 280 nm, which was convenient and fast to use. The concentrated protein was diluted so that the UV absorbance was between 0.1 and 1.0 to ensure accuracy of the method. The concentration of the protein was calculated by the equation: A = εIC, where A stands -1 -1 for absorbance of UV, ε is the molar absorption coefficient (M cm ), I denotes the path length (cm), and C is the protein concentration (M). The value of ε can be estimated using the ProtParam in the ExPASy Proteomics Server. 2.1.5.3 Protein circular dichroism (CD) Far UV Circular Dichroism Spectroscopy was used to evaluate the secondary structure of Mdka. Experiments were carried out on a Jasco-810 spectropolarimeter (Jasco International Co. Ltd.) with a 0.1 nm quartz cuvette with a minimal sample volume of 250 μl. The protein sample was prepared by dialyzing against 10 mM phosphate buffer, and the protein concentration was within the range of 10~100 μM. The result was the average of 10 scans so that the background noise could be reduced. 27 2.1.5.4 Protein dynamic light scattering (DLS) To see if Mdka was still in the monomer form when the protein concentration was as high as 1 mM, Dynamic Light Scattering (Protein Solutions Inc) was used to evaluate the molecular weight of the protein particle. The protein sample was centrifuged at 15,000×g for 30 min at 4 ºC before adding the sample into a cuvette. The experiment was carried out at 30 ºC with 1 mM Mdka in the same buffer as the final NMR sample (20 mM phosphate buffer, pH 6.5, 100 mM NaCl, 2 mM EDTA). The maximal acquisition time was 10 sec and the data were analyzed using Dynamics 5.0 software (Moradian-Oldak et al., 1998), where the standard curve of globular protein was selected for MW model. 2.2 NMR experiments Bruker Avance 500 MHz and 800 MHz spectrometer equipped with pulse field gradient units and actively shielded cryoprobes were used to perform NMR experiments. Software used to process and analyze NMR data involved NMRPipe software (Delaglio et al., 1995) and NMRView (Johnson, 2004) on Linux workstations. A homemade NMRview plug-in was installed to assist the 4D NOE 1 15 1 13 assignment (Written by Lin Zhi). 2D H- N HSQC, 2D H- C HSQC, 3D HNCA, 3D HNCOCA, 3D CCH-TOCSY, and 4D time shared NOESY experiments were used to do backbone, side-chain, and NOE assignments (Xu et al., 2006). 2.2.1 1D NMR experiment 1D NMR experiments were used to test whether the protein was folded properly in in-vitro condition so that the following NMR experiments could be performed. Unlabeled Mdka sample was used to acquire data at 20 ºC in H2O 28 (10%D2O) eluted using 20 mM Tris buffer, 100 mM NaCl, 2 mM EDTA at pH 8.0, while experiment on labeled sample was recorded at 30 ºC. 1 15 2.2.2 2D H- N HSQC experiment 1 15 A 2D H- N HSQC experiment was used to show the dispersion of Mdka backbone amide signals, from which we could tell whether the protein was in 15 aggregation form. It was also used as a reference for backbone assignment. Both N 13 15 labeled and C, N labeled samples were used to record the experiments on the 800 Hz and 500 Hz NMR at 30 ºC. 1 13 2.2.3 2D H- C HSQC experiment 1 13 A 2D H- C HSQC experiment with aliphatic C-H correlation information 13 was performed, which was used as a reference for the sidechain assignment. A C, 15 N labeled sample was used to record the experiments on the 800 Hz NMR at 30 ºC. 2.2.4 H/D exchange experiment Hydrogen bond restraints were obtained by observing the presence of any hydrogen-bonded amide proton peaks in the HSQC spectrum. These amide protons 1 15 were resistant to exchange with deuterons in D2O. A H- N HSQC spectrum was recorded within 20 min after changing the sample buffer to D2O buffer. 2.2.5 3D HNCA and 3D HNCOCA experiments An HNCA experiment, which is often used in tandem with an HNCOCA experiment, can provide linkage information for the adjacent residues in a protein 29 sequence. In HNCA experiment, the magnetization of the amide proton of an amino acid residue is transferred to the amide nitrogen, and then to the alpha carbons of both the starting residue and the preceding residue in the protein's amino acid sequence. In contrast, the complementary HNCOCA experiment transfers magnetization only to 13 15 the alpha carbon of the preceding residue (Bax et al., 1991). A C, N labeled sample was used to record the experiments on the 800 Hz NMR at 30 ºC. 2.2.6 3D MQ-CCH-TOCSY experiment Like the traditional 3D HCCH-TOCSY experiment, the 3D MQ-CCH-TOCSY can correlate the sidechain aliphatic protons with the carbon spins in the same residue. 13 15 But the 3D MQ-CCH-TOCSY is more sensitive (Yang et al., 2004). A C, N labeled sample was used to record the experiment on the 800 Hz NMR at 30 ºC. 2.2.7 4D time shared NOESY experiment A time-shared 4D NOESY experiment (Xu et al., 2007) that records HN-HN NOE, HN-HC NOE, HC-HN NOE, and HC-HC NOE was used, which greatly reduced the time needed for data collection. The time-shared 4D NOESY spectrum can be split into three sub-spectra (4D NOESY and 4D 13 13 15 15 N, N-edited NOESY, 4D 13 13 15 C, N-edited C, C-edited NOESY) during data processing. A C, 15 N labeled sample was used to record the 4D NOESY on the 500 Hz NMR at 30 ºC. 2.3 Data processing NMR data were processed with NMRPipe, and NMRDraw was used to phase the spectra. All processed spectra were converted to the NMRView format for further analysis by NMRView. 30 2.4 Resonance assignment 2.4.1 Backbone assignment Peak picking of all NMR spectra was performed using the automatic function 1 15 in NMRview. All cross peaks from H- N HSQC, HNCA, HNCOCA, and 4D 13 15 15 1 C, N-edited NOESY sub-spectrum were grouped into clusters based on their H, N chemical shifts with tolerances of 0.02 ppm for HN from intralist and interlist, 0.2 ppm for nitrogen chemical shifts and 0.3 ppm for carbon chemical shifts. Chemical 1 15 shifts of the aliased peaks in the spectra were corrected according to H- N HSQC. Each cluster usually contains one HNCA peak of residual i, its preceding residual i-1, and a few HC-HN NOE peaks. Among these NOE peaks, those belonging to residual i and i-1 could be identified and differentiated from long range NOE peaks by comparing the TOCSY slices defined by the (H, C) chemical shifts of each NOE peak. α Those NOE peaks that have matched C chemical shifts with the HNCA and α HNCOCA of residual i belong to the intra-residual and sequential NOEs. Since C chemical shifts alone can be ambiguous for assignment, MQ-CCH-TOCSY that provides sidechain chemical shift is used together to confirm assignment. Spin-system classification by residue type was done according to the chemical shifts of the spins in a spin-system. Four types of amino acid residues (Ala, Thr, Ser, and Gly) have α β characteristic C and/or C chemical shifts, which were unambiguously recognized. Val and Leu were easily classified based on their characteristic sidechain chemical shifts, and Arg, Lys and Pro were also distinguished according to sidechain chemical shifts. The spin-systems were linked to build fragments when each two have identical α C chemical shifts, the largest number of matched NOE peaks and no inconsistent (C, 31 H) chemical shifts. Finally, fragments containing the characteristic residues can be mapped onto the amino acid sequence of the Mdka by matching the positions of those characteristic residues. A general protocol for backbone sequential assignment is available online at http://www.natureprotocols.com/2006/11/09/noesybased_strategy_ for_assign.php. 2.4.2 Sidechain assignment Most of the sidechain assignments were already done when doing backbone assignment (Xu et al., 2005). The H and C chemical shifts were extracted from the 4D 13 15 C, N-edited NOESY subspectrum. Some sidechain chemical shifts were assigned based on the NOE peaks of two adjacent spin-systems. The 3D MQ-CCH-TOCSY spectrum was used in combination with the 4D 13 13 15 C, N-edited NOESY and 4D 13 C, C-edited NOESY sub-spectra to perform or confirm the assignments. 2.4.3 NOE assignment, structure calculation and refinement 13 15 15 15 The NOE peaks in 4D C, N-edited NOESY, 4D N, N-edited NOESY and 4D 13 13 C, C-edited NOESY sub-spectra were assigned when a relatively complete chemical shift list was available. NOE distance constraints generated by the unambiguously assigned NOE peaks from 4D 13 13 15 C, N-edited NOESY and 4D 13 C, C-edited NOESY sub-spectra were used for the initial structure calculation. The distance upper bounds of the constraints were calibrated to three categories (2.8, 3.5, 5.5 angstroms) according to the peak volumes. Dihedral angle constraints were generated by TALOS (Cornilescu, 1999), which empirically predicts the φ and ψ α α β backbone torsion angles using a combination of three kinds (H , C , C ) of chemical 32 shifts for a given protein sequence. Only constraints that were classified as good prediction results were used for the structure calculation. Disulfide bond restraints were added only in the later stage, but not at the beginning. The initial structure calculation was done with DYANA (Herrmann et al., 2002), which started structure calculation from 50 conformers with random torsion angle values by simulated annealing with 10,000 steps per conformer. The software MOLMOL (Koradi, 1996) was used to display the structures. Energy minimization of the 5 conformers with the lowest target function values was selected. NOE peaks that gave distance violations >0.4 Å were checked and the range of angular constraints was set to two times of that predicted range by TALOS if specific dihedral angle violations were constantly larger than 5º. Initial structures were refined to minimize the violations. When a relatively good quality initial structure was acquired, ambiguous NOE peaks were then assigned according to the initial structure. Finally, 100 conformers with random torsion angle values by simulated annealing with 15,000 steps per conformer were calculated and the best 10 structures were selected for analysis. 33 Chapter 3 Results and Discussion 3.1 Mdka construct Analysis of Mdka shows that the first twenty-three amino acids are highly hydrophobic, and alignment of Mdka and human MK indicates that these amino acids are the signaling peptide of this growth factor (Fig1.5). Therefore in the construct of Mdka for protein structure elucidation, the signal peptide was removed to get the mature protein directly. With signal peptide deleted, the final plasmid of Mdka in the vector of pGEX4T-1 was successfully constructed, which was used in the expression of Mdka with a GST fusion tag at the N terminal of the protein (Fig3.1). 3.2 Expression and purification of Mdka 3.2.1 Expression of GST-Mdka in E. coli Both labeled and unlabeled Mdka with a GST tag attached to the N terminal of the protein could be easily over-expressed by E. coli BL21 (DE3), although its expression was slightly lower in M9 medium than in LB medium. 2 L M9 culture was used to obtain a sample of 500 μl 1 mM double labeled Mdka. Since GST tag has a molecular weight of around 27 kDa and the molecular weight of Mdka is 13.5 kDa, GST tagged Mdka shows a molecular weight of around 40 kDa (Fig3.2A & B). 3.2.2 Purification of Mdka After sonication, most of the protein was in supernatant (Fig3.2C). After 4 times of washing, most of the unspecific bindings could be washed out except for a few impurities (Fig3.2D). The GST tag could be easily cleaved with 1 U thrombin cleaving about 1 mg protein within 1 hour. After GST beads purification, relatively pure protein was obtained, but it was not pure enough for NMR experiments until 34 A B Fig3.1 Final construct of Mdka. (A) Final amino acid sequence of the expressed Mdka in the E. coli expression system. (B) Amino acid components of Mdka. 35 Fig3.2 Mdka expression and purification by GST beads. (A) Expression in LB media. Lane 1: marker; Lane 2: pre-induction when the cell density reached OD600 0.6, which was expression control; Lane 3: post-induction with 0.05 mM IPTG at 16 °C for 12 hours, the GST-tagged Mdka shows a molecular weight of around 40 kDa. (B) Expression in M9 media. Lane 1: post-induction with 0.3 mM IPTG at 20 °C for 20 hours; Lane 2: protein marker. (C) Solubility of Mdka. Lane 1: marker; Lane 2: pellets after sonication; Lane 3: supernatant after sonication. (D) GST purification. Lane 1: thrombin cleavage of Mdka for 3 hours (1 mg Mdka/ 1 U thrombin); Lane 2: thrombin cleavage of Mdka for 1 hour; Lane 3: marker; Lane 4: Mdka on GST beads after washing and before thrombin cleavage; Lane 5: the first time washing of Mdka with lysis buffer; Lane 6: the fourth time washing with lysis buffer; Lane 7: GST tag on GST beads after elution of Mdka from beads; Lane 8: the first time elution of Mdka from GST beads; Lane 9: the second time elution of Mdka from GST beads; Lane 10: the third time elution of Mdka from GST beads. 36 further purification of Mdka with a heparin affinity column (Fig3.3). 3.3 General properties of Mdka Mdka is a small protein with a molecular weight of 13.5 kDa, which was verified by Q-TOF mass spectrum (Fig3.4). The theoretical molecular weight for Mdka is 13521.4 Da (Cys in reduced form; software ProtParam), thus if Cys residues are oxidized, the molecular weight should be 13511.4 Da. The Q-TOF mass spectrum shows that the molecular weight is 13510.6 Da, which indicates that the Cys residues of Mdka could be in the oxidized form. Besides, the CD spectrum profile (Fig3.5), especially the positive hump at the position of 230 nm, suggests that Mdka has a β-turn structure. The solubility of the protein could be as high as 1 mM as measured with UVA280. When the protein is concentrated to 1 mM, it is still in the monomer form as evidenced by the DLS result (Fig3.6). Furthermore, 1D NMR and 2D HSQC experiments demonstrate that the recombinant Mdka purified from E. coli in this way is well structured (Fig3.7). Mdka is also a stable protein that can stand 30 °C for one week (Fig3.7). Since during the heparin affinity column purification, high concentration of NaCl is needed to elute the protein out of the column, this suggests that the recombinant protein has strong binding affinity to heparin (Fig3.3A). 3.4 Mdka in vivo assay: zebrafish embryonic development activity The activity of recombinant Mdka was tested in vivo using zebrafish embryo by our collaborator (Winkler, Dept of Biological Sciences, NUS). The recombinant Mdka was injected into zebrafish embryos, and the development of these embryos at different stages are compared with the wild type embryos as well as those injected with in-vitro-synthesized mdka RNA (Winkler et al., 2003). As a result, embryos 37 A volume (ml) Fig3.3 Heparin affinity column purification of Mdka. (A) Heparin column profile of Mdka. Mdka peak is indicated by Peak 2. (B) Peak fracture of Mdka from heparin affinity column. Lane 1: marker; Lane 2: from Peak 1; Lane3-7: from Peak 2. (C) Final concentrated Mdka for NMR experiment. 38 Fig3.4 Q-TOF mass spectrum of Mdka. The protein was desalted and prepared in water. The molecular weight of Mdka is 13510.6 Da. 39 Fig3.5 CD spectrum of Mdka. Sample was prepared with 10 mM phosphate buffer at pH 6.5. Spectrum was recorded at room temperature. 40 A Fig3.6 Apparent hydrodynamic MW of Mdka from DLS measurement. (A) Size distribution of Mdka protein particle in the buffer of 20 mM phosphate, 100 mM NaCl, pH 6.5, at 30 °C. The average size of Mdka protein is 2.02 nm. (B) Average molecular weight (MW) of Mdka. 20 MW values were analyzed, and the average MW is 13.7 kDa with a standard deviation of 0.66 kDa. 41 Fig3.7 1D and 2D NMR experiments of Mdka. (A) 1D NMR spectrum was acquired on a 500 MHz NMR machine at 20 ºC in 20 mM Tris buffer, 100 mM NaCl, 2 mM EDTA at pH 8.0. (B) Overlay of two sets of 2D 1H-15N HSQC spectra (magenta: acquired one week before, black: acquired one week later). The sample was stored at 30 ºC for one week. Spectra were recorded at 800 MHz NMR at 30 ºC in 20 mM phosphate buffer, pH 6.5, 100 mM NaCl, 1 mM EDTA. 42 injected with recombinant Mdka show the same development phenotypes as RNA-injected embryos, which indicates that the recombinant protein is properly folded and has the proposed biological functions (Winkler, data not published). 3.5 NMR assignment of Mdka 3.5.1 Backbone assignment of Mdka Based on the 3D HNCA, 3D HNCOCA, 4D 13 15 C, N-edited NOESY and 3D CCH-TOCSY spectra, all the spin systems in the two domain regions of Mdka were connected together, and the loop regions were assigned as long as the peaks appeared 1 15 in the H- N HSQC. Totally, there are about ten peaks missing (Fig3.9B) in the 1 15 H- N HSQC among the 125 amino acids of Mdka (with GS extension at the N terminal of the protein resulting from thrombin cleavage). Figure 3.8 is a part of the 1 15 backbone assignment result for residues from E39 to T41. The H- N HSQC spectrum of assigned Mdka backbone peaks was shown in figure 3.10. 3.5.2 Sidechain assignment of Mdka When backbone assignment was finished, the spin systems were identified, and therefore most of the sidechain chemical shifts were available. Some sidechain assignments need to compare two adjacent 4D 13 15 C, N-edited NOESY planes to decide the resonances. Some need to compare the 4D 13 13 C, C-edited NOESY subspectrum. The chemical shifts of the CH groups that have no NOE on the 4D 13 15 C, N-edited NOESY could be assigned by examining the CCH-TOCSY spectrum and 4D 13 13 C, C-edited NOESY. The assigned proton chemical shifts should be in consistent with the chemical shifts statistics of the twenty amino acids. The 43 Fig3.8 Backbone connectivity of Mdka. (A) Cα connectivity for a stretch of residues from E39 to T41. (B) 4D 13C,15N-edited NOESY slices of E39, G40 and T41. Each 1 H-13C slice is labeled with proton identities with sequential connectivity information. 15 N and 1H frequencies in ppm with assigned residues are indicated at the top of each slice. (Labels are in the NMRView format.) 44 B Fig3.9 Backbone assignment of Mdka. (A) Spin system identification. T41 was indentified because of its characteristic Cβ and Cγ chemical shifts. G40 was indentified according to its small Cα chemical shift and two HαCα peaks. G40 and T41 form a fragment, and when comparing with the sequence of Mdka, there are only two GT fragments (EGT and TGT). Since E39 clearly is not Thr, thus, it could be identified to be Glu. (B) Assigned backbone peaks (magenta) and missing backbone peaks (black). (Labels are in the NMRView format.) 45 Fig3.10 Assigned 1H-15N HSQC spectrum of Mdka backbone peaks. Spectrum was acquired at 800 MHz and 30 ºC on a sample of 0.8 mM protein and pH 6.5. The three Trp sidechain peaks are assigned. The labeled sequence numbers correspond to those in the final construct, and are consistent with NMRView format. 46 assignment of sidechain proton and carbon spins could further be verified by 1H-13C HSQC spectrum. Sidechain assignment for the loop regions was not as complete as the domain region. Totally, 78.3% aliphatic sidechain resonances were assigned. Aromatic sidechain assignment was also finished. 3.5.3 Secondary structure characterization from backbone assignment α β Since the C and C chemical shifts of an amino acid could be used as an indicator of the local secondary structure information, the secondary structure of α β Mdka was analyzed by comparing the C and C chemical shifts of each amino acid of α β Mdka with those of the random coil. The deviation ΔC /ΔC equals the chemical shift α β of the observed C /C for an Mdka residue minus that for the same residue in a α random coil structure. Positive secondary shift for C indicates that the secondary structure is helix, while negative result means that it is β-strand conformation. This β α β trend is opposite in sign for C secondary shift. Thus ΔC -ΔC could give an enhanced effect of the secondary structure information. Chemical shift index (CSI) of Mdka α β (Fig3.11) was obtained by plotting ΔC -ΔC against the amino acid sequence. From the CSI, it can be inferred that the protein is mainly composed of β-sheets in the two domain regions, and there are three long loops at the two ends of the two domains as well as in-between. 3.5.4 NOE assignment When the backbone and sidechain assignments were finished, most of the NOE peaks in the 4D 13 13 15 C, N-edited NOESY, 4D 15 15 N, N-edited NOESY and 4D 13 C, C-edited NOESY sub-spectra could be unambiguously assigned (Fig3.12). 47 Ambiguous NOE peaks were assigned based on the initial structure calculated with the unambiguously assigned NOE peaks. HN-HN NOE peaks were assigned on the 4D 15 15 N, N-edited NOESY subspectrum where the C chemical shift could be converted into the N chemical shift with the equation: N (ppm) = (C-Cref)Csf/Nsf (ppm) + Nref (ppm) (Fig3.14C). Totally, there are 2085 NOE peaks: 926 intra-residual NOEs, 618 sequencial NOEs, 152 medium-range NOEs, and 389 long-range NOEs. As shown in the analysis of sequential- and medium-range NOEs (Fig3.13), strong α α NOE peaks between H C i-1 and HNi were observed, which are consistent with the β-structure of Mdka. Successive strong long-range NOEs of HN-HC and HC-HC were observed, which are from the opposite strands of the antiparallel β-sheet. 3.5.5 Structure calculation and refinement Initial structures calculated using the unambiguously assigned NOE peaks (distance constraints) and torsion angle constraints initially showed relatively large average target function. The main reason is because of the overlap of NOE peaks that results in larger peak volumes than they should be. After all the NOE peaks were assigned, 100 structures were calculated, and the best 10 structures show an average target function of 1.49. The N domain and C domain RMSD values between the 10 conformers and the mean structures for the backbone heavy atoms are 0.76 and 1.65 respectively. Each domain contains three anti-parallel β-sheets (Fig3.14). The β distances between the C atoms of the two nearby Cys residues of the ten structures are shown in Table 1. As can be seen, except the pair of C75-C107 and C65-C97, all β the Cys pairs show distances of the C atoms between 3 Å to 8 Å. The C65-C97 β β shows very close distances because both of the Cys H C protons have NOEs with 48 Fig3.11 Chemical shift index of Mdka. ΔCα/ΔCβ equals Cα/Cβ of Mdka amino acid minus that of the same amino acid in a random coil structure. 49 A C B 1 15 T87 ( HN:8.27ppm, N: 115.65ppm) Fig3.12 NOE assignment. (A) Representative slices of HN-HC assignment from the 4D 13C,15N-edited NOESY subspectrum. (B) HC-HC NOE assignment from the 4D 13 13 C, C-edited NOESY subspectrum. (C) HN-HN assignment from the 4D 15 15 N, N-edited NOESY subspectrum. The C chemical shifts and N chemical shifts can be transferred using the equation of Ncs = (Ccs-Cref)Csf/Nsf+Nref (Labels are in the NMRView format.) 50 Fig3.13 Sequential- and medium-range NOE pattern of Mdka. The NOE intensities are represented by the thickness of the solid bars. 51 L88 HδCδ proton. C75-C107 pair shows relatively larger distances than the other pairs, ranging from 7.3 Å to 12.3 Å. 3.5.6 Disulfide restraints determination β β Although there is no direct NOE between the H C of any Cys residues observed, the ten Cys residues in Mdka should form five pairs of disulfide bonds, which was determined empirically and experimentally. The sequence alignment of the Midkine family shows that the Cys residues are highly conserved among the family (Fig1.5), and Mdka has sequence identity as high as 58% with human MK. Thus it is expected that the overall structure of the two proteins would be more or less the same, which indicates that the disulfide linkage of the two proteins would be identical. Therefore the five pairs of the disulfide bridge should be: C18-C42, C26-C51, C33-C55, C65-C97, and C75-C107 according to the sequence of Mdka construct in this study. In addition, Q-TOF mass spectrum shows that the molecular weight of the recombinant Mdka is 13510.6 Da. Theoretically, the molecular weight of Mdka is 13511.4 Da if five pairs of disulfide bond are formed, and 13521.4 Da if not (software ProtParam). This suggests that there are five pairs of disulfide bridges in the β recombinant Mdka protein. Besides, NMR analysis shows that the C chemical shifts of all Cys in the protein sequence are larger than 30 ppm, which also indicates disulfide bonds formation (Sharma and Rajarathnam, 2000). Moreover, the initial β structure of Mdka shows that the distances between the C atoms of the Cys pairs are all around 3.5 Å to 8 Å (C75-C107 within 7.3 Å to 12.3 Å ), which is near enough to form disulfide bonds. Therefore, structures with disulfide constraints were calculated. The target function of the structures calculated with these five pairs of disulfide constraints is similar with that calculated without. Therefore, although there is no 52 A B Fig3.14 Comparison of Mdka domain structures calculated with and without disulfide constraints. (A) Initial Mdka domain structures without disulfide constraints. Cys residues are colored in blue. (B) Mdka domain structures calculated with disulfide constraints. 53 Table 1 Distances between the Cβ atoms of nearby Cys residues in the 10 conformers. 54 β β β β direct H C -H C NOE observed among the ten Cys residues, it can be concluded that the disulfide bonds should be formed in this way. 3.6 NMR structure of Mdka The final structure of Mdka was calculated with distance constraints, dihedral angle constraints, hydrogen bond constraints, and disulfide bond constraints. The final structural statistics are listed in Table 2. G1-D17 is the N terminal loop; C18-C55 forms the N domain; N56-D64 is the liker between the two domains; C65-C107 is the C domain; and T108 to the end is the C terminal tail. Since there is no confirmed interaction observed among the three loops and the two domains, these parts are supposed to be able to move freely from each other (Fig3.15). The RMSD between the 10 conformers and the mean structures are 0.56 Å (L21-C55) and 0.54 Å (Y67-C107, excluding the long loop) for backbone heavy atoms, and 1.12 Å (L21-C55) and 0.82 Å (Y67-C107, excluding the long loop) for all non-hydrogen atoms in the well-defined regions, respectively. N domain (C18-C55) consists of three antiparallel β-sheets, βN1 (L22-A28), βN2 (G36-C42), βN3 (T46-V53), which are connected by two loops. C domain (C65-C107) also consists of three antiparallel β sheets: βC1 (W67-D76), βC2 (T81-Q87), βC3 (T100-K105). There are also two loops connecting these three β strands, but the loop between βC1 and βC2 is longer than the others. All the three proline residues are in the trans configuration as evidenced by γ γ α α the observation of NOE connectivity of proline H C and H C of the preceding residues (Fig3.17). Hydrophobic, charged, and other polar residues are colored on the space filling model of Mdka (Fig3.15). Given that Mdka is highly basic charged, and it has strong binding ability to the negative charged heparin molecule, the basic 55 Table 2 Structural statistics of Mdkaa a 10 energy-minimized 10 structures were investigated. bDihedral angle constraints were generated by TALOS. cHydrogen bonds were identified on the basis of NH-D2O exchange experiment. Two constraints per hydrogen bond (dHN-O ≤ 2.2 Å and dN-O ≤ 3.2 Å) were included in the final structure calculations by visual inspection of preliminary structures derived from the NOE data. dPROCHECK-NMR was used to assess the quality of the structures. 56 Mdka N domain Mdka C domain Fig3.15 Solution structure of Mdka. (A/E) Superposition of the backbone heavy atoms of the 10 best calculated Mdka structures. The Mdka conformers are fitted for minimal RMSD of the backbone atoms of all residues in both domains. (B/F) Ribbon drawing of the best conformer of Mdka. (C/G) Space filling model of Mdka domains with annotations of basic (blue), acidic (red), hydrophobic (pink), and polar (purple) residues on the surface. (D/H) Back view of C/G. (I) Ribbon model of the best conformer of the whole protein. 57 Fig3.16 Ramachandran plot of Mdka. 58 Fig3.17 Configuration of proline residues. 4D are listed on the left. 13 C,13C-edited NOESY subspectra 59 residues in the two domains of Mdka colored in blue seem to form clusters and could be the possible binding sites for heparin. 3.7 Structural comparison to other Midkine family members 3.7.1 Structure comparison of Mdka and human MK The three dimensional domain structures of Mdka were compared with those of human MK. The N domain backbone architectures of the two proteins are relatively similar, but C domains have larger differences (Fig3.18). The loop between βC2 and βC3 in human MK is bent towards the three β sheets because NOEs are detected among them. In Mdka no such NOE is observed, and the position of the βC1 seems to prevent the long loop from bending over. The linear sequence alignment of the two proteins shows an overall amino acid identity of 58% (Fig1.5), with 57% and 65% identity for the N domain and C domain respectively. The conserved residues as well as non-conserved ones are colored on the transparent surface diagram of Mdka, where a cluster of non-conserved residues is found on one side in the C domain of Mdka (Fig3.19 C). These residues might be responsible for the different functions between Mdka and human MK. In contrast, a cluster of identical residues is found on the opposite side in the C domain, where basic residues cluster together (Fig3.22 D), which might form the heparin binding site of Mdka. In addition, there are some similarities between the structures of Mdka and human MK. Both the N domain and C domain of MK display a β-bulge-like structure because of Pro22 in the N domain and Asn68-Trp69-Gly70 fragment in the C domain. While the corresponding position of MK Pro22 is changed to Ser25 in Mdka, yet the bulge is still there. The twist in the βN1 (L22-A28) of both proteins is also similar. Resembling of the C domain of human MK, Mdka has a β-bulge-like structure due to 60 Fig3.18 Comparison of Mdka with human MK. (A/D) N/C domain backbone superimposition of Mdka (cyan) and MK (magenta). (B/E) Ribbon diagram of Mdka N domain and C domain. (C/F) Ribbon diagram of human MK N domain and C domain. [Human MK structure is from PDB file 1mkc and 1mkn. Ribbon structures are from the reference (Muramatsu, 2002)]. 61 B C 1800 1800 Fig3.19 Sequence comparison of Mdka and human MK. (A) Sequence alignment of N domain and C domain of Mdka and human MK. {Note: red: small [small + hydrophobic (incl.aromatic -Y)], blue: acidic, magenta: basic, green: hydroxyl + amine + basic – Q; "*" means that the residues in that column are identical in all sequences in the alignment; ":" means that conserved substitutions have been observed, according to the color table available at http://www.ebi.ac.uk/Tools /clustalw2/scorestable_frame.html; "." means that semi-conserved substitutions are observed.} (B) N domain (C) C domain of Mdka represented by the transparent surface diagram with the ribbon digram beneath it. Residues are colored according to the above sequence alignment result. (Note: blue: identical residues; yellow: conserved substitutions have been observed in human MK; orange: semi-conserved substitutions are observed in human MK; red: non-conserved residues; green: glutamine residue that might mediate dimerization in Mdka.) 62 Gly70-Asn71-Trp72 fragment. Sequence homology that is observed in human MK between the βN2 (Arg35-Glu36-Gly37-Thr38) and βC2 (Arg81-Gln82-Gly83-Thr84) as well as βN3 (Ile46-Arg47, Pro51-Cys52) and βC3 (Ile98-Arg99, Pro103-Cys104) is not found in Mdka. Instead, a homology is found between the N terminal loop (Gly15-Ala16-Asp17 -Cys18) and the end of the linker (Gly62-Ala63-Asp64-Cys65). But we have no idea about whether the existence of these repetitive fragments in this growth factor family is a coincidence or results from evolution. The C domain heparin binding site of MK was mapped using NMR perturbation by adding heparin 12mer solution. Iwasaki et al. discovered that K79 (βC2), R81 (βC2), and K102 (βC3) from the C domain β-sheets as well as K86, K87, and R89 from the long loop between βC2 and βC3 are the key residues involving in heparin binding. The corresponding six residues in Mdka are: K82 (βC2), R84 (βC2), K105 (βC3), Q89, K90, and L92. Therefore, the three charged residues on the β-sheets are very likely to be involved in heparin binding, but whether residues from the long loop participate in this activity needs further validation. Moreover, the stoichiometry of MK-heparin oligosaccharide complex was investigated by laser light scattering experiments, and it was reported that MK formed a dimmer upon binding to heparin of >20 monosaccharide units. Kojima et al. also reported the property of MK dimmerization, and three glutamine residues, although not conserved among species through evolution, had been identified to be responsible for the transglutaminase-mediated dimer formation of MK. Among the three residues, two are conserved in Mdka (Q45 and Q98), and one is changed to R47. The two glutamine residues are not far away from the non-conserved residues (Fig3.19 B & C). So far we have no idea whether Mdka would dimerize when it binds to heparin. 63 3.7.2 Structure comparison of Mdka with human PTN The structure of E.coli-expressed human PTN was analyzed by NMR (Kilpeläinen et al., 2000; Fig3.19 & Fig3.20), and its heparin binding activity was also investigated. Although no three dimensional structure of PTN is found in PDB, its secondary structure could be compared with that of Mdka. The main difference lies in the N domain, which has amino acid identity of 47%; the loop connecting βN2 and βN3 is longer in PTN than Mdka. The C domains of the two proteins are highly conserved with sequence identity as high as 60%. Similar features of the two proteins include the antiparallel β-sheet structures of the two domains, a highly conserved linker region, and the Lys-rich N-terminal and C-terminal tails. The heparin binding activity of PTN was analyzed using NMR and heparin affinity chromatography. The results show that both the N- and C-β-sheet domains bind to heparin in solution. NMR results also suggest that the Lys-rich tails, though highly charged, do not contribute to the binding of PTN to heparin and remain unstructured in the PTN-heparin complex. Upon binding to heparin, PTN undergoes structure changes according to the CD spectroscopy, especially prominent at wavelengths of 200 and 230 nm. However, how the structure changed is still unknown. Since human MK could form a dimmer when binding to heparin, the CD spectrum profile changes of PTN might be due to the dimmerization as well. The disulfide bonds in PTN are essential to maintain the native structure since reduction of disulfide bonds dramatically reduced heparin binding. Therefore, it can be inferred that disulfide bonds in Mdka play pivotal role to the formation of the β-sheet structure. Heparin binding activity of Mdka might also be due to the β-sheet domain structure rather than the charged N-terminal and C-terminal tails. 64 Fig3.20 Sequence alignment of Mdka and PTN. β sheets of PTN were shown as white arrows (Kadomatsu and Muramatsu, 2004). {Note: red: small [small + hydrophobic (incl.aromatic -Y)], blue: acidic, magenta: basic, green: hydroxyl + amine + basic – Q; "*" means that the residues in that column are identical in all sequences in the alignment; ":" means that conserved substitutions have been observed, according to the colour table available at http://www.ebi.ac.uk/Tools /clustalw2/scorestable_frame.html; "." means that semi-conserved substitutions are observed.} (Software ClustalW2, EBI, ExPASy). 65 Fig3.21 Secondary structure of PTN. Curved arrows show the location of disulfide bonds. Straight arrows indicate long-range NOE interactions (Kilpeläinen et al., 2000). 66 3.7.3 Comparison of Mdka with Mdkb The linear sequence alignments of the two domains of Mdka and Mdkb suggest that the sequence identity of the two proteins is very high, with 71% and 69% identity for the N domain and C domain respectively. Thus the three dimensional structures of the two proteins might be of high similarity. Residues that are conserved between the two proteins as well as those have substitutions are colored according to the sequence alignment result (Fig3.22). One conserved glutamine residue, which is involved in the dimerization of MK, is in the vicinity of the non-conserved residue in the C domain of Mdka (Fig3.22 D). From the sequence comparison, nine basic residues are identified in the C domain of Mdkb, among which six are conserved in Mdka and the other three are substituted by neutral amino acids in Mdka. These nine basic residues form two basic clusters, and might be the heparin binding site of Mdkb. In contrary to C domain, one basic residue in the N domain of Mdkb has changed to a neural one. Therefore, the functional difference of Mdka and Mdkb might lie in the fact that the charges of the C domains of the two proteins are different. Since Midkine family could bind to the transmembrane heparan sulfate proteoglycans to induce intracellular signaling pathways, the charge difference of the C domains of Mdka and Mdkb might possibly lead them to bind to different kinds of transmembrane receptors, thus inducing different signaling pathways. 67 B C 1800 1800 D 1800 Fig3.22 Sequence comparison of Mdka and Mdkb. (A) Sequence alignment of N domain and C domain of Mdka and Mdkb. (Note: residues are colored in the same way as in Fig3.20) (B) N domain (C) C domain of Mdka represented by the transparent surface diagram with the ribbon diagram beneath it. Residues are colored according to the above sequence alignment result. (Note: residues are colored in the same way as in Fig3.20.) (D) Basic residues in the sequence of Mdkb are colored accordingly in the C domain of Mdka. Hot pink: residues conserved between Mdka and Mdkb. Purple: basic residues in Mdkb that are substituted by neural ones in Mdka. Green: glutamine residue that might mediate dimerization in Mdka and Mdkb. 68 Chapter 4 Conclusion and Future Work Recombinant Mdka is expressed as a well structured soluble protein using E. coli BL21 (DE3) strain, and is suitable for NMR study. It has the same disulfide linkage pattern as human MK and PTN. As a heparin binding growth factor, it shows strong binding to heparin in vitro. Zebrafish embryos after injection of the E. coli-expressed Mdka show the same development phenotypes as those injected with in-vitro-synthesized RNA, which means that the recombinant protein is properly folded and has the desired biological functions. The solubility of Mdka is quite high; when it is concentrated to 1 mM, the protein still exists in monomer form. Besides, it is also a stable protein that can stand 30 °C for one week. All these properties of Mdka make it suitable for structure elucidation by NMR. Moreover, sequence alignment of Mdka with human MK reveals many long loops present in Mdka, which suggests that it would be extremely difficult to crystallize the protein. Thus, NMR becomes the only way to solve the structure of this protein. In this study, the solution structure of recombinant Mdka is determined based on the multidimentional NMR using a novel backbone assignment method developed by our group. The 4D NMR method is originally designed for structure elucidation of larger proteins (~40-50 kDa), this work shows that this method can also be used to solve high resolution structures for small proteins despite the fact that the 4D NMR method may lose some peak intensity. The structure of Mdka shows that there are two long tails and two domains. Connecting the two domains is a linker with amino acids highly conserved among the MK family. We are not sure whether the linker has specific functions, and it seems that the linker is also flexible. Each domain is composed of three antiparallel β sheets, resembling that of human MK. Comparing 69 individual domains of Mdka with human MK indicates that the N domain backbone architectures of the two proteins are more or less the same; while C domains show larger differences between the two proteins, especially for the position of the long loop between βC2 and βC3. In human MK, this long flexible loop is bent towards the three β sheets. In Mdka, the position of the βC1 seems to prevent the long loop from bending over. Therefore, the C domain of human MK has a more globular shape, while the C domain of Mdka seems more flat. By comparing Mdka with other members of the MK family, we can infer that C domain of Mdka might contribute to most of the heparin binding activity. The highly positive charged N terminal and C terminal tails might not be involved in heparin binding. Instead, charged residues on the C domain could be the possible heparin binding sites. Besides, Mdka could possibly form a dimer upon binding to heparin. With the structure of Mdka available, future work can be carried on in the following directions. First of all, to understand why the two Midkine proteins in zebrafish, Mdka and Mdkb, have different expression and functional patterns, the structures of the two proteins can be compared (Lim Jack Wee in our group is currently solving the structure of Mdkb). Since the sequence alignment of the two proteins shows that the domain regions have high sequence identity (70%), it is very likely the two proteins have similar structural architecture. If the structure is similar, then it would be interesting to pursue the reason why similar protein structures can lead to different functions. Of interesting notice is that there is charge difference between the C domains of the two proteins; Mdkb has three more basic residues than Mdka does in the possible heparin binding site. This charge difference might possibly be the reason why Mdka and Mdkb perform differently during zebrafish 70 embryogenesis. Besides, the heparin binding sites of Mdka need to be identified to understand where the key residues are that directly involve in the functions of this protein. Methods such as titration of heparin to Mdka, NMR perturbation, and site-directed mutagenesis of Mdka in binding with heparin can be used to pursue this target. Since Mdka has neural development function as well as heparin binding activity, it is possible that these two functions are performed via two different binding sites. Since the C domain of human MK is well studied and it is believed to be essential for heparin binding activity, the same situation might also be applied to the C domain of zebrafish Mdka. Therefore, charged residues on the Mdka C domain might be the candidates to do mutagenesis analysis to map the binding sites to heparin. Yet the N domain function of this family is still unclear. Whether the N domain is responsible for the neural development activity needs further validation. In addition, if possible, the binding complex of Mdka to its partner can be analyzed to get a view of the complex structure. So far the most commonly used ligand to evaluate the activity of MK family is heparin. Native heparin is a polymer with a molecular weight ranging from 3 kDa to 40 kDa, although the average molecular weight of most commercial heparin preparations is in the range of 12 kDa to 15 kDa. Iwasaki et al. found that heparin of >20 monosaccharide units would induce MK dimerization, and Kaneda (1996b) et al. reported that the minimize size for heparin to inhibit MK from binding to neurons is approximately 22 monosaccharide units (7 kDa). Complex structure would provide answers to questions such as how the dimer is fomed, how the two domains are orientated when binding to the ligand, and what the position of the linker and the terminal tails are. Moreover, to fully understand how the two zebrafish Midkines function, why 71 two proteins are needed in zebrafish while in the higher vertebrates only one homolog is present, and what the factors are to differentiate these two proteins, the signaling pathways of the two growth factors involved should be studied before these questions can be answered. 72 Chapter 5 References Bax, A., Ikura, M. (1991) An efficient 3D NMR technique for correlating the proton and 15 N backbone amide resonances with the alpha-carbon of the preceding 15 13 residue in uniformly N/ C enriched proteins. J. Biomol. NMR 1: 99-104. Bronner-Fraser, M. (1995) Origins and developmental potential of the neural crest. Experimental Cell Research 218: 405-417. Cornilescu, G., Delaglio, F., and Bax, A. (1999) Protein backbone angle constraints from searching a database for chemical shift and sequence homology. J. Biomol. NMR 13: 289-302. Chang, M.H., Huang, C.J., Hwang, S.P., Lu, I.C., Lin, C.M., Kuo, T.F., and Chou, C.M. (2004) Zebrafish heparin-binding neurotrophic factor enhances neurite outgrowth during its development. Biochem. Biophys. Res. Commun. 321: 502-509. Dahm, R. (2006) The zebrafish exposed. American Scientist 94: 446-453. Delaglio, F., Grzesiek, S., Vuister, G. W., Zhu, G., Pfeifer, J., and Bax, A. (1995) NMRPipe: a multidimensional spectral processing system based on UNIX pipes. J. Biomol. NMR 6: 277-293. Herrmann, T., Guntert, P., and Wuthrich, K. (2002) Protein NMR structure determination with automated NOE assignment using the new software CANDID and the torsion angle dynamics algorithm DYANA. J. Mol. Biol. 319:209-227. Herz, J. and Beffert, U. (2000) Apolipoprotein E receptors: linking brain development and Alzheimer’s disease. Nat. Rev. Neurosci. 1: 51–58. Iwasaki, W., Nagata, K., Hatanaka, H., Inui, T., Kimura, T., Muramatsu, T., Yoshida, 73 K., Tasumi, M. and Inagaki, F. (1997) Solution structure of midkine, a new heparin-binding growth factor. The EMBO Journal 16: 6936-6946. Jessen, R. K. and Mirsky, R. (2005) The origin and development of glial cells in peripheral nerves. Nature Reviews Neuroscience 6: 671-682 . Johnson, B. A. (2004) Using NMRView to visualize and analyze the NMR spectra of macromolecules. Methods Mol. Biol. 278: 313-352. Kadomatsu, K., Tomomura, M. and Muramatsu, T. (1988) cDNA cloning and sequencing of a new gene intensely expressed in early differentiation stages of embryonal carcinoma cells and in mid-gestation period of mouse embryogenesis. Biochem. Biophys. Res. Commun. 151: 1312-1318. Kadomatsu, K., Muramatsu, T. (2004) Midkine and pleiotrophin in neural development and cancer. Cancer Letters 204: 127-143. Kaneda, N., Talukder, A.H., Nishiyama, H., Koizumi, S. and Muramatsu, T. (1996a) Midkine, a heparin-binding growth/differentiation factor, exhibits nerve cell adhesion and guidance activity for neurite outgrowth in vitro. J. Biochem. 119: 1150-1156. Kaneda, N., Talukder, A. H., Ishihara, M., Hara, S., Yoshida, K. and Muramatsu, T. (1996b) Structural characteristics of heparin-like domain required for interaction of midkine with embryonic neurons. Biochem. Biophys. Res. Commnun. 220: 108–112. Kato, M., Shinozawa, T., Kato, S., and Terada, T. (2000) Divergent expression of midkine in the human fetal liver and kidney: Immunohistochemical analysis of developmental changes in hilar primitive bile ducts and hepatocytes. Liver 20: 475-481. Kilpeläinen, I., Kaksonen, M., Kinnuneni, T., Avikainen, H., Fath, M., Linhardt, R. J., 74 Rauloi, E. and Rauvalai, H. (2000) Heparin-binding growth-associated molecule contains two heparin-binding β-sheet domains that are homologous to the thrombospondin type I repeat. J. Biol. Chem. 275: 13564-13570. Kojima, T., Katsumi, A., Yamazaki, T., Muramatsu, T., Nagasaka, T., Ohsumi, K., and Saito, H. (1996) Human ryudocan from endothelium-like cells binds basic fibroblast growth factor, midkine, and tissue factor pathway inhibitor. J. Biol. Chem. 271: 5914–5920. Kojima, S., Inui, T., Muramatsu, H., Suzuki, Y., Kadomatsu, K., Yoshizawa, M., Hirose, S., Kimura, T., Sakakibara, S. and Muramatsu, T. (1997) Dimerization of midkine by tissue transglutaminase and its functional implication. J. Biol. Chem. 272: 9410-9416. Koradi, R., Billeter, M., and Wuthrich, K. (1996) MOLMOL: a program for display and analysis of macromolecular structures. J. Mol. Graph. 14:51-32. Li, Y. S., Milner, P. G., Chauhan, A. K., Watson, M. A., Hoffman, R. M., Kodner, C. M., Milbrandt, J. and Deuel, T. F. (1990) Cloning and expression of a developmentally regulated protein that induces mitogenic and neurite outgrowth activity. Science 250: 1690-1694. Liedtke, D. and Winkler, C. (2008) Midkine-b regulates cell specification at the neural plate border in zebrafish. Developmental Dynamics 237: 62-74. Maeda, N., Nishiwaki, T., Shintani, T., Hamanaka, H., and Noda, M. (1996) 6B4 Proteoglycan/phosphacan, an extracellular variant of receptor-like protein-tyrosine phosphatase ζ/RPTPβ, binds pleiotrophin/heparin-binding growth-associated molecule (HB-GAM). J. Biol. Chem. 271: 21446–21452. Maeda, N. and Noda, M. (1998) Involvement of receptor-like protein tyrosine phosphatase ζ/RPTPβ and its ligand pleiotrophin/heparin-binding 75 growth-associated molecule (HB-GAM) in neuronal migration. J. Cell Biol. 142: 203-216. Maeda, N., Ichihara-Tanaka, K., Kimura, T., Kadomatsu, K., Muramatsu, T. and Noda, M. (1999) A receptor-like protein-tyrosine phosphatase PTPζ/RPTPβ binds a heparin-binding growth factor midkine. Involvement of arginine 78 of midkine in the high affinity binding to PTPζ. J. Biol. Chem. 274: 12474-12479. Mayden, R. L., Tang, K. L., Conway, K. W., Freyhof, J., Chamberlain, S., Haskins, M., Schneider, L., Sudkamp, M., Wood R. M., Agnew, M., Bufalino, A., Sulaiman, Z., Miya, M., Saitoh, K., He, S. (2007) Phylogenetic relationships of Danio within the order Cypriniformes: a framework for comparative and evolutionary studies of a model species. J. Exp. Zool. (Mol. Dev. Evol.) 6: 17554749. Mishima, K., Asai, A., Kadomatsu, K., Ino, Y., Nomura, K., Narita, Y., Muramatsu, T., and Kirino, T. (1997) Increased expression of midkine during the progression of human astrocytomas. Neurosci. Lett. 233: 29-32. Mitsiadis, T., Salmivirta, M., Muramatsu, Muramatsu, T. H., Rauvala, H., Lehtonen, E., Jalkanen, H., and Thesleff, I. (1995) Expression of the heparin-binding cytokines, midkine (MK) and HBGAM (pleiotrophin) is associated with epithelial–mesenchymal interactions during fetal development and organogenesis. Development 121: 37–51. Moradian-Oldak, J., Leung, W., and Fincham, A. G. (1998) Temperature and pH-dependent supramolecular self-assembly of amelogenin molecules: a dynamic light-scattering analysis. J. Struct. Biol. 122: 320-327. Muramatsu, H., Inui, T, Kimura, T, Sakakibara, S., Song, X., Murata, H. and Muramatsu, T. (1994) Localization of heparinbinding, neurite outgrowth and antigenic regions in midkine molecule. Biochem, Biophys. Res. Commun. 203: 76 1131. Muramatsu, T. (2002) Midkine and pleiotrophin: two related proteins involved in development, survival, inflammation and tumorigenesis. J. Biochem. 132: 359-371. Nakanishi, T., Kadomatsu, K., Okamoto, T., Ichihara-Tanaka, K., Kojima, T., Saito, H., Tomoda, Y., and Muramatsu, T. (1997) Expression of syndecan-1 and -3 during embryogenesis of the central nervous system in relation to binding with midkine. J. Biochem. 121: 197–205. Rabi, I. I., Zacharias, J. R., Millman, S. and Kusch, P. (1938) A new method of measuring nuclear magnetic moment. Physical Review 53: 318-318. Raulo, E., Chernousov, M.A., Carey, D.J., Nolo, R., and Rauvala, H. (1994) Isolation of a neuronal cell surface receptor of heparin binding growth-associated molecule (HB-GAM). Identification as N-syndecan (syndecan-3). J. Biol. Chem. 269: 12999–13004. Rauvala, H. (1989) An 18-kd heparin-binding protein of developing brain that is distinct from fibroblast growth factors. EMBO J. 8: 2933-2941. Rauvala, H., Vanhala, A., Castren, E., Nolo, R., Raulo, E., Merenmies, J. and Panula, P. (1994) Expression of HB-GAM (heparin-binding growth-associated molecules) in the pathways of developing axonal processes in vivo and neurite outgrowth in vitro induced by HB-GAM. Brain Res. Dev. Brain Res. 79: 157-176. Roberts, A. (2000) Early functional organization of spinal neurons in developing lower vertebrates. Brain Res. Bull. 53: 585-593. Sattler, M. (2004) Introduction to biomolecular NMR spectroscopy. (http://www. embl-heidelberg.de/nmr/sattler/teaching/teaching_pdf/predoc_intro.pdf) Schäfer, M., Rembold, M., Wittbrodt, J., Schartl, M. and Winkler C. (2005) Medial 77 floor plate formation in zebrafish consists of two phases and requires trunk-derived Midkine-a. Genes & Dev. 19: 897-902. Schäfer, M., Kinzel, D., and Winkler, C. (2007) Discontinuous organization and specification of the lateral floor plate in zebrafish. Dev Biol. 301: 117-29 Sharma, D. and Rajarathnam, K. (2000) 13 C NMR chemical shifts can predict disulfide bond formation. Journal of Biomolecular NMR 18: 165-171. Squire R. L., McConnell, K. S., Zigmond, J. M. Fundamental neuroscience. Edition: 2, Published by Academic Press, 2003 Stoica, G. E., Kuo, A., Aigner, A., Sunitha, I., Souttou, B., Malerczyk, C., Caughey, D. J., Wen, D., Karavanov, A., Riegel, A. T., and Wellstein, A. (2001) Identification of anaplastic lymphoma kinase as a receptor for the growth factor pleiotrophin. J. Biol. Chem. 276: 16772–16779. Stoica, G. E., Kuo, A., Powers, C., Bowden, E. T., Sale, E. B., Riegel, A. T., and Wellstein, A. (2002) Midkine binds to anaplastic lymphoma kinase (ALK) and acts as a growth factor for different cell types. J. Biol. Chem. 277: 35990–35998. Unoki, K., Ohba, N., Arimura, H., Muramatsu, H. and Muramatsu, T. (1994) Rescue of photoreceptors from the damaging effects of constant light by midkine, a retinoic acid responsive gene product. Invest. Opthalmol. Vis. Sci. 35: 4063-4068. Wikipedia: http://en.wikipedia.org/wiki/Main_Page Winkler, C. and Moon, R.T. (2001) Zebrafish mdk2, a novel secreted midkine, participates in posterior neurogenesis. Dev. Biol. 229: 102-118. Winkler, C., Schäfer, M., Duschl, J., Schartl, M. and Volff, J. (2003) Functional divergence of two zebrafish midkine growth factors following fish-specific gene duplication. Genome Res. 13: 1067-1081. Wishart, D. S., Sykes, B. D., and Richards, F. M. (1991) Relationship between nuclear 78 magnetic resonance chemical shift and protein secondary structure. J. Mol. Biol. 222: 1-33. Wisniewski, T., Lalowski, M., Baumann, M., Rauvala, H., Raulo, E., Nolo, R., and Frangione, B. (1996) HB-GAM is a cytokine present in Alzheimer’s and Down’s syndrome lesions. Neuroreport 7: 667–671. Xu, Y., Lin, Z., Chien, H., Yang, D. (2005) A general strategy for the assignment of 13 15 aliphatic side-chain resonances of uniformly C, N-labeled large proteins. J. Am. Chem. Soc. 127: 11920-11921. Xu, Y., Zheng, Y., Fan, J., Yang, D. (2006) A novel strategy for structure determination of large proteins without deuteration. Nature Methods 3: 931-937. Xu, Y., Long, D., and Yang D. (2007) Rapid data collection for protein structure determination by NMR spectroscopy. J. Am. Chem. Soc. 129: 7722-7723. Yang, D., Zheng, Y., Liu, D., Wyss, D. F. (2004) Sequence-specific assignments of methyl groups in high-molecular weight proteins. Journal of the American Chemical Society 126: 3710-3711. Yasuhara, O., Muramatsu, H., Kim, S. U., Muramatsu, T., Maruta, H. and McGeer, P. L. (1993) Midkine, a novel neurotrophic factor, is present in senile plaques of Alzheimer disease. Biochem. Biophys. Res. Commun. 192: 246-251. 79 [...]... Na2HPO4, 3 g KH2PO4, 0.5 g NaCl and 100 μg/ml ampicilin 2.1.2 Preparation of DNA plasmid The gene encoding Mdka protein was obtained from our collaborator, Prof Winkler (Dept of Biological Sciences, NUS), in the template plasmid pCS2 Zmk-2 Oligonucleotide primers were used to amplify the mdka gene The sense primer is 5’-CGCGGATCCAAAAACAAGAAAGAGAAGAA-3’ and antisense primer is 3’-GGGGATATCACTCAGCATAAT-5’... (Li et al., 1990; Rauvala et al., 1994; Kaneda et al., 199 6a) , nerve cell migration (Maeda et al., 1998; Maeda et al., 1999), neurodegenerative diseases (Wisniewski et al., 1996; Yasuhara et al., 1993), and cancer (Kadomatsu et al., 2004) Many of the down-regulation pathways are still unclear, but it is pointed 13 out that the intracellular signaling of N-syndecan family receptors involves cortactin-src... al., 1999) 1.3.2.4 Structure Calculations and Refinement A restrained molecular dynamics/simulated annealing (MD/SA) or torsion angel dynamics (TAD) is chosen for structure calculation, which would use all the 20 available structural parameters mentioned above Structure calculation with NOE assignment is an iterative process The result of an NMR structure calculation is an ensemble of structures, all... development (Kadomatsu et al., 2004) 1.2.2 Protein structures MK precursor protein with an intact signal peptide (22 amino acids) has a molecular mass of 15.6 kDa whereas the matured form of MK is calculated to be 13.2 kDa Mature PTN protein, which shares 45% amino acid sequence identity with MK (Fig1. 6A) , has a molecular weight of 19 kDa The individual domain structures of MK have been solved by NMR (Iwasaki... specifies the fraction of backbone φ,ψ angles which correspond to favored, additionally allowed, generally allowed and disallowed conformations based on statistical analysis of high-resolution crystal structures, is a good quality measure Most φ,ψ angles should be within the allowed regions, while poor quality structures usually have a large number of φ,ψ angles in forbidden regions in the Ramachandran plot... cDNA and its deduced amino acid sequences (B) Hydropathy profile of Mdka (Software from ExPSAy, ProtScale) 6 A B Fig1.4 Sequence of Mdkb (A) The nucleotide sequence of the Mdkb cDNA and its deduced amino acid sequences (B) Hydropathy profile of Mdkb (Software from ExPSAy, ProtScale) 7 Fig1.5 Sequence alignment of MK family (Chang et al., 2004) heparin binding; cluster I a. a for cluster II a. a for heparin... violation, dihedral angle violation, electrostatic potential term, van der Waals force, and the Ramachandran plot The RMSD indicates the coordinate precision of the structure ensembles calculated The smaller the value of RMSD between the calculated conformers and the mean structures (usually less than 1 Å), the better the quality of the structures calculated Distance constrains violation should be within... signaling pathway, Wnt signaling pathway, and BMP signaling pathway In addition, Mdkb controls the cell determination at the neural plate border and affects the formation of neural crest cells and Rohon-Beard sensory neurons (Liedtke et al., 2008) As secreted growth factors, both proteins might bind to the transmembrane heparan sulfate proteoglycans These receptors have the heparin-like carbohydrates;... dorsal mesoderm, cells that adjacent to the notochord during vertebrate organogenesis form transient structures called somite, which define the segmental pattern of the embryo, and subsequently give rise to vertebrae and ribs, dermis of the back, and skeletal muscles of the back, body wall and limbs (Wikipedia) 1.1.2 Characterization of Mdka and Mdkb of zebrafish 3 Fig1.2 Organization of the neural... 1999), anaplastic leukemia kinase (ALK) (Stoica et al., 2001; Stoica et al., 2002), and low density lipoprotein receptor-related protein (LRP) (Herz et al., 2000) N-syndecan belongs to the syndecan family that comprises four transmembrane heparan sulfate proteoglycans Binding of MK and PTN to syndecans is mediated by the heparin sulfate chain, which is the heparin-like domain This domain consists of a variably ... used to amplify the mdka gene The sense primer is 5’-CGCGGATCCAAAAACAAGAAAGAGAAGAA-3’ and antisense primer is 3’-GGGGATATCACTCAGCATAAT-5’ DNA gel electrophoresis (Agarose from Bio-Rad) was subsequently... vertebrae and ribs, dermis of the back, and skeletal muscles of the back, body wall and limbs (Wikipedia) 1.1.2 Characterization of Mdka and Mdkb of zebrafish Fig1.2 Organization of the neural tube... quality structures usually have a large number of φ,ψ angles in forbidden regions in the Ramachandran plot 21 1.3.3 Advantages of structure study by NMR Nuclear Magnetic Resonance, X-ray Crystallography,

Ngày đăng: 16/10/2015, 15:35

TÀI LIỆU CÙNG NGƯỜI DÙNG

TÀI LIỆU LIÊN QUAN