
DSpace at VNU: Loss of matK RNA editing in seed plant chloroplasts



BMC Evolutionary Biology
BioMed Central
Open Access
Research article

Loss of matK RNA editing in seed plant chloroplasts

Michael Tillich1, Vinh Le Sy4, Katrin Schulerowitz3, Arndt von Haeseler2, Uwe G Maier3 and Christian Schmitz-Linneweber*1

Address: 1Institut für Biologie, Humboldt Universität zu Berlin, Molekulare Genetik, D-10115 Berlin, Germany, 2Center for Integrative Bioinformatics Vienna, Max F Perutz Laboratories, University of Vienna, Medical University Vienna, University of Veterinary Medicine Vienna, A1030 Vienna, Austria, 3Fachbereich Biologie – Zellbiologie, Philipps-Universität Marburg, Karl-von-Frisch-Str, D-35032 Marburg, Germany and 4Department of Computer Sciences, College of Technology, Vietnam National University, Hanoi, Vietnam

Email: Michael Tillich - tillichm@staff.hu-berlin.de; Vinh Le Sy - vinhbio@gmail.com; Katrin Schulerowitz - 33Katrin@gmx.de; Arndt von Haeseler - arndt.von.haeseler@univie.ac.at; Uwe G Maier - maier@staff.uni-marburg.de; Christian Schmitz-Linneweber* - christian.schmitzlinneweber@rz.hu-berlin.de

* Corresponding author

Published: 13 August 2009
BMC Evolutionary Biology 2009, 9:201 doi:10.1186/1471-2148-9-201
Received: January 2009
Accepted: 13 August 2009

This article is available from: http://www.biomedcentral.com/1471-2148/9/201

© 2009 Tillich et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Background: RNA editing in chloroplasts of angiosperms proceeds by C-to-U conversions at specific sites. Nuclear-encoded factors are required for the recognition of cis-elements located immediately upstream of editing sites. The ensemble of editing sites in a chloroplast genome differs widely between species, and editing sites are thought to evolve rapidly. However,
large-scale analyses of the evolution of individual editing sites have not yet been undertaken.

Results: Here, we analyzed the evolution of two chloroplast editing sites, matK-2 and matK-3, for which DNA sequences from thousands of angiosperm species are available. Both sites are found in most major taxa, including deep-branching families such as the Nymphaeaceae. However, 36 isolated taxa scattered across the entire tree lack a C at one of the two matK editing sites. Tests of several exemplary species from this in silico analysis of matK processing unexpectedly revealed that one of the two sites remains unedited in almost half of all species examined. A comparison of sequences between editors and non-editors showed that specific nucleotides co-evolve with the C at the matK editing sites, suggesting that these nucleotides are critical for editing-site recognition.

Conclusion: (i) Both matK editing sites were present in the common ancestor of all angiosperms and have been independently lost multiple times during angiosperm evolution. (ii) The editing activities corresponding to matK-2 and matK-3 are unstable. (iii) A small number of third-codon positions in the vicinity of editing sites are selectively constrained independent of the presence of the editing site, most likely because of interacting RNA-binding proteins.

Background

Chloroplast RNA metabolism is characterized by extensive RNA processing, including RNA editing. In chloroplasts of angiosperms, RNA editing proceeds by C-to-U base conversions at specific sites, while in chloroplasts of hornworts, many bryophytes and ferns, U-to-C conversions take place as well [1-3]. RNA editing events almost exclusively change codon identities, and usually restore codons conserved during land plant evolution. Mutational analyses of edited codons have demonstrated that editing is essential for protein function in vivo [4,5]. The corresponding
machinery is nuclear encoded, and recognizes short stretches of sequence immediately upstream of the C to be converted [6]. RNA editing has been found in chloroplasts of all major land plants. To date, there is no evidence for RNA editing in cyanobacteria, the closest prokaryotic relatives of chloroplasts, or in chlorophyte algae, the closest aquatic relatives of land plants. This phylogenetic distribution suggests that chloroplast RNA editing was "invented" close to the root of land plant radiation [3].

Within land plants, the number of chloroplast RNA editing sites per genome differs among species. Bryophytes and ferns may possess several hundred C-to-U as well as U-to-C RNA editing sites [1-3]. The chloroplast genomes of seed plants harbor far fewer (~30) editing sites, and their location varies even between closely related taxa [6]. At least one land plant, the liverwort Marchantia polymorpha, apparently contains no RNA editing sites [7], suggesting that, in principle, RNA editing can become lost from a chloroplast genome.

An important question is how the species-specific patterns of editing sites – the editotypes – of seed plant chloroplasts evolved. Differences in editotypes between even closely related species, such as Nicotiana sylvestris, Nicotiana tomentosiformis and other Solanacean relatives, point to a rapid evolution of editing sites [8,9]. A comparison of editing sites between dicot and monocot organelles supports this notion, demonstrating that the speed of editing site evolution equals or exceeds that of third-codon positions [10]. Analyses of selected transcripts from exemplary species over a wide range of land plants have led to similar conclusions [3,11,12].

While these analyses were meant to illuminate the evolution of editing sites, they do not necessarily shed any light on the evolution of the corresponding editing machinery. To date, the only genetically identified essential editing factors are required for editing specific sites and belong to a family of
nuclear-encoded RNA binding proteins, the pentatricopeptide repeat proteins (PPR) [13-19]. Most PPR genes are conserved throughout angiosperm evolution [20] and, unlike editing sites, do not rapidly evolve. In fact, in at least five specific cases, specific nuclear activity is retained in a species despite the loss of the corresponding editing site [5,21,22]. If a site-recognition factor is conserved throughout evolution, this should be reflected in the conservation of the corresponding editing-site cis-element, an assumption that was supported by a recent analysis of the psbL start codon editing site in 28 species, and the ndhD start codon editing site in 21 species [12].

In an attempt to understand editing-site evolution at a higher resolution, we took advantage of the thousands of sequences from previous phylogenetic studies that are available for the chloroplast reading frame of the matK protein. We analyzed (i) the evolutionary pattern of matK editing sites in angiosperm evolution; (ii) the conservation of editing activity in angiosperms; and (iii) the conservation of editing cis-elements throughout angiosperm phylogeny.

Results

Intrageneric loss of matK editing sites in angiosperms

matK is a chloroplast gene located within the trnK intron that is believed to play a role in RNA splicing of tRNA-K (UUU) [23,24]. matK is an expressed gene [25], and in many monocots, matK transcripts are edited at a single site, termed matK-1 [26]. We recently identified an additional editing site in Arabidopsis, referred to as matK-2, at nucleotide position 706 (codon 236) relative to the start codon [27]. The corresponding editing event leads to a codon change from histidine (CAU) to tyrosine (UAU). Here, we found a third site, matK-3, located 70 nucleotides downstream of matK-2, that leads to a serine (UCU) to phenylalanine (UUU) codon transition (codon 259, see below).

The rapidly evolving matK gene has been a favorite for determining phylogenetic
relationships in angiosperms. As a consequence, several thousand matK entries covering the entire angiosperm phylogenetic tree have accumulated in GenBank. We obtained and aligned 1255 matK sequences from all major angiosperm groups as well as several gymnosperm species, focusing our analysis on determining whether a C or a T was present at these two newly identified editing sites. For phylogenetic analysis, we mapped our findings onto two phylogenetic trees, one for each editing site ([28]; see Additional files and 2). The leaves of the tree represent genera, which can include several species. Because both trees consist predominantly of C-containing genera, the most parsimonious assumption is that the common ancestors of all angiosperms had a C at the editing site. In contrast, the gymnosperm taxa analyzed have a T at matK-2 and an A at matK-3. Whether the site was lost in gymnosperms or gained in angiosperms cannot be determined based on our data. We were unable to extend our alignment to more basal embryophyte groups, such as mosses and ferns, due to extreme sequence divergence. Taken together, these data suggest that the matK-2 and matK-3 editing sites were already present in the ancestor of all angiosperms.

Given that the editing sites are ancestral, we next asked how many times the sites have been lost during angiosperm evolution. We first sought situations in the tree that are indicative of C-to-T transitions within genera. In most cases, all species within a genus share the same editing site. For example, 24 species in the genus Ceanothus carry a C at matK-2 (see Additional file 3). However, in six of the 298 genera analyzed, there are species that possessed either a C or a T at matK-2, suggestive of a recent base transition. Similarly, seven of the genera analyzed include species with either a C or a T at matK-3. We call such taxa "mixed genera" (see Additional File – Table S1).
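The genus-level bookkeeping described above can be illustrated with a small sketch. It is not the authors' pipeline: the function name `classify_genera`, the record layout and the toy records are all hypothetical, and the label "other" for genera containing A- or G-carrying species is an assumption made here for illustration only.

```python
from collections import defaultdict


def classify_genera(records):
    """Label each genus by the bases its species carry at one editing site.

    records: iterable of (genus, species, base) tuples, where base is the
    DNA-level nucleotide found at the editing-site column of the alignment.
    (Hypothetical input layout, chosen for this sketch.)
    """
    bases_by_genus = defaultdict(set)
    for genus, _species, base in records:
        bases_by_genus[genus].add(base.upper())

    labels = {}
    for genus, bases in bases_by_genus.items():
        if bases == {"C"}:
            labels[genus] = "C"        # every sampled species has a C
        elif bases == {"T"}:
            labels[genus] = "T"        # every sampled species has a T
        elif bases == {"C", "T"}:
            labels[genus] = "mixed"    # C- and T-carrying species coexist
        else:
            labels[genus] = "other"    # an A or G occurs (assumed label)
    return labels


# Hypothetical toy records, not data from the study.
toy_records = [
    ("GenusA", "sp1", "C"),
    ("GenusA", "sp2", "C"),
    ("GenusB", "sp1", "C"),
    ("GenusB", "sp2", "T"),
    ("GenusC", "sp1", "T"),
    ("GenusD", "sp1", "A"),
]
toy_labels = classify_genera(toy_records)
```

In this toy input only GenusB would count as a mixed genus in the sense used in the text, since it contains both a C-carrying and a T-carrying species.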
Rarely, we also found mixed genera with A- or G-containing species in addition to T- or C-containing species (see Additional File – Table S2). All mixed genera are nested in branches heavily dominated by pure C-containing genera (e.g., see Additional file 3), suggesting that C-losses occurred independently within these genera.

Frequent and widespread loss of editing sites within larger angiosperm taxa

If intrageneric loss of editing does occur, it should also be evident on a larger scale. We therefore assessed the distribution of pre-edited (T at the DNA level) branches of the angiosperm phylogenetic tree that are particularly rich in available matK sequences (i.e., Rosids, Saxifragales, Asterids, Caryophyllids, Magnoliids and basal eudicots). Coherent sections of genera without an editing site, for example the Solanaceae/Convolvulaceae, were treated as a unit. We asked whether such pre-edited units are separated from other such units, which would suggest that they had lost editing independently. Only pre-edited units for which sister groups at the next three nodes in the tree contained 80% or more genera with a C at the editing site were regarded as having independently lost the editing site (see Additional file 3). A- and G-containing genera were not considered. By these criteria, we found evidence for 12 independent losses of edited Cs for matK-2 and another 12 for matK-3; these were widely distributed throughout the angiosperm tree (see Figure and see Additional file – Table S1). If the intrageneric losses noted above are included here, the number of independent losses for matK-2 and matK-3 rises to 17 and 19, respectively (Figure 1). Only the asterid genera Gilia and Plantago have lost both matK sites, underscoring that editing-site loss – even that of physically linked sites – is totally independent (Figure 1).

We found no evidence for reversion (i.e., T-to-C back-mutations) for matK-2, even within the purely T-containing, large monocot branch. This might indicate the existence of a selective bias towards losing the editing site. It is clear, however, that there are multiple independent losses of the matK editing sites throughout angiosperm phylogeny.

Figure 1. Multiple losses of matK editing sites in angiosperms. A) Nucleotides found at the matK-2 editing site were mapped on a phylogenetic tree encompassing all major angiosperm groups (Soltis et al. 2000). Of the 298 genera investigated, only those that represent independent C-to-T mutations at the editing site are shown (criteria for an independent C-to-T loss are presented in Additional file 3). Additional C-to-T mutations for which independence could not be ascertained are not shown. Branches of the tree without independent C-to-T losses are reduced. The full tree is shown in Additional file. Light gray = genera in which all species have a T at the editing site; dark gray = genera containing T-species and C-species. B) Same analysis for matK-3; full data is shown in Additional file.

Loss of C-to-U processing in independent branches of the angiosperm tree at matK-2 and matK-3

The presence of a C at a known editing site is considered good evidence for the presence of a corresponding editing activity. For example, editing events have been successfully predicted by extrapolation from known sites for Atropa belladonna and Pisum sativum [29,30]. Here, we sequenced amplified cDNA from leaf tissue to investigate RNA editing of matK-2 and matK-3 in 17 and 14 different angiosperm species, respectively, from disparate sections of the angiosperm phylogenetic tree (see Additional file 6). All species chosen had a C at the matK-2 editing site in the plastid genome. Unexpectedly, we found that matK-2 was processed in only seven species (41.2%). In six of these, a C-peak was evident side-by-side with the T-peak in electropherograms. Thus, only a fraction of all transcripts is processed. No editing was detected in RNA samples from the remaining ten species. The loss of editing activity for matK-3 was not quite as dramatic; but again, no evidence for editing could be found for two species, and most of the remaining species exhibited only partial editing (see Additional file 6). We call species with a C at the editing site but no detectable editing activity "non-editors", while species that process the C to a U are called "editors". We conclude that editing activities for the matK sites have most likely been lost in these species, although the possibility that editing does occur in different tissues under different conditions cannot be ruled out at the moment.

To understand the phylogenetic distribution of the underlying RNA editing activities, we mapped our results on a phylogenetic tree (Figure 2B). Editing activities are found at widely separated positions of this tree. For example, editors and non-editors for matK-2 are found both in the eurosids I and the eurosids II. Similarly, matK-2 editors and non-editors are also present side-by-side in lamiids and campanulids within the asterid clade. This situation is repeated for matK-3, where the two species that have lost editing activity are from separate larger taxa: Reseda from the rosids and Buddleja from the asterids. Taken together with the ancestral nature of the matK editing sites, noted above, these findings argue for multiple independent losses of the editing activities.

To investigate whether these losses are reflected in the corresponding cis-elements, we generated a consensus sequence for all plants capable of editing and compared it with sequences from the non-editing plants (Figure 2A). We found that almost all non-editors contain one or multiple deviations from the consensus sequence deduced from the set of editors, suggesting a correlation between the loss of the editing activity and the evolutionary degeneration of the cis-element.

Conservation of putative recognition elements for a matK-2 trans-acting factor

Editing sites are recognized by RNA binding proteins that bind sequence elements immediately upstream of the C-residue to be edited. As long as binding and editing processes continue to occur, selection is expected to act to preserve these cis-elements. By contrast, it is expected that the loss of editing would be accompanied by the loss of conservation of trans-acting factor binding-site sequences. To identify such sequence elements, we prepared separate alignments of sequences containing a C and those containing a T at the matK editing sites (henceforth called C-elements and T-elements, respectively). To avoid a bias toward species-rich genera, we randomly selected one sequence from each genus. The sequences were aligned and analysed using the WebLogo software [31] in order to visualize sequence conservation, and alignments were scored from position -30 to +10, where the editing site is +1. Figure 2C shows a comparison of the conservation of this sequence window between C- and T-containing matK-2 and matK-3 sites.

The following three conservational classes for individual nucleotides can be distinguished:

(i) Nucleotide positions that are conserved in both C- and T-elements; for example, at positions -27 to -25, -6 to -4 and +8 to +10 relative to matK-2, and -17 to -15 upstream of matK-3. These include third-codon positions (e.g., matK-2 positions -25 and -4; matK-3 positions -14 and -20), for which other evolutionary constraints apart from coding must be responsible.

(ii) Nucleotide positions that are variable in both T- and C-elements, mostly third-codon positions (e.g., matK-2 positions -22, -19 and -16).

(iii) Nucleotide positions that are conserved only in editors. For matK-2, we found five highly conserved positions at -7, -17, -18, -24 and -30 of C-elements, whereas the corresponding positions in T-elements are much more variable. Conservation of the dominant base at these positions is 100% (-7, -17), 96% (-18), 93% (-24) and 88% (-30) in C-elements, but only 83% (-7), 55% (-17), 62% (-18, -24) and 45% (-30) in T-elements (see also arrows in Figure 2C). Notably, the highly conserved T at base -7 in C-elements is at a third-codon position. An analysis of a longer stretch of sequence upstream of the matK-2 editing site revealed that differential conservation terminates at position -30, and thus coincides with the location of the expected cis-element for editing (data not shown). For matK-3, such differential conservation between C-elements and T-elements is less pronounced, although differences exist at positions -4, -7, -8, -12, -24, -27 and -28.

Figure 2. Analysis of the evolution of cis-elements upstream of matK-2 and matK-3. A) Schematic representation of the genomic region encompassing the matK-2 and matK-3 editing sites. Edited Cs and corresponding codon transitions are shown in blue; other bases and corresponding codons at the editing site are shown in red. Numbers above refer to the nucleotide position relative to the first base of the matK reading frame in Arabidopsis. This sequence interval was used to generate matK alignments. B) Alignment of the sequence interval from -30 to +10 around both matK editing sites. Green = species that show editing at the respective matK site ("editors"; see Additional file 6); red = species with no detectable editing ("non-editors") or with no C at the editing site. A consensus sequence was generated based on all edited sequences for each site. Deviations from this consensus are marked in white. Sequences are ordered according to phylogenetic position (Soltis et al. 2000) (Eur = eurosids; V = Vitaceae; Sax = Saxifragales; Car = caryophyllids; Ast = asterids; M = magnoliids). Third-codon positions are marked with asterisks. C) Analysis of sequence conservation in sequences containing a C at the editing site (C-element; blue border) and in sequences without a C (T-element; red border). Sequences from n different genera were aligned and analyzed using the WebLogo software. Note that n includes one species from each genus in the matK trees shown in Additional files and 2, and not just those analyzed in B. Residues exhibiting differential conservation are marked with blue arrows. The two most variable residues are marked with bold arrows. Third-codon positions are marked with asterisks.

These comparisons demonstrate that selected upstream bases and
the C at the editing site have co-evolved. Furthermore, high conservation of several third-codon positions in both C- and T-elements suggests a selective force that is independent of both amino-acid coding and the editing site at these positions. Finally, a stronger conservation of bases in T-sites relative to C-sites was not observed for matK-2 or matK-3, supporting the conclusion that the observed conservation bias is functionally linked to the editing site.

Discussion

Loss of matK editing sites in angiosperm evolution

It is impossible to clearly infer the loss or gain of an editing site by examining a limited set of sequences because any conclusion drawn ultimately relies on only one informative site: the editing site itself. Thus, an understanding of the evolutionary history of RNA editing sites requires an analysis of a large set of related sequences. We have therefore investigated the evolutionary behavior of two editing sites and their presumptive cis-elements in the matK gene, an approach that allows us to track the editing site throughout a continuum of related angiosperm sequences. Our results show that C dominates the phylogenetic trees for both matK-2 and matK-3 sites; thus, the most parsimonious explanation is that both editing sites were already present in the ancestor of all angiosperms. A closer analysis of the distribution of species and genera lacking a C at the editing sites suggests that the C at both matK sites was lost independently on multiple occasions. These data support earlier work suggesting that ancient angiosperms contained high numbers of editing sites that were lost independently in separate taxonomic branches during angiosperm evolution [32]. Our results are also consistent with a study on the evolution of mitochondrial editing that described multiple independent losses of editing sites in selected monocot taxa [33]. Importantly, these studies collectively explain the variability of editotypes
among angiosperm species solely by invoking loss of editing sites, and do not require a presumption of balanced loss and gain of sites. Although preliminary, our results show no evidence for re-acquisition of matK editing sites, as exemplified for matK-2 in the purely T-carrying monocots. This suggests that at least these two sites, and by extrapolation, possibly all plastid-editing sites, are "on the way out".

Loss of matK editing activity in angiosperm evolution

An unexpected finding of this study is the loss or reduction in RNA editing in many species despite the presence of a C at the editing site. Reduced editing can be caused either by the degeneration of nuclear-encoded editing factors or by the degeneration of plastidial cis-elements that direct the editing machinery. Based on the assumption that the matK editing activities are ancient (like the sites themselves; see above), we argue for multiple independent losses of editing at both sites during angiosperm evolution.

Editing of matK-3 leads to a change in the codons that results in incorporation of very different amino acids: serine and phenylalanine. Given the nature of this difference, it is remarkable that Buddleja and Reseda tolerate the loss of this editing event. By contrast, the rather minor physicochemical change provided by an H-to-Y amino acid transition mediated by matK-2 editing might be less critical for protein function. Among the codon transitions caused by chloroplast RNA editing, this codon transition is one of the rarest and therefore might not be as important for protein function as the much more frequent S-to-L or P-to-L transitions. The MatK protein may tolerate both amino acids, in which case the loss of RNA editing would have only limited consequences for protein function. If it is indeed selectively neutral, the frequent loss of C observed here might be specific for matK editing and thus not generalizable to truly essential RNA-editing sites. However, the fact that several independent C-to-T mutations, but no T-to-C
back-mutations, are observed at both sites suggests that the edited amino acid is under positive selection. A reduction or a loss of editing could generate such a selective pressure for a C-to-T mutation and lead to the elimination of an editing site. Therefore, our results suggest that a decay in editing efficiency precedes the loss of editing sites, as proposed by Shields and Wolfe [10]. Whether a degeneration of editing factors or their corresponding cis-elements is responsible for the reductions in editing efficiency observed here cannot be determined by our analyses because the reductions co-occur with mutations in cis-elements. Notably, chloroplast genomes display an enhanced genetic drift and accumulate mildly deleterious point mutations [34]. MatK is one of the most rapidly diverging plastidial genes and exhibits a relatively high rate of degeneration [25]. Therefore, we speculate that it is rather the degeneration of cis-elements that leads to the observed reduction in editing efficiencies, which in turn generates the selective pressure responsible for the frequent losses of editing sites by C-to-T mutations.

Cis-elements of matK editing sites are under multiple selective constraints

We have carried out a phylogenetic analysis of predicted matK-2 and matK-3 cis-elements in order to identify a putative conserved binding site for the corresponding (unknown) trans-acting factor(s). Many bases in these cis-elements are conserved in both C- and T-elements, mostly due to coding constraints, but several third-codon positions are also conserved. This could mean that selection is acting on all analyzed sequences, no matter which base is present at the editing site. If this selection is sufficiently stringent and acts on all bases, there should be no conservation bias towards C-elements. Irrespective of RNA editing function, a factor binding to this sequence, either on DNA or RNA, could
provide such a selective force.

Our analysis uncovered five bases that are highly conserved in sequences containing the matK-2 editing site, but not in those lacking the site. A co-evolution of these bases with the editing site most likely reflects a function for these bases in editing-site processing or recognition. Such co-evolving nucleotides have recently been identified for two chloroplast editing sites, albeit in a much smaller taxon sampling [12]. Intriguingly, in vitro studies have demonstrated that bases within such cis-elements have strikingly unequal impacts on RNA editing [35,36]. For example, mutations of the -2 and -3 nucleotides of the psbE editing site led to a pronounced reduction in editing efficiency in vitro, while mutations at the adjacent -4/-5 and +2/+3 sites had only minor effects [35]. Similar major effects of single bases on editing have also been observed in vivo [37]. The position-specific inhibition of activity is not reflected in a similar inhibition of binding: all mutated versions of cis-elements appear to be equally good binding sites for (unknown) trans-acting factors [35]. Thus, the bases co-conserved with the matK-2 editing site might be important for RNA editing activity, while their role in binding of trans-acting factors could be minor. In other words, the same RNA-binding protein that attaches to C-elements might also bind to T-elements. Such a factor could perform an additional function (or functions) unrelated to editing, and conserved bases could be important for such secondary function(s) of the editing factor. This would explain why bases are conserved at several third-codon positions in both C- and T-elements.

Recently, PPR proteins have been identified as editing factors [13,14]. Although these proteins are highly conserved between rice and Arabidopsis [20], their target Cs are not: only nine editing positions are conserved between rice [38] and Arabidopsis [39]. For instance, the ndhD
editing site, served by CRR21 in Arabidopsis, is lacking in rice; however, despite the absence of the corresponding site, an orthologous protein can be readily identified (data not shown). The simplest explanation is that these factors may be involved in editing, but also serve additional, evolutionarily more stable functions.

Our finding that many species carrying a C at the editing site lack editing activity might indicate that the corresponding factors have been lost. Such a loss-of-factor scenario would be consistent with several studies demonstrating that the transfer of editing sites from one species to another often leads to a failure to process the heterologous site, which is indicative of a loss of the corresponding editing factor [4,21,40]. Three observations, however, speak against this simple loss-of-factor scenario: (i) several transferred sites are heterologously edited [21,22]; (ii) PPRs, the bona fide editing factors, are conserved in angiosperms and are thus not reflective of editotype variability; (iii) our phylogenetic analysis uncovered sequence conservation in cis-elements at third-codon positions, not only in editors but also in non-editors and T-carriers. These considerations lead us to hypothesize that the factors are conserved and still bind cis-elements, but that their editing activity is compromised because of mutations that disrupt protein structure/function or subtly alter RNA binding properties. Determining whether known editing factors have additional functions, and whether these functions are conserved in species that are devoid of the cognate editing site, would be of great value in testing this hypothesis.

Conclusion

In this paper, we focused on the evolution of chloroplast editing sites in angiosperms. We demonstrate for the entire angiosperm radiation that editing sites have been lost independently multiple times. Our data also uncover a surprisingly frequent reduction or loss of the corresponding activity in selected taxa. Finally, this large-scale
analysis helped to detect nucleotides with close co-evolutionary ties to the edited C. The additional finding that evolutionary conservation of third-codon positions can be detected even in the absence of an edited C supports the idea that interactions of trans-acting factors with sequence elements surrounding editing sites also take place for reasons other than RNA editing.

Methods

Plant material

All leaf material was collected in the Botanical Garden of Marburg, Germany.

RNA preparation/RT-PCR analysis

RNA extraction was performed using the TRIzol Reagent according to the supplier's instructions, or by a cetyltrimethylammonium bromide (CTAB)-based method as described by Zeng and Yang [41]. Five to eight micrograms of RNA were treated with DNaseI (Roche, 40 u, h, 37°C) to remove any DNA contamination. The RNA was then purified by two phenol/chloroform extractions, one chloroform extraction and an ethanol/salt precipitation step. cDNA sequences were amplified by PCR (Qiagen) after reverse transcription using the Omniscript RT-Kit (Qiagen) employing random hexamers, or the One-Step RT-PCR Kit (Qiagen). An aliquot of RNA that was not reverse transcribed served as a control PCR template for DNA contamination. Total cellular DNA was extracted using a standard CTAB-based method.

Oligonucleotides

The following oligonucleotides (5'>3') were used to amplify matK-2 and matK-3 sequences from DNA or cDNA: rctccttctttgcatttattgcg (matk.for.a), gctccttctttgcatttattgag (matk.for.b), gcctcttctttgcatttattgcg (matk.for.c), gcctcttctttgcatttattacg (matk.for.d), ccttcttctttacattttttacg (matk.for.e), acctcttctttgcatttattaag (matk.for.f), catgaaaggatccttgaacaacc (matk.rev.z), catgaagagatcctcgaggaacc (matk.rev.y), agagaarggktctttgaaaagcc (matk.rev.x), awgaaaagkatctttgaaaaacc (matk.rev.w), catgaaaggatccttsaacaaca (matk.rev.v), tatgaaaggattcttgaacaaac (matk.rev.u) and cgcaaaaggatccttaagtaacc (matk.rev.t).

Sequence analysis

PCR products were purified using the NucleoSpin Extract II-Kit (Macherey and Nagel) and sequenced using DYEnamic ET chemistry (GE Healthcare) according to the supplier's instructions. The products of the sequencing reactions were analyzed on an ABI 377 automated sequencer (Applied Biosystems) according to the manufacturer's instructions.

Phylogenetic analysis

A total of 1255 matK sequences covering 298 major angiosperm genera were obtained from GenBank. All genera represent leaves of phylogenetic trees constructed by Soltis et al. [28] (see Additional files 1 and 2). Sequences were aligned by ClustalW using default parameters [42], resulting in an alignment of 3455 nt. A 201-bp sequence window comprising the editing sites matK-2 and matK-3 was extracted from this alignment for analysis.

Authors' contributions

MT conceived the study, carried out the cDNA analyses together with KS and participated in the sequence alignment. VLS and AvH generated the sequence alignment and mapped the results for matK editing sites on the phylogenetic tree. CSL participated in the design of the study and wrote the draft manuscript. All authors contributed to writing the manuscript and drawing the figures, and all approved the final version.

Additional material

Additional file 1
Evolution of matK-2 editing sites in angiosperms. This phylogenetic tree shows all detected losses of the matK-2 editing site during angiosperm evolution.
[http://www.biomedcentral.com/content/supplementary/14712148-9-201-S1.pdf]

Additional file 2
Evolution of matK-3 editing sites in angiosperms. This phylogenetic tree shows all detected losses of the matK-3 editing site during angiosperm evolution.
[http://www.biomedcentral.com/content/supplementary/14712148-9-201-S2.pdf]

Additional file 3
Examples of matK-2 editing sites lost during angiosperm evolution. An excerpt of the phylogenetic tree shown in Additional file 1, labeled to demonstrate the
method used to evaluate losses of editing sites during matK evolution in angiosperms.
[http://www.biomedcentral.com/content/supplementary/14712148-9-201-S3.pdf]

Additional file 4
List of independent C-to-T losses in angiosperm evolution at matK editing sites. A table listing all C-to-T losses at matK editing sites identified in this study based on the analysis of the phylogenetic trees shown in Additional files 1 and 2.
[http://www.biomedcentral.com/content/supplementary/14712148-9-201-S4.pdf]

Additional file 5
List of independent C-to-A and C-to-G losses in angiosperm evolution at the matK-3 editing site. A table listing all C-to-A and C-to-G mutations at matK editing sites identified in this study based on the analysis of the phylogenetic trees shown in Additional files 1 and 2.
[http://www.biomedcentral.com/content/supplementary/14712148-9-201-S5.pdf]

Additional file 6
Analysis of matK-2 and matK-3 editing in selected species. Excerpts from cDNA sequencing electropherograms are shown to demonstrate the extent of editing in selected angiosperm species.
[http://www.biomedcentral.com/content/supplementary/14712148-9-201-S6.pdf]

Acknowledgements

We thank the students who participated in the Cell-Biology class of 2005 at the University of Marburg for their help in analyzing the editing status of matK in several angiosperm species, and Helena T Funk and Peter Poltnigg, who tutored the course together with MT. Special thanks to Marc Appelhans for his guidance through the botanical garden of Marburg. This work was supported by the Deutsche Forschungsgemeinschaft (SFB-TR1 to UGM, Emmy-Noether stipend to CSL); AvH also thankfully acknowledges financial support from the Vienna Science and Technology Fund (WWTF).

References

1. Wolf PG, Rowe CA, Hasebe M: High levels of RNA editing in a vascular plant chloroplast genome: analysis of transcripts from the fern Adiantum capillus-veneris. Gene 2004, 339:89-97.
2. Kugita M, Yamamoto Y, Fujikawa T, Matsumoto T, Yoshinaga K: RNA editing in hornwort chloroplasts makes more than half the genes functional. Nucleic Acids Res 2003, 31(9):2417-2423.
3. Freyer R, Kiefer-Meyer MC, Kossel H: Occurrence of plastid RNA editing in all major lineages of land plants. Proc Natl Acad Sci USA 1997, 94(12):6285-6290.
4. Bock R, Kossel H, Maliga P: Introduction of a heterologous editing site into the tobacco plastid genome: the lack of RNA editing leads to a mutant phenotype. EMBO J 1994, 13(19):4623-4628.
5. Schmitz-Linneweber C, Kushnir S, Babiychuk E, Poltnigg P, Herrmann RG, Maier RM: Pigment Deficiency in Nightshade/Tobacco Cybrids Is Caused by the Failure to Edit the Plastid ATPase alpha-Subunit mRNA. Plant Cell 2005, 17:1815-1828.
6. Schmitz-Linneweber C, Barkan A: RNA splicing and RNA editing in chloroplasts. In Cell and Molecular Biology of Plastids. Volume 19. Edited by: Bock R. Berlin, Heidelberg: Springer; 2007:213-248.
7. Maier RM, Neckermann K, Igloi GL, Kossel H: Complete sequence of the maize chloroplast genome: gene content, hotspots of divergence and fine tuning of genetic information by transcript editing. J Mol Biol 1995, 251(5):614-628.
8. Kahlau S, Aspinall S, Gray JC, Bock R: Sequence of the tomato chloroplast DNA and evolutionary comparison of solanaceous plastid genomes. J Mol Evol 2006, 63(2):194-207.
9. Sasaki T, Yukawa Y, Miyamoto T, Obokata J, Sugiura M: Identification of RNA Editing Sites in Chloroplast Transcripts from the Maternal and Paternal Progenitors of Tobacco (Nicotiana tabacum): Comparative Analysis Shows the Involvement of Distinct Trans-Factors for ndhB Editing. Mol Biol Evol 2003, 20(7):1028-1035.
10. Shields DC, Wolfe KH: Accelerated evolution of sites undergoing mRNA editing in plant mitochondria and chloroplasts. Mol Biol Evol 1997, 14(3):344-349.
11. Fiebig A, Stegemann S, Bock R: Rapid evolution of editing sites in a small non-essential plastid gene. Nucleic Acids Res 2004, 7:3615-3622.
12. Hayes ML, Hanson MR: High Conservation of a 5' Element Required for RNA Editing of a C Target in Chloroplast psbE Transcripts. J Mol Evol 2008, 67(3):233-245.
13. Okuda K, Myouga F, Motohashi R, Shinozaki K, Shikanai T: Conserved domain structure of pentatricopeptide repeat proteins involved in chloroplast RNA editing. Proc Natl Acad Sci USA 2007, 104(19):8178-8183.
14. Kotera E, Tasaka M, Shikanai T: A pentatricopeptide repeat protein is essential for RNA editing in chloroplasts. Nature 2005, 433(7023):326-330.
15. Chateigner-Boutin AL, Ramos-Vega M, Guevara-Garcia A, Andres C, de la Luz Gutierrez-Nava M, Cantero A, Delannoy E, Jimenez LF, Lurin C, Small I, et al.: CLB19, a pentatricopeptide repeat protein required for editing of rpoA and clpP chloroplast transcripts. Plant J 2008, 56(4):590-602.
16. Zehrmann A, Verbitskiy D, Merwe JA van der, Brennicke A, Takenaka M: A DYW Domain-Containing Pentatricopeptide Repeat Protein Is Required for RNA Editing at Multiple Sites in Mitochondria of Arabidopsis thaliana. Plant Cell 2009, 21(2):558-567.
17. Robbins JC, Heller WP, Hanson MR: A comparative genomics approach identifies a PPR-DYW protein that is essential for C-to-U editing of the Arabidopsis chloroplast accD transcript. RNA 2009, 15(6):1142-1153.
18. Cai W, Ji D, Peng L, Guo J, Ma J, Zou M, Lu C, Zhang L: LPA66 Is Required for Editing psbF Chloroplast Transcripts in Arabidopsis. Plant Physiol 2009, 150:1260-1271.
19. Zhou W, Cheng Y, Yap A, Chateigner-Boutin AL, Delannoy E, Hammani K, Small I, Huang J: The Arabidopsis gene YS1 encoding a DYW protein is required for editing of rpoB transcripts and the rapid development of chloroplasts during early growth. Plant J 2009, 58:82-96.
20. O'Toole N, Hattori M, Andres C, Iida K, Lurin C, Schmitz-Linneweber C, Sugita M, Small I: On the expansion of the pentatricopeptide repeat gene family in plants. Mol Biol Evol 2008, 25(6):1120-1128.
21. Tillich M, Poltnigg P, Kushnir S, Schmitz-Linneweber C: Maintenance of plastid RNA editing activities independently of their target sites. EMBO Rep 2006, 7:308-313.
22. Karcher D, Kahlau S, Bock R: Faithful editing of a tomato-specific mRNA editing site in transgenic tobacco chloroplasts. RNA 2008, 14(2):217-224.
23. Neuhaus H, Link G: The chloroplast tRNA(Lys) (UUU) gene from mustard (Sinapis alba) contains a class II intron potentially encoding for a maturase-related polypeptide. Curr Genet 1987, 11:251-257.
24. Ems SC, Morden CW, Dixon CK, Wolfe KH, dePamphilis CW, Palmer JD: Transcription, splicing and editing of plastid RNAs in the nonphotosynthetic plant Epifagus virginiana. Plant Mol Biol 1995, 29(4):721-733.
25. Barthet MM, Hilu KW: Expression of matK: functional and evolutionary implications. Am J Bot 2007, 94:1402-1412.
26. Tillich M, Schmitz-Linneweber C, Herrmann RG, Maier RM: The plastid chromosome of maize (Zea mais): Update of the complete sequence and transcript editing sites. Maize Genet Corp News Letters 2001, 75:42-44.
27. Tillich M, Funk HT, Schmitz-Linneweber C, Poltnigg P, Sabater B, Martin M, Maier RM: Editing of plastid RNA in Arabidopsis thaliana ecotypes. Plant J 2005, 43(5):708-715.
28. Soltis DE, Soltis PS, Chase MW, Mort ME, Aalbach DC, Zanis M, Savolainen V, Hahn WH, Hoot SB, Fay MF, et al.: Angiosperm phylogeny inferred from 18S rDNA, rbcL, and atpB sequences. Bot J Lin Soc 2000, 133:381-461.
29. Schmitz-Linneweber C, Regel R, Du TG, Hupfer H, Herrmann RG, Maier RM: The plastid chromosome of Atropa belladonna and its comparison with that of Nicotiana tabacum: the role of RNA editing in generating divergence in the process of plant speciation. Mol Biol Evol 2002, 19(9):1602-1612.
30. Inada M, Sasaki T, Yukawa M, Tsudzuki T, Sugiura M: A systematic search for RNA editing sites in pea chloroplasts: an editing event causes diversification from the evolutionarily conserved amino acid sequence. Plant Cell Physiol 2004, 45(11):1615-1622.
31. Crooks GE, Hon G, Chandonia JM, Brenner SE: WebLogo: a sequence logo generator. Genome Res 2004, 14(6):1188-1190.
32. Tillich M, Lehwark P, Morton BR, Maier UG: The evolution of chloroplast RNA editing. Mol Biol Evol 2006, 23(10):1912-1921.
33. Lopez L, Picardi E, Quagliariello C: RNA editing has been lost in the mitochondrial cox3 and rps13 mRNAs in Asparagales. Biochimie 2007, 89(1):159-167.
34. Lynch M, Blanchard JL: Deleterious mutation accumulation in organelle genomes. Genetica 1998, 102–103(1–6):29-39.
35. Miyamoto T, Obokata J, Sugiura M: A site-specific factor interacts directly with its cognate RNA editing site in chloroplast transcripts. Proc Natl Acad Sci USA 2004, 101(1):48-52.
36. Heller WP, Hayes ML, Hanson MR: Cross-competition in editing of chloroplast RNA transcripts in vitro implicates sharing of trans-factors between different C targets. J Biol Chem 2008, 283(12):7314-7319.
37. Reed ML, Peeters NM, Hanson MR: A single alteration 20 nt 5' to an editing target inhibits chloroplast RNA editing in vivo. Nucleic Acids Res 2001, 29(7):1507-1513.
38. Corneille S, Lutz K, Maliga P: Conservation of RNA editing between rice and maize plastids: are most editing events dispensable? Mol Gen Genet 2000, 264(4):419-424.
39. Chateigner-Boutin AL, Small I: A rapid high-throughput method for the detection and quantification of RNA editing based on high-resolution melting of amplicons. Nucleic Acids Res 2007, 35(17):e114.
40. Reed ML, Hanson MR: A heterologous maize rpoB editing site is recognized by transgenic tobacco chloroplasts. Mol Cell Biol 1997, 17(12):6948-6952.
41. Zeng Y, Yang T: RNA isolation from highly viscous samples rich in polyphenols and polysaccharides. Plant Mol Biol Rep 2002, 20:417a-417e.
42. Thompson J, Higgins D, Gibson T: Clustal W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 1994, 22:4673-4680.

Figure 1. Multiple losses of matK editing sites in angiosperms. A) Nucleotides found at the matK-2 editing site... multiple independent losses of editing at both sites during angiosperm evolution. Editing of matK-3 leads to a change in the codons that results in incorporation of very different amino acids: serine...
Figure 2. Analysis of the evolution of cis-elements upstream of matK-2 and matK-3. A) Schematic representation of the genomic
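
The comparison of C-elements and T-elements described in the Discussion and Methods (a sequence window extracted from the alignment, then inspected for bases conserved only when a C is present at the editing site) can be illustrated with a small sketch. It is not the authors' pipeline: the toy alignment, the taxon names, the column indices and the 0.9 threshold are all hypothetical and chosen only to make the idea concrete.

```python
from collections import Counter


def group_by_site_base(alignment, site):
    """Group aligned sequences by the base found at column `site` (0-based)."""
    groups = {}
    for seq in alignment.values():
        groups.setdefault(seq[site].upper(), []).append(seq.upper())
    return groups


def column_conservation(seqs, start, end):
    """Per column in [start, end): fraction of non-gap bases equal to the most common base."""
    scores = []
    for col in range(start, end):
        counts = Counter(s[col] for s in seqs if s[col] != "-")
        total = sum(counts.values())
        scores.append(max(counts.values()) / total if total else 0.0)
    return scores


def coconserved_columns(alignment, site, start, end, threshold=0.9):
    """Columns conserved among C-carriers but not among T-carriers (hypothetical criterion)."""
    groups = group_by_site_base(alignment, site)
    c_scores = column_conservation(groups.get("C", []), start, end)
    t_scores = column_conservation(groups.get("T", []), start, end)
    return [
        start + i
        for i, (c, t) in enumerate(zip(c_scores, t_scores))
        if c >= threshold and t < threshold
    ]


# Toy alignment (invented data): column 5 plays the role of the editing site,
# columns 0-4 the upstream window.
toy_alignment = {
    "taxon_a": "ACGTTCAG",
    "taxon_b": "ACGTTCAG",
    "taxon_c": "ACGATCAG",
    "taxon_d": "ATGATTAG",
    "taxon_e": "AGGCTTAG",
}

candidates = coconserved_columns(toy_alignment, site=5, start=0, end=5)
print(candidates)
```

In this toy case only column 1 is invariant among the C-carriers while varying among the T-carriers; columns conserved in both groups (0, 2 and 4) correspond to the situation the text attributes to coding or other constraints that are independent of the base at the editing site.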

Date posted: 14/12/2017, 17:07