0.9 PAML BranchSite Modelc FELd p< 0.2 RELd BF > 50 25 28 0.07 Functional Informationf Cladeg AA Parallel Property Changes Changes Changese Protein Domain Ser-Arg SM, P, NEUP.POS LRR3 B, C Leu-Trp NP, NEU-P, NEU LRR3 A, D Leu-Pro NP, NEU-SM, NP, NEU SM, P, NEUNP, NEU LRR4 A, F LRR6 Yes 45 0.16 Thr-Ile Yes 104 0.17 Leu-Val NP, NEU-NP, NEU Leu-Ser NP, NEU-SM, P, NEU 128 0.875 Glu-Pro P, NEG-SM, NP, NEU LRR7 D 133 0.723 Asn-Lys SM, P, NEUP, POS LRR7 G 139 0.708 Gly-Glu SM, NP, NEU-P, NEG LRR8 149 0.565 Ser/Leu -Thr SM, P, NEU/ NP, NEU -SM, P, NEU P, POS-P, POS P, POS-SM, P, NEG SM, P, NEUSM, P, NEU SM, P, NEU/ SM, P, NEU -NP, NEU SM, P, NEUP, POS NP, NEU-SM, P, NEU P, POS-P, NEG P, NEG-P, NEU P, NEG/P, POS-P, NEU LRR8 G LRR8 A, B, C LRR9 Adjacent to site involved in A, C, G ligand binding and interaction with MD2 LRR9 A, C, F 150 0.995 228.23 His-Arg Yes His-Asp 177 179 0.992 183 61.94 Asn-Thr Asn/Thr -Ile Asn-Lys Ile-Asn 0.07 647.96 Lys-Glu Glu-Gln Glu/LysGln 0.12 51.06 Arg-Ser Arg-Thr 204 207 Glu-His 1.000 0.08 212 221 0.1 Yes Yes 1563.58 Gly/Lys -Arg Arg-Lys Arg-Thr Lys-Arg Adjacent to site involved in interaction with MD2 G P, POS-SM, P, LRR9 NEU P, POS-SM, P, NEU C, D P, NEG-P, POS LRR10 A, D SM, NP, NEU/P, POS -P, POS P, POS-P, POSP, POS-SM, P, NEU P, POS-P, POS LRR10 A, G, C, E Leu-Val Yes P, POS-NP, NEU LRR10 A, B Val-Met Yes NP, NEU-NP, NEU LRR11 C, D, F Shen et al BMC Evolutionary Biology 2012, 12:39 http://www.biomedcentral.com/1471-2148/12/39 Page of 12 Table Positive selection at amino acid sites of cetacean TLR4 (Continued) 228 0.994 230 0.978 0.15 239 247 250b 0.14 544.88 Asp/Ser/ Cys -Asn Asp-Asn SM, P, NEG/ LRR11 SM, P, NEU/ SM, NP, NEU -SM, P, NEU SM, P, NEGSM, P, NEU A, G Gly/Glu/ Asp -Arg Asp-His SM, NP, LRR11 NEU/P, NEG/ SM, P, NEG -P.POS SM, P, NEGP, POS A, E SM, NP, NEU-P, NEU NP, NEU-SM, P, NEU SM, P, NEUNP, NEU B, D, G 50.32 Cys-Tyr 86.14 Ile-Thr Thr-Ile 0.936 Yes Asp/Ala -Lys Asp/Lys/ Ala -Asn Asn-Lys 265 Phe-Leu LRR12 LRR12 SM, P, NEG/ LRR12 SM, NP, NEU -P, POS SM, P, NEG/ P, POS/SM, NP, NEU -SM, P, NEU SM, P, NEUP, POS Yes Adjacent to site involved in C, G interaction with ligand binding Ligand binding A, E, G NP, NEU-NP, NEU SM, NP, NEU/SM, P, NEG-P, POS SM, NP, NEU-P, POS P, POS-SM, NP, NEU LRR13 B, E LRR13 Adjacent to site involved in A, C interaction with ligand binding P, NEG-SM, NP, NEU P, NEU/P, NEG-SM, NP, NEU LRR13 272 0.997 0.13 188.28 Gly/AspHis Gly-His His-Gly 280 0.952 0.18 191.07 Glu-Ala Gln/GluAla His-Arg P, POS-P.POS LRR14 D 55.05 Asp-Asn Asn-Pro G 301.87 Asn-Ser Asn-Lys Gly-Asn SM, P, NEG- LRR14 SM, P, NEU SM, P, NEUSM, NP, NEU SM, P, NEU- LRR15 SM, P, NEU SM, P, NEUP, POS SM, NP, NEU-SM, P, NEU 342 53.56 Asn-Ser Asn/SerThr 351 0.17 Ile/AlaVal 302 0.624 304 324 368 0.996 0.576 Ile-Thr Yes SM, P, NEU- LRR16 SM, P, NEU SM, P, NEU/ SM, P, NEUSM, P, NEU NP, NEU/SM, LRR16 NP, NEU-NP, NEU NP, NEU-SM, P, NEU LRR17 A, B, E Adjacent to site involved in C, E, G interaction with ligand binding (hydrogen bond) Adjacent to site involved in A interaction with ligand binding (hydrogen bond) Adjacent to site involved in G interaction with ligand binding (hydrophobic interaction) G Adjacent to site involved in interaction with ligand binding (hydrophobic interaction) Shen et al BMC Evolutionary Biology 2012, 12:39 http://www.biomedcentral.com/1471-2148/12/39 Page of 12 Table Positive selection at amino acid sites of cetacean TLR4 (Continued) 404 0.08 408 Leu-Met Ile-Thr Yes NP, NEU-NP, NEU LRR18 C NP, NEU-SM, P, NEU LRR19 A, G 409 0.19 Leu/Ile/ Phe -Val NP, NEU/NP, NEU/NP, NEU -NP, NEU LRR19 A 482 0.16 Ser/TrpPhe Phe/Ser/ Trp -Leu SM, P, NEU/ LRR22 P, NEU-NP, NEU NP, NEU/SM, P, NEU/P, NEU -NP, NEU A 542 0.903 Met-Thr Yes NP, NEU-SM, P, NEU LRRCT A, D 551 0.938 Ile-Val Val-Ile Yes NP, NEU-NP, NEU NP, NEU-NP, NEU Transmembrane B, F Val-Ala NP, NEU-SM, NP, NEU Transmembrane G 559 0.16 690 0.564 Arg-Gln P, POS-P, NEU TIR D 740 0.790 Glu-Asp P, NEG-SM, P, NEG TIR G 742 0.697 Asn-Arg SM, P, NEUP, POS TIR G Gln-Glu P, NEU-P, NEG TIR A, F 743 0.18 a Codons identified by more than one ML method were in bold and underlined Site 250 in italic was mapped onto the 3D structure of TLR4, since it directly participates in binding of LPS to TLR4 c Codons were identified by branch-site model in PAML Details were in Materials and Methods and Additional file 2: Table S2 d Codons were estimated in DATAMONKEY e SM, small; NP, nonpolar; P, polar; NEU, neutral; POS, positively charged; NEG, negatively charged f Codons were in the functional regions predicted by the three-dimensional structure in Shishido et al 2010 LRR = Leucine-rich repeat, CT = C-terminal, TIR = cytoplasmic Toll/IL-1 receptor g Amino acid substitutions occurred in the following clades: A = even-toed ungulates, B = river dolphins, C = oceanic dolpins, D = porpoises and white whales, E = sperm whales, F = baleen whales, G = more than one equally parsimonious reconstruction b Discussion Strong adaptive evolution of TLR4 during the habitat shift from land to water The present study revealed that the branch leading to whale + hippo was under the strongest positive selection at TLR4, evidenced by the highest ω value (4.59, p = 0.02) and the maximum number of specific codons (n = 9) detected by branch site model (Figure and Additional file 2: Table S2) This lineage was just before the differentiation between cetacean and hippo, both of which are regarded to share a common semi-aquatic ancestor that branched off from other artiodactyls [38] In other words, this lineage represents the habitat transition of the terrestrial ancestors of cetaceans from land to semi-aquatic habitat It is clear that pathogens were dramatically different in terms of diversity and abundance between land and water Therefore, in such a phase of habitat shift, TLR4, which interacted directly with environmental pathogenic microbes, must have been subjected to strong selective pressures Moreover, a signal of positive selection was also detected in the lineage leading to the common ancestor of cetaceans (branch f in Figure 1) This lineage represents the early evolutionary history of cetaceans from semi-aquatic to full aquatic (marine) habitat, during which the cetaceans were faced with the challenges of infectious pathogens in changing habitats Although the ω value of this branch was less than (0.4), one positively selected codon (AA324) was identified, which caused radical amino acid change from a nonpolar Gly to a polar Asn That is to say, TLR4 must have adaptively modified to recognize and bind potential novel pathogens in the new environment, which is again in accordance with the expectation of the co-evolution arms race model Shen et al BMC Evolutionary Biology 2012, 12:39 http://www.biomedcentral.com/1471-2148/12/39 Page of 12 4.5 dN/dS 3.5 2.5 1.5 0.5 0 100 200 300 400 500 600 700 800 Position of Amino Acids at TLR4 in Cetacea Figure Average ω ratio of a 20-codon sliding window along cetacean TLR4 protein sequences High values (ω > 1) indicate positive selection, whereas low values (ω < 1) indicate purifying selection The black box indicates the transmembrane domain Adaptive evolution of TLR4 associated with rapid diversification of oceanic dolphins Another strong signature of positive selection was detected along the lineage leading to oceanic dolphins, i e., the family Delphinidae (delphinids) Four (150H-R, Figure Distribution of positively selected codons in the three-dimensional structure of cetacean TLR4 The area important for ligand binding is squared in pink 179 K-E, 272 G-H, 324 N-S) adaptive AA changes were found on this lineage with a ω value of 1.33 In particular, site 272 in oceanic dolphins was identified by three ML methods and constituted the most radical change from small, nonpolar, and neutral Gly to polar and positively charged His (Table 2) The stronger level of positive selection on this lineage might have resulted from the rapid diversification and adaptive radiation that this group has experienced Molecular phylogenetic studies [24,32,33,39] have suggested that a rapid radiation and diversification that occurred near the Miocene/Pliocene boundary The delphinid clade has been the most speciose living group of Cetacea [25] (containing 35 of 89 known species) and the most ecologically versatile, occupying tropical to polar latitudes, coastal and oceanic waters, estuaries, and sometimes freshwater rivers In response to the dramatic changes in the prevalence, intensity, virulence, and diversity of microbial pathogens in various aquatic environments, innate immune genes such as TLR4, as expected, had to make evolutionarily adaptive changes that were necessary to ensure the long-term survival and successful radiation of dolphins and porpoises in the sea Domain-specific selective pressure Of the three functional domains of TLR molecules, the EXT domain is at the first line of defense against invasive pathogens and plays a key role in directly recognizing and binding PAMPs such as LPS from Gramnegative bacteria [40] According to the hypothesis of an arms race between pathogens and vertebrate immune systems, it is reasonable to find a stronger effect of positive selection in the EXT domain than in the TM and CY domains This was corroborated by most codons under positive selection being located within this region and the predominant higher codon-specific ω values being scattered in the LRR region of the EXT domain In particular, most sites under positive selection were found to fall in EXT regions interacting with LPS (Figure 3), which is similar to that found in primate TLR4 [10] It is somewhat surprising, however, that the overall ω value in the TM region (2.1712) is much higher than those in the CY (0.3131) and the EXT (0.6613) domains Actually, this is not a novel finding of this study A similar phenomenon was reported in primates [10] and ruminant [11], but no explanation was given Nevertheless, it seems irrational to explain this strange higher ω value with a strong signature of positive selection, because only two sites in this region were identified as candidates under positive selection, although with only one ML method (Table 2) Sliding window analysis also verified that most codons with higher ω values > were Shen et al BMC Evolutionary Biology 2012, 12:39 http://www.biomedcentral.com/1471-2148/12/39 scattered in the EXT domain, whereas only very few of such codons were found in the TM and CY domains Given that the TM domain was only 23 amino acids in length and only a very small number of candidate selective sites were identified with weak support, it is difficult to obtain an estimate with high statistical significance The highest ω value in the TM domain, therefore, was most likely a biased estimate or an artifact Species-specific pattern of positive selection Evolutionary analysis of cetacean TLR4 revealed an inconstant pattern of positive selection across the cetacean phylogeny, with different species of extant cetaceans (terminal branches in Figure 1) displaying contrasted selective pressures (Figure 1) What factors triggered or correlated with heterogeneity in the evolutionary rate of cetacean TLR4 will be an interesting question to answer To our knowledge, many life-history traits and species or population-level factors such as mating system, distribution area, habitat type, migration or dispersal pattern, and social structure, are different among cetacean species, and thus might have caused the variation in pathogen pressures and disease risks To avoid the problem of uncertainty in these factors along the long branches, we focused only on the extant cetacean species (terminal branches in Figure 1) Unfortunately, at present, due to insufficient understanding of these factors for different cetacean species, it is not possible for us to address their relationships with heterogeneity in the evolutionary rate of cetacean TLR4 using quantitative association analyses However, some preliminary direct comparisons between life-history traits or population-level factors and selective pressures suggest that a complex species-specific effect might have been an important mechanism to control the heterogeneity in the evolutionary rate of cetacean TLR4 For example, the two river dolphins examined in this study, namely, the Ganges river dolphin Platanista gangetica and the Yangtze river dolphin Lipotes vexillifer, both showed similarly lower ω values; however, two positively selective sites were identified in the former while no such site was detected in the latter In addition, a representative species from the most inshore shallow waters (the Indo-Pacific humpback dolphin) showed four sites under positive selection, which might imply the negative anthropogenic impacts (direct or indirect) in coastal waters on the immune system However, another species from coastal waters (the finless porpoise Neophocaena phocaenoides) did not display a similar enhanced selection over other offshore or oceanic species Furthermore, some closely related species showed significantly contrasted levels of selection For instance, oceanic dolphins within the family Delphinidae showed great divergence in evolutionary rates of TLR4, from nearly Page of 12 (bottlenose dolphin and long-beaked common dolphin Delphinus capensis) to 0.89 (the striped dolphin Stenella coeruleoalba) Although there is a tendency of group size increasing in delphinoids [37], there seems to be no strong effect on the evolution of TLR4, because no significant association between group sizes and ω values was found not only for all cetaceans but only for delphinids For this reason, it is necessary to further investigate this issue in the future, with an increasing uncovering of life history and population characteristics of different cetacean species, and a more comprehensive understanding of the molecular evolution of cetacean TLRs as well Conclusions In summary, our data presented in this study strongly suggest that TLR4 has undergone adaptive evolution against the background of purifying selection across cetacean enigmatic history of transition from land to full aquatic habitats and subsequent adaptive radiation in waters around the world Most sites under positive selection were found to fall in the LRR region of the EXT domain interacting with LPS, which was accordance with the hypothesis of an arms race between pathogens and vertebrate immune systems In addition, some preliminary direct comparisons between life-history traits or population-level factors and selective pressures suggest that a complex species-specific effect might have been an important mechanism to trigger the heterogeneity in the evolutionary rate of cetacean TLR4 Methods Samples and DNA sequencing Total genomic DNA was extracted from muscle and blood samples from 11 cetacean species (Additional file 1: Table S1) and a hippopotamus (Hippopotamus amphibius) using Dneasy Blood & Tissue Kit (Qiagen) according to the manufacturer’s instructions This research is compliant with the “Animal Research: Reporting In Vivo Experiments” (ARRIVE) guidelines Because these samples were collected from stranded or incidentally captured/killed animals in coastal China seas, ethical approval was not needed in such a situation Voucher specimens were preserved at Nanjing Normal University In addition, coding sequences of the sperm whale (Physeter catodon), killer whale (Orcinus orca), Pacific white-sided dolphin (Lagenorhynchus obliquidens), and water buffalo (Bubalus bubalis) were downloaded from GenBank with accession numbers AB500181, AB492857, AB492856 and HM469969, respectively, whereas the coding sequence of the pig (Sus scrofa) was retrieved from Ensemble Database with accession no ENSSSCG00000005503 Shen et al BMC Evolutionary Biology 2012, 12:39 http://www.biomedcentral.com/1471-2148/12/39 To amplify the ORF region of TLR4, we designed a series of overlapping primers (Additional file 3: Table S3) in conserved ORF regions searched with ORF Finder http://www.ncbi.nlm.nih.gov/gorf/ in the bottlenose dolphin (Tursiops truncatus) (Ensemble GeneScaffold_1465), dog (Canis familiaris) (Ensemble Gene ID ENSCAFG00000003518), and water buffalo (GenBank accession no HM469969) PCR mixtures (30 μl) contained 0.2 μmol of each primer, μl of 10× PCR buffer, 0.2 mmol of dNTP, unit of Taq polymerase (Takara), and 0.8 μl of genomic DNA The PCR condition was as follows: 95°C denaturation for min, then running 35 cycles of 95°C 30 s, 55-58°C 30 s, 72°C 40 s, and 72°C elongation for 10 PCR products were purified using a Gel Extraction Kit (Promega) and sequenced in both directions using ABI PRISM 3730 DNA Sequencer Statistical analysis The specificity of these newly generated sequences was examined by comparison with the published nucleotide database at GenBank by BLAST (NCBI) Protein sequences were aligned using FASTA [41] and Muscle vs3.7 [42] The nucleotide sequences and putative amino acid sequences were further aligned using MEGA4 [43] Phylogenetic relationships were reconstructed using Bayesian inference (BI) in MrBayes 3.1.2 [44] and the NJ method in MEGA4 In Bayesian analysis, the WAG model [45] was selected using Modeltest [46] Four Markov chains were run for 10 generations and were sampled every 100 generations to yield a posterior probability distribution of 104 trees The first 2000 trees were discarded as burn-in A three-dimensional (3D) domain structure of the cetacean TLR4 was predicted using CPHmodels-3.0 Server http://www.cbs.dtu.dk/services/CPHmodels/ Detections of positive selection Comparisons of nonsynonymous/synonymous substitution ratios (ω = dN/dS) has become a useful means for quantifying the impact of natural selection on molecular evolution [47,48] If ω = 1, amino acid substitutions may be largely neutral; ω > is evidence of positive selection, whereas ω < is consistent with purifying selection although the possibility of positive selection cannot be excluded in such a case However, the straightforward use of the ω ratio to detect positive selection, through direct calculation of dN and dS between sequences, has become rarely effective, because adaptive evolution most likely occurs at a few time points and at most times has an effect on only a few amino acids In such cases, the ω ratio averaged over time and over sites will not be significantly > 1, even if adaptive molecular evolution may have occurred [49] Thus, the codon-based maximum likelihood (CodeML) method in the PAML package [50] was used Page 10 of 12 to detect lineage- or site-specific selection Nested models were compared with critical values of the Chi square distribution using the LRT statistic (-2[LogLikelihood1 LogLikelihood2]), and degrees of freedom as the difference in the number of parameters were estimated with each model A model of codon frequencies, i.e F3 × 4, was used for the present analyses To check for convergence, all analyses were run twice, respectively using initial ω values of 0.5 and 1.5 To evaluate positive selection on TLR4 across the presently examined cetacean species, we first used site models implemented in the CodeML program in PAML version 4.0 [50], not allowing variation among lineages Models M1, M7, and M8a restricted sites with ω ≤ 1, whereas models M2 and M8 included a class of sites with ω > The sites with a posterior probability > 0.9 were considered as candidates for selection Then we used improved statistical methods in Datamonkey web server [51], which computed nonsynonymous and synonymous substitutions at each codon position to further evaluate the selection Three ML methods with default settings applied in this web were used: SLAC, REL, and FEL SLAC, which calculates the expected and observed numbers of synonymous and nonsynonymous substitutions to infer selection, is a conservative test FEL directly estimates dN and dS based on a codon-substitution model, whereas REL, allowing the synonymous and nonsynonymous substitution rates to vary among codon sites [52], uses the Bayes factors to determine a site as selected The default settings with significance levels of 0.1 for SLAC and 0.2 for FEL were used Bayes factor > 50 for REL was implemented Normally, REL is more powerful than SLAC and FEL, but it has the highest rate of false positives [52] These three predictions were conducted using the HKY85 model, which is thought to perform well for a low number of sequences [13] To detect the independent ω ratio for each branch of the tree, a free-ratio model was run with CodeML in PAML version 4, which allows each branch to have a separate dN/dS [50] This involves as many ω parameters as the number of branches in the tree and is parameter-rich for a tree of many species, which is applicable only to a small data set [53] Positive selection was further detected with the improved branch-site likelihood method as described in Zhang et al [35] This test appeared to be conservative overall, but exhibited better power than did the branchbased test This is a simple modification to the branchsite model proposed by Yang and Nielsen [54] and was used to construct two new LRTs, referred to as test and test Test is unable to reliably distinguish between positive selection and relaxed constraint on the foreground branches, whereas test can accurately distinguish between them and thus often has stronger Shen et al BMC Evolutionary Biology 2012, 12:39 http://www.biomedcentral.com/1471-2148/12/39 power than test in detecting positive selection It is worth noting that when positive selection operates episodically on a few amino acid sites, the signal may be masked by negative selection Especially if positive selection has affected only one lineage or a very few lineages on the tree, the tested-positive selection at any single site may not be strong enough for the BEB probability to reach high levels In this case, however, in this case, Zhang et al [35] still suggested the use of this method to detect positive selection even if the affected sites cannot be reliably inferred The amino acid changes that occurred in the positively selected sites were inferred using maximum parsimony by Mesquite [55] We marked the positively selective sites detected by more than one ML method (Table 2) and those detected by the branch-site model (Additional file 2: Table S2) onto the phylogenetic tree (Figure 1) to observe the distribution of these sites across cetacean phylogeny To further visualize variation of ω at TLR4 across cetacean phylogeny, we undertook a sliding window analysis using the software SWAAP1.0.2 [56], with window size at 60 bp (20 codons) and step size at 15 bp (5 codons) In addition, the ω value in each of three domains, i.e., the EXT, TM, and CY, was estimated using model M0 to evaluate the relative extent of functional constraint among these domains The domains were identified with Motifscan http://myhits.isb-sib.ch/ cgi-bin/motif_scan[57] and Simple Modular Architecture Research Tool http://smart.embl-heidelberg.de/[58] To gain insight into the functional significance of the putatively selected sites, we also constructed the 3D structure of this protein and mapped selective sites onto it Analysis of associations between ω and group size A linear regression analysis was performed with R [59] to assess association between selection on TLR4 (terminal branch’s ω (dN/dS) of the tree) and group sizes of cetaceans derived from May-Collado et al [37] Fourteen cetacean species with available data were included in this analysis We calculated independent ω ratio for each branch of the tree by free-ratio model with CodeML in PAML version Additional material Additional file 1: Table S1 Information about 17 representative cetaceans and some relative even-toed ungulates Additional file 2: Table S2 Detailing the results of branch-site model analysis for positive selection at cetacean TLR4 Additional file 3: Table S3 Primers amplifying the complete ORF of representative cetaceans and some relative even-toed ungulates TLR4 Page 11 of 12 Abbreviations TLRs: Toll-like receptors; LPS: Lipopolysaccharides; MD-2: Myeloid differentiation factor 2; PRRs: Pattern recognition receptors; ORF: Open reading frame; CodeML: Codon-based maximum likelihood; SLAC: Single likelihood ancestor counting; REL: Random effects likelihood; FEL: Fixed effects likelihood; BEB: Bayes empirical Bayes; LRT: Likelihood ratio test; PAML: Phylogenetic Analysis by Maximum Likelihood; ML: Maximum Likelihood; NJ: Neighbor-Joining; EXT: Extracellular domain; TM: Transmembrane domain; CY: Cytoplasmic domain; MYA: Million years ago; 3D: Three-dimensional Acknowledgements This research was financially supported by the National Natural Science Foundation of China (NSFC) grant nos 30830016 and 31172069 to GY, and the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD) to GY and SX We thank Dr Anli Gao, Prof Qing Chang, Mr Xinrong Xu, and some students who have ever studied or are studying at NJNU for their assistance in sample collection Authors’ contributions GY conceived and designed the study, helped to perform data analyses and improve the manuscript TS and SX performed the experiment and data analysis, and drafted the manuscript XW helped to perform experiment WY helped to perform data analysis KZ helped to improve the manuscript All authors read and approved the final manuscript Received: 22 November 2011 Accepted: 24 March 2012 Published: 24 March 2012 References Medzhitov R, Janeway CA Jr: Innate Immunity: The virtues of a nonclonal system of recognition Cell 1997, 91:295-298 Smith KD, Andersen-Nissen E, Hayashi F, Strobe K, Bergman MA, Barrett SL, Cookson BT, Aderem A: Toll-like receptor recognizes a conserved site on flagellin required for protofilament formation and bacterial motility Nat Immunol 2004, 4:1247-1253 Mukherjee S, Sarkar-Roy N, Wagener DK, Majumder PP: Signatures of natural selection are not uniform across genes of innate immune system, but purifying selection is the dominant signature Proc Natl Acad Sci USA 2009, 106:7073-7078 Barreiro LB, Ben-Ali M, Quach H, Laval G, Patin E, Pickrell JK, Bouchier C, Tichit M, Neyrolles O, Gicquel B, Kidd JR, Kidd KK, Alcais A, Ragimbeau J, Pellegrini S, Abel L, Casanova JL, Quintana-Murci L: Evolutionary dynamics of human Toll-like receptors and their different contributions to host defense PLoS Genet 2009, 5:e1000562 Sackton TB, Lazzaro BP, Schlenke TA, Evans JD, Hultmark D, Clark AG: Dynamic evolution of the innate immune system in Drosophila Nat Genet 2007, 39:1461-1468 Ferrer-Admetlla A, Bosch E, Sikora M, Marque’s-Bonet T, Ramı’rez-Soriano A, Muntasell A, Navarro A, Lazarus R, Calafell F, Bertranpetit J, Casals F: Balancing selection is the main force shaping the evolution of innate immunity genes J Immunol 2008, 181:1315-1322 Arora P, Porcelli SA: A glycan shield for bacterial sphingolipids Chem Biol 2008, 15:642-644 Nakajima T, Ohtani H, Satta Y, Uno Y, Akari H, Ishida T, Kimura A: Natural selection in the TLR-related genes in the course of primate evolution Immunogenetics 2008, 60:727-735 Wlasiuk G, Khan S, Switzer WM, Nachman MW: A history of recurrent positive selection at the toll-like receptor in primates Mol Biol Evol 2009, 26:937-949 10 Wlasiuk G, Nachman MW: Adaptation and constraint at toll-like receptors in primates Mol Biol Evol 2010, 27:2172-2186 11 Jann OC, Werling D, Chang JS, Haig D, Glass EJ: Molecular evolution of bovine Toll-like receptor suggests substitutions of functional relevance BMC Evol Biol 2008, 8:288 12 Yilmaz A, Shen S, Adelson DL, Xavier S, Zhu JJ: Identification and sequence analysis of chicken Toll-like receptors Immunogenetics 2005, 56:743-753 13 Alcaide M, Edwards SV: Molecular evolution of the Toll-like receptor multigene family in birds Mol Biol Evol 2011, 28:1703-1715 Shen et al BMC Evolutionary Biology 2012, 12:39 http://www.biomedcentral.com/1471-2148/12/39 14 Chen JS, Wang TY, Tzeng TD, Wang CY, Wang D: Evidence for positive selection in the TLR9 gene of teleosts Fish Shellfish Immunol 2008, 24:234-242 15 Ortiz M, Kaessmann H, Zhang K, Bashirova A, Carrington M, QuintanaMurci L, Telenti A: The evolutionary history of the CD209 (DC-SIGN) family in humans and nonhuman primates Genes and Immunity 2008, 9:483-492 16 Thewissen JGM, Cooper LN, Clementz MT, Bajpai S, Tiwari BN: Whales originated from aquatic artiodactyls in the Eocene epoch of India Nature 2007, 450:1190-1194 17 Nikaido M, Rooney AP, Okada N: Phylogenetic relationships among cetartiodactyls based on insertions of short and long interspersed elements: hippopotamuses are the closest extant relatives of whales Proc Natl Acad Sci USA 1999, 96:10261-10266 18 Thewissen JGM, Cooper LN, George JC, Bajpai S: From land to water: the origin of whales, dolphins, and porpoises Evol Edu Outreach 2009, 2:272-288 19 Williams EM: Synopsis of the earliest cetaceans In The emergence of whales: evolutionary patterns in the origin of Cetacea Edited by: Thewissen JGM New York: Plenum; 1998:1-28 20 Uhen M: The origin (s) of whales Annu Rev Earth Planet Sci 2010, 38:189-219 21 Geisler JH, Sanders AE: Morphological evidence for the phylogeny of Cetacea J Mammal Evol 2003, 10:23-129 22 Thewissen JGM, Williams EM: The early evolution of Cetacea (whales, dolphins, and porpoises) Ann Rev Ecol Syst 2002, 33:73-90 23 Fordyce RE, de Muizon C: Evolutionary history of cetaceans: a review In Secondary adaptation of tetrapods to life in water Edited by: Mazin JM, de Buffr’enil V 2001, 169-233 24 Steeman M, Hebsgaard MB, Fordyce RE, Ho SYW, Rabosky DL, Nielsen R, Rahbek C, Glenner H: SØrensen MV, Willerslev E: Radiation of extant cetaceans driven by restructuring of the oceans Syst Biol 2009, 58:573-585 25 Rice DW: Marine mammals of the world: systematics and distribution., 1998, Society of Marine Mammalogy Special Publication Number 26 Hoelzel AR: Marine mammal biology: an evolutionary approach Oxford, United Kingdom: Blackwell Publishing Ltd.; 2002 27 Slater GJ, Price SA, Santini F, Alfaro ME: Diversity versus disparity and the radiation of modern cetaceans Proc R Soc B 2010, 277:3097-3104 28 Poltorak A, He X, Smirnova I, Liu MY, Huffel CV, Du X, Birdwell D, Alejos E, Silva M, Galanos C, Freudenberg M, Ricciardi-Castagnoli P, Layton B, Beutler B: Defective LPS signaling in C3H/HeJ and C57BL/10ScCr mice: mutations in Tlr4 gene Science 1998, 282:2085-2088 29 Kumar H, Kawai T, Akira S: Pathogen recognition in the innate immune response Biochem J 2009, 420:1-16 30 da Silva Correia J, Soldau K, Christen U, Tobias PS, Ulevitch RJ: Lipopolysaccharide is in close proximity to each of the proteins in its membrane receptor complex Transfer from cd14 to tlr4 and md-2 J Biol Chem 2001, 276:21129-21135 31 Agnarsson I, May-Collado LJ: The phylogeny of Cetartiodactyla: the importance of dense taxon sampling, missing data, and the remarkable promise of cytochrome b to provide reliable species-level phylogenies Mol Phylogenet Evol 2008, 48:964-985 32 Xiong Y, Brandley MC, Xu S, Zhou K, Yang G: Seven new dolphin mitochondrial genomes and a time-calibrated phylogeny of whales BMC Evol Biol 2009, 9:20 33 Zhou X, Xu S, Yang Y, Zhou K, Yang G: Phylogenomic analyses and improved resolution of Cetartiodactyla Mol Phyloget Evol 2011, 61:255-264 34 Yampolsky LY, Stoltzfus A: The exchangeability of amino acids in proteins Genetics 2005, 170:1459-1472 35 Zhang J, Nielsen R, Yang Z: Evaluation of an improved branch-site likelihood method for detecting positive selection at the molecular level Mol Biol Evol 2005, 22:1-8 36 Shishido R, Ohishia K, Suzukic R, Takishitaa K, Ohtsud D, Okutsud K, Tokutaked K, Katsumatae E, Bandof T, Fujise Y, Murayamag T, Maruyamaa T: Cetacean Toll-like receptor and myeloid differentiation factor 2, and possible cetacean-specific responses against Gram-negative bacteria Comp Immunol Microb 2010, 33:89-98 Page 12 of 12 37 May-Collado LJ, Agnarsson I, Wartzok D: Phylogenetic review of tonal sound production in whales in relation to sociality BMC Evol Biol 2007, 7:136 38 Gatesy J: More DNA support for a Cetacea/Hippopotamidae clade: the blood-clotting protein gene gamma-fibrinogen Mol Biol Evol 1997, 14:537-543 39 McGowen MR, Spaulding M, Gatesy J: Divergence date estimation and a comprehensive molecular tree of extant cetaceans Mol Phylogenet Evol 2009, 53:891-906 40 Janeway CA Jr, Medzhitov R: Innate immune recognition Annu Rev Immunol 2002, 20:197-216 41 Pearson WR, Lipman DJ: Improved tools for biological sequence comparison Proc Natl Acad Sci USA 1988, 85:2444-2448 42 Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput Nucleic Acids Res 2004, 32:1792-1797 43 Kumar S, Dudley J, Nei M, Tamura K: MEGA: a biologistcentric software for evolutionary analysis of DNA and protein sequences Brief Bioinform 2008, 9:299-306 44 Ronquist F, Huelsenbeck JP: MRBAYES 3: Bayesian phylogenetic inference under mixed models Bioinformatics 2003, 19:1572-1574 45 Whelan S, Goldman N: A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach Mol Biol Evol 2001, 18:691-699 46 Posada D: jModelTest: phylogenetic model averaging Mol Biol Evol 2008, 25:1253-1256 47 Kimura M: The neutral theory of molecular evolution New York: Cambridge University Press; 1983 48 Ohta T: The nearly neutral theory of molecular evolution Ann Rev Ecol Syst 1992, 23:263-286 49 Yang Z, Nielsen R, Goldman N, Pedersen AMK: Codon-substitution models for heterogeneous selection pressure at amino acid sites Genetics 2000, 155:431-449 50 Yang Z: PAML 4: phylogenetic analysis by maximum likelihood Mol Biol Evol 2007, 24:1586-1591 51 Pond SLK, Frost SDW: Datamonkey: rapid detection of selective pressure on individual sites of codon alignments Bioinformatics 2005, 21:2531-2533 52 Pond SLK, Frost SDW: Not so different after all: a comparison of methods for detecting amino acid sites under selection Mol Biol Evol 2005, 22:1208-1222 53 Yang Z: Likelihood ratio tests for detecting positive selection and application to primate lysozyme evolution Mol Biol Evol 1998, 15:568-573 54 Yang Z, Nielsen R: Codon-substitution models for detecting molecular adaptation at individual sites along specific lineages Mol Biol Evol 2002, 19:908-917 55 Maddison WP, Maddison DR: Mesquite: A modular system for evolutionary analysis, version 1.01.[http://mesquiteproject.org] 56 Pride DT: SWAAP: a tool for analyzing substitutions and similarity in multiple alignments Version 1.0.2.[http://www.bacteriamuseum.org/ SWAAP/SwaapPage.htm] 57 Hulo N, Bairoch A, Bulliard V, Cerutti L, Cuche BA, Castro Ed, Lachaize C, Langendijk-Genevaux PS, Sigrist CJA: The 20 years of PROSITE Nucleic Acids Res 2008, 36:245-249 58 Schultz J, Copley RR, Doerks T, Ponting CP, Bork P: SMART: a web-based tool for the study of genetically mobile domains Nucleic Acids Res 2000, 28:231-234 59 R Development Core Team: R: A language and environment for statistical computing R Foundation for Statistical Computing, Vienna, Austria; 2010, ISBN 3-900051-07-0, URL http://www.R-project.org/ doi:10.1186/1471-2148-12-39 Cite this article as: Shen et al.: Adaptive evolution and functional constraint at TLR4 during the secondary aquatic adaptation and diversification of cetaceans BMC Evolutionary Biology 2012 12:39 ... this article as: Shen et al.: Adaptive evolution and functional constraint at TLR4 during the secondary aquatic adaptation and diversification of cetaceans BMC Evolutionary Biology 2012 12:39... cetacean evolutionary history The aims of this study were 1) to find evidence of positive selection at TLR4 in cetacean origin and evolution, and 2) to evaluate whether the evolutionary rate of TLR4. .. Strong adaptive evolution of TLR4 during the habitat shift from land to water The present study revealed that the branch leading to whale + hippo was under the strongest positive selection at TLR4,