- Open Access
History of a prolific family: the Hes/Hey-related genes of the annelid Platynereis
EvoDevo volume 5, Article number: 29 (2014)
The Hes superfamily or Hes/Hey-related genes encompass a variety of metazoan-specific bHLH genes, with somewhat fuzzy phylogenetic relationships. Hes superfamily members are involved in a variety of major developmental mechanisms in metazoans, notably in neurogenesis and segmentation processes, in which they often act as direct effector genes of the Notch signaling pathway.
We have investigated the molecular and functional evolution of the Hes superfamily in metazoans using the lophotrochozoan Platynereis dumerilii as model. Our phylogenetic analyses of more than 200 Metazoan Hes/Hey-related genes revealed the presence of five families, three of them (Hes, Hey and Helt) being pan-metazoan. Those families were likely composed of a unique representative in the last common metazoan ancestor. The evolution of the Hes family was shaped by many independent lineage specific tandem duplication events. The expression patterns of 13 of the 15 Hes/Hey-related genes in Platynereis indicate a broad functional diversification. Nevertheless, a majority of these genes are involved in two crucial developmental processes in annelids: neurogenesis and segmentation, resembling functions highlighted in other animal models.
Combining phylogenetic and expression data, our study suggests an unusual evolutionary history for the Hes superfamily. An ancestral multifunctional annelid Hes gene may have undergone multiples rounds of duplication-degeneration-complementation processes in the lineage leading to Platynereis, each gene copies ensuring their maintenance in the genome by subfunctionalisation. Similar but independent waves of duplications are at the origin of the multiplicity of Hes genes in other metazoan lineages.
The basic helix-loop-helix (bHLH) protein superfamily comprises an ancient class of eukaryotic transcription factors (TFs) that are found in fungi, plants and metazoans . These TFs are defined by the presence of a bHLH domain that is, a DNA-binding basic region (b) followed by two α-helices separated by a variable loop region (HLH), that serves as a dimerization domain and as a platform for protein interactions . The bHLH superfamily is considered to be subdivided into 6 higher-order groups (named A to F) composed of evolutionarily related families of orthologous genes that share structural and biochemical properties. Among them, the Hes (Hairy/enhancer of Split) and the Hey (Hairy/Enhancer of Split related with YRPW motif) genes belong to two closely related families among the group E , and possess an additional protein-protein interaction domain, the Orange domain required for their function as transcriptional regulators . Another molecular property of the HES/HEY proteins is the presence of a C-terminal tetrapeptide motif (WRPW or YRPW), which is known for HES proteins to recruit co-repressors of the groucho/TLE1-4 family [4, 5]. The Hes and Hey families include the well-known HAIRY, HAIRY-related, ENHANCER OF SPLIT proteins of Drosophila and the numerous mammalian HES and HEY proteins, as well as several other related proteins such as HERP, HEYL, HELT, HESL, DEC1, and DEC2 whose mutual relationships and relationships with HES and HEY proteins are still poorly understood [5–9]. This lack of knowledge often results in a confusing, non-consensual nomenclature of these genes.
These HES/HEY-related proteins are involved in a broad variety of molecular and developmental mechanisms across metazoans. They function as DNA-binding transcriptional repressors that control cell fate decisions in several contexts. These proteins are often, but not always, found as direct effector genes of the Notch signaling pathway [10–13]. This pathway, a direct juxtacrine signaling system, is involved in the control of cell identity, proliferation, differentiation and apoptosis in animals (see reviews [14–19]).
In deuterostomes and ecdysozoans, Hes/Hey- related genes are involved in crucial developmental events, in particular nervous system (NS) patterning and segment formation [20, 21]. In mammalians, for example, Hes genes (notably Hes1, Hes3 and Hes5) play an essential role in neural development by regulating proliferation, differentiation and specification of neural stem cells in both Notch-dependent and -independent manners . These genes are also involved in regulating the maintenance of boundaries, which partition the NS into many compartments in a Notch-independent way [12, 22]. Still in mouse, another Hes-like member, HeyL promotes neuronal differentiation of brain neural progenitor cells through the control of the BMP signaling, . Hes7 and Hes1 are also key elements of the mouse molecular clock that, through the control of Notch, induce somite formation and are periodically expressed in anterograde wave-like fashion in the presomitic mesoderm (PSM), each wave leading to the generation of a pair of somites [22, 24–26]. Other roles of Hes genes in mouse have been evidenced, such as regulating the maintenance of stem cells in digestive organs , the development of sensory organs (eye, inner ear)  and a critical role of Hey genes (Hey1, Hey2, HeyL) in the development of the cardiovascular system [5, 27] in a Notch-dependent manner. Similar roles for Hes/Her/Hey genes in zebrafish and chick have been documented [12, 25, 26].
In Drosophila, the Hairy gene is involved in segmentation, during which it acts as a primary pair-rule gene required for the establishment of segments  but it also helps in defining the pattern of sensory bristles by repressing the formation of sense organ precursors , in a Notch-independent way in both cases [28–30]. In contrast, the genes of the Enhancer of split (Espl) complex mediate the effects of Notch signaling in a process named lateral inhibition, during embryonic and adult neurogenesis. Activation of the Espl genes (except m1) blocks the accumulation of large amounts of proneural protein in most cells of the proneural clusters, preventing them from adopting a neural fate [31, 32]. The Hes family gene deadpan, have been shown to regulate the self-renewal and specification of Drosophila neural stem cells, and to be involved in sex determination, both independently of Notch [33, 34]. Drosophila Hey participates in alternative neuronal fate establishment during asymmetric divisions, both in a Notch-dependent and -independent manner . In long germ-band arthropods, such as the spider Cupiennius salei[36, 37], the myriapod Strigamia maritima, and the cockroach Periplaneta americana, some Hairy- related genes are expressed in segmental patterns through the control of Notch signaling, suggesting a role in the segmentation process, while in the short germ-band Tribolium castaneum a Notch-independent expression of Hairy is observed [40, 41]. In the nematode Caenorhabditis elegans, a unique gene closely related to Hes/Hey, named lin-22 was reported to be involved in patterning the peripheral NS (PNS), in a Notch-independent manner . In addition, the members of the Ref-1 family that encode unusual proteins containing two distinct bHLH domains, may be very divergent relatives of Hes/Hey genes and mainly mediate Notch signaling in various developmental processes, although Notch-independent expressions are also observed .
In lophotrochozoans, a major clade of protostomes often neglected in evolutionary developmental biology studies, the few data on Hes genes available so far mainly come from annelids. In the leech Helobdella robusta, Hes gene is expressed in the stem and progenitor cells (teloblasts and blast cells) of the posterior addition zone , under the control of Notch, and may be implicated in posterior elongation and segment formation [45, 46]. In the polychaete Capitella teleta, three Hes genes have been shown to be expressed in a variety of embryonic territories. They are possibly regulated by Notch-dependent and -independent mechanisms, depending on the expression territories concerned . The three genes are expressed in the posterior addition zone of the juvenile worm, which is responsible for the addition of segments. Two of the genes, Cte-Hes2 and 3, are also expressed in the brain and in the elongated trunk and Cte-Hes2 is, in addition, expressed in the presumptive chaetal sacs, at the origin of the chaetae of the appendages, suggesting a role for Hes genes in neurogenesis, chaetogenesis and segmentation .
In non bilaterian metazoans, the roles of Hes genes have only been explored in cnidarians. In the anthozoan Nematostella vectensis, seven Hes/Hey-like genes have been reported to be expressed in a variety of territories, with distinct expression domains whose union seems to recapitulate the expression of the Notch receptor . Blocking Notch signaling using small molecule inhibitors suggests that four of the Hes genes are targets of Notch signalling and are involved in cnidogenesis and neurogenesis . Studies in adults and during budding of the hydrozoan Hydra suggest that Notch has a role in germ and nematocyte cell differentiation , as well as in boundary formation in the forming bud, via the regulation of the expression of HyHes (the only Hes reported in Hydra) . In demosponges (Porifera), one Hey gene was identified in the genome of Amphimedon queenslandica and recently, Hes genes were reported (by blast searches only) in the transcriptomes of two others demosponges (but without phylogenetic analyses confirming their assignment ).
In addition to the aforementioned studies aimed at defining the expressions and functions of Hes and Hey genes, there have been several analyses describing the genomic repertoire of these genes in various animals. These studies have shown a surprisingly variable number of Hes/Hey genes in these species and have suggested the occurrence of species- or lineage-specific duplications, for example, in the cnidarian Nematostella vectensis, the fruitfly Drosophila melanogaster, and the amphioxus Branchiostoma floridae[52, 53]. Attempts to resolve the evolutionary relationships among Hes/Hey families members have so far focused on vertebrates [6, 11] or insects . A recent survey of Hes/Hey genes in 17 metazoan species (mainly vertebrates, plus 2 non-bilaterian species) has led authors to suggest that Hey genes were already present in the last common ancestor of metazoans, whereas Hes genes would have arisen in the stem lineage of Eumetazoans [2, 6]. The authors of this study proposed a scenario with two rounds of expansions of this gene family, in the common ancestor of animals and vertebrates, respectively . All these studies were, however, hampered by a dataset of taxa that are poorly representative of the metazoan diversity and by poor statistical support of phylogenetic trees . Whereas the Hes and Hey genes are robustly separated into distinct clades, relationships among Hes genes are poorly resolved. A number of vertebrate sub-classes have been proposed recently and named HesL, DEC1/2, Hes1/4, Hes2, Hes3, Hes5, Hes6, Hes7 but it is unclear whether any of these sub-classes arose before the separation of the vertebrate lineage.
In this paper, we try to unravel several issues concerning the molecular evolution and functions of the Hes/Hey-related genes. (i) When and how did the multiplicity of Hes/Hey-related genes arise in the metazoan tree and how many families can be defined among them? (ii) Why have so many copies of Hes/Hey-related genes been conserved in the course of evolution? (iii) When and how have the multiple functions of Hes/Hey-related genes been acquired during metazoan evolution?
To gain insights into these questions, we studied the Hes, Hey and their related genes in the lophotrochozoan Platynereis dumerilii. Over the past decade, the annelid Platynereis has become a valuable model for evolutionary developmental biology studies. Importantly a number of comparative genomic studies have suggested that Platynereis is descending from a slow-evolving lineage and has therefore retained many ancestral bilaterian features including the ancestral composition of multigene families [54–57]. We identified a large family of 15 Hes/Hey- related genes in Platynereis. To determine whether these numerous Platynereis genes represent an ancestral bilaterian gene family or an independent gene radiation in the annelid lineage, we investigated broadly the origin and evolution of Hes/Hey genes and their distinct sub-families in animals. As a clear improvement compared to earlier studies, we sampled extensively animal lineages that branch outside bilaterians (cnidarians, ctenophores, sponges) to decipher the early steps of the family evolution in metazoans. We also sampled several lophotrochozoan species genomes, a bilaterian branch often neglected in phylogenomic studies. Our detailed phylogenetic analyses of more than 200 HES/HEY-related proteins show that three subfamilies (Hes, Hey and Helt) are pan-metazoan whereas two others seem to be restricted to protostomes (Stich) and chordates (Dec). Phylogenetic as well as genetic linkages analyses support the hypothesis of multiple independent Hes tandem duplications in almost each metazoan phylum, including in the Platynereis lineage. To test whether related Platynereis genes in the tree share similar expression patterns during embryogenesis; we determined the expression patterns through embryonic/larval development as well as during juvenile posterior elongation. We show that these genes are expressed in a wide variety of expression domains, (that is, mesodermal tissues, segments, NS). We discuss the possibility that Platynereis Hes/Hey- related genes, after duplication from a single ancestor, underwent a process of divergence by either neofunctionalization, that is, the random acquisition of a new function in the course of the accumulation of neutral mutation in duplicated genes  or subfunctionalization via the duplication-degeneration-complementation (DDC) model [58–61]. In the latter, it was postulated that degenerate mutations affect the gene functions, rendering neither copy alone sufficient to perform the ancestral functions and resulting in the partitioning of these ancestral functions in each paralogous copy .
Animal culture and collection
Platynereis embryos and juveniles were obtained from a breeding culture established in the Institut Jacques Monod (Paris), according to the protocol of Dorresteijn et al. . Staging of the embryos was done following Fischer et al. . Posterior parts of atokous worms regenerated 11 days after caudal amputation were obtained as previously described . Embryos and larvae, as well as atokous worms 11 days after caudal amputation were fixed in 4% paraformaldehyde (PFA), 1 × PBS, 0.1% Tween20 and stored at -20°C in methanol 100% .
Survey of Platynereis dumerilii Hes/Hey-related genes: identification, intron positions and cloning
Platynereis Hes/Hey- related genes were identified by sequence similarity searches against large collections of expressed sequence tags (ESTs) and genomic sequences (Platynereis resources, 4dx.embl.de/platy/, D Arendt, personal communication)  using Drosophila and/or vertebrate genes as query. Complete coding sequences were assembled from EST fragments using CodonCode Aligner (CodonCode Corporation, USA). For each Platynereis gene, putative exons positions were mapped on genomic DNA by comparison with ESTs using Artemis . Large gene fragments were subsequently cloned by PCR using sequence-specific primers on cDNAs from mixed larval stages (primer sequences and PCR conditions are available upon request). PCR products were TA cloned into the PCR2.1 vector following the manufacturer’s instructions (Invitrogen, France) and sequenced. Partial cDNA obtained were then used as templates to produce RNA antisense probes for whole-mount in situ hybridization (WMISH) using Roche (France) reagents. Orthology relationships were defined using as criteria sequence similarities, presence of specific domains and phylogenetic analyses (see below). The fifteen newly identified Platynereis genes sequences were deposited in Genbank [KC999039 to KC999053].
Data sources, sequence retrieving and domains composition
Hes/Hey gene searches were carried out using the tblastn or blastp algorithms  implemented in ngKlast (Korilog V 4.0, Questembert, France) with Drosophila, vertebrate and Nematostella proteins as query sequences, with the default BLAST parameters and a low cutoff E-value threshold of 0.1, against 24 genome datasets. Lists of BLAST hits were then reciprocally BLASTed against the human proteins dataset of the NCBI database to extract sequences related to the Hes/Hey family (reciprocal best hits ). Those genomes correspond to 24 metazoan species representatives of the main lineages of animals: Porifera, Ctenophora, Cnidaria, Placozoa, Lophotrochozoa, Ecdysozoa and Deuterostomia. For each species, we screened the genome assembly, the predicted protein sequence dataset and transcriptomes when available. Concerning the sponges, we concatenated a chimeric dataset from two Oscarella species: Oscarella carmela from which the genome is accessible and an undescribed Oscarella specimen (Oscarella sp.) from which only EST were available. Indeed, the Hes repertory of Oscarella carmela lacks several representatives that were present in the EST dataset of another Oscarella and their addition are critical to understand the origin of the Hes family.
The presence of the Hes/Hey-related specific protein domains (that is, bHLH, Orange and WRPW peptides) was systematically checked by scanning sequences with both NCBI Conserved Domain search option V3.10  and InterProscan V.42 online software . An important proportion of the Hes predicted sequences (1/4 roughly) do not harbor an Orange domain. We tried to ensure that these Orange domains are genuinely missing by checking predicted sequences against genomic scaffolds and screening specifically for Orange sequences. However, in the absence of exhaustive transcriptome data in some species, we cannot exclude that in a limited number of cases, the lack of an Orange domain results from faulty sequence prediction. Last, a complete list of genomic scaffolds carrying the predicted sequences was produced for each species and the presence of genomic clusters was established for a number of them.
The predicted amino-acid sequences of the identified Platynereis gene fragments were aligned with their presumptive orthologs from 24 metazoan species. Two group-B bHLH members: sterol regulatory element-binding protein (SREBP) and microphthalmia-associated transcription factor (Mitf) were selected as the outgroup in order to root the Hes/Hey- related tree. Only the bHLH domains of those sequences were included and aligned for two species: Danio rerio and Lottia gigantea. Alignments were performed with MUSCLE 3.7 online [72, 73] under default parameters and adjusted manually in Bioedit . Only parts of the alignments corresponding to the bHLH, Orange and WRPW peptides, when presents (112 amino acids altogether) were used for the phylogenetic analyses. Two datasets were used for the analyses, one including all sequences for all species (n = 201) and the other containing only sequences where both domains (bHLH and Orange) were identified (n = 154). The phylogenetic trees were constructed using two different approaches: the maximum likelihood (ML) and the Bayesian analyses. ML analyses were performed with the PHYML 3.0 program under an LG model of amino acid substitution , a model that was shown to be the most efficient. To take into account rate variation among sites, we computed likelihood values by using an estimated gamma law with six substitution rate categories and we let the program evaluate the proportion of invariant sites. Statistical support for the different nodes was assessed by both the approximate likelihood ratio test (aLRT)  and bootstrap (BP) analysis  with 500 replicates. Bayesian analysis was performed with MrBayes 3.2.1 , using the WAG fixed model, as the LG model is not available. Two sets of six independent simultaneous metropolis-couples Markov chains Monte Carlo were run for 10 and 20 million generations (for the restricted and all inclusive alignments respectively) and sampled every 500th generation. We estimated that convergence was obtained if the average standard deviation of split frequencies reached a threshold value of 0.05. The trees obtained were mixed and an adequate burn-in was removed (above 25% of tree and parameters). Bayesian posterior probabilities (PP) were used for assessing the confidence value of each node . Phylogenetic trees were visualized, rooted and edited using FigTree V.1.4.0 . The tree topologies showed are from the ML analysis and all nodes, even moderately supported ones, were conserved because taxa number and composition, in addition to statistical support, are keys to discussing the validity of nodes in such a broad phylogenetic analysis.
We also performed parsimony reconstruction of character evolution based on a consensus Metazoan phylogenetic tree using Mesquite software version 2.72 . Character used in those analyses is the number of gene per species that was encoded in a character matrix. Analyses were performed for Hes and Hey family as well as more than 40 other bHLH families, based on previously published datasets . Sampling of the precedent paper differs slightly from this study, implying the presence of missing data in the character matrix.
Visualization of Platynereis HES expression patterns by whole mount in situ hybridization
Single NBT/BCIP whole-mount in situ hybridization was performed as previously described  on five larval stages (24, 33, 48, 55 and 72 h post fertilization (hpf)) and during post-embryonic posterior elongation. For the latter, we performed WMISH on worms 11 days after posterior amputation as post-caudal regeneration posterior elongation is a proxy to normal posterior elongation . Bright-field images were taken on a Leica microscope. Adjustments of brightness, contrast and Z projections were performed using the ImageJ and Photoshop software.
Results and discussion
Origin and evolution of the Hes superfamily
The Hes superfamily in Platynereis
Exhaustive searches on the genome of Platynereis complemented with several EST datasets led us to identify no less than 15 Hes/Hey-related genes coding for proteins of various lengths: from 215 to 642 amino-acids. While all of them possess the conventional bHLH domain, four genes (Pdu-Hes10; Pdu-Hes11; Pdu-Hes12; Pdu-Hes13) lack the Orange domain (Figure 1). These absences represent presumably secondary evolutionary losses of an ancestral Orange domain although we cannot exclude the possibility of a non-perfect assembly of the genome that could impair our domain predictions. All but Pdu-Stich possess the WRPW terminal domain, modified in WQPW in Pdu-Hes9 and in YRPW in Pdu-Hey.
In Pdu-Hes1 to Pdu-Hes10, a conserved pattern of intron positions is observed, two of them being located at exactly homologous positions in the bHLH coding sequence, while the third one is situated between bHLH and Orange coding sequences (Figure 1). In Pdu-Hes11, Pdu-Hes12 and Pdu-Hes13, lacking the Orange domain, only the first two introns (in the bHLH domain) are found, whereas Pdu-Hey and Pdu-Stich harbor only the second homologous intron (Figure 1). Pdu-Hey and Pdu-Stich are peculiar with respectively three and six introns, only one of which is in shared positions with other Platynereis Hes-related genes.
This high number of Hes/Hey-related gene copies in the Platynereis genome is somewhat surprising, given the evolutionary conservatism displayed in other gene families such as the Wnt, Hox and bHLH  genes. A number of other metazoans share a high number of Hes-related genes. This assessment led us to question the evolutionary origin of such diversity, to shed light on this issue and prompted us to extend our genomic analyses to the scale of the whole metazoan clade.
The Hes superfamily in Metazoa consists of three pan-metazoan families: Hes, Hey and Helt
We performed a detailed search of Hes/Hey-related genes in metazoan species representatives of all main metazoan lineages (that is Deuterostomia, Ecdysozoa, Lophotrochozoa, Ctenophora, Cnidaria, Placozoa and Porifera). Details of species used, genomic resources access, sequences names, domains presence or absence as well as scaffold/chromosomes numbers where the sequences are located (when available), are presented in the Additional file 1. We especially surveyed lophotrochozoan and non-bilaterian species as they have been neglected in earlier studies and are especially informative on bilaterian, eumetazoan and metazoan ancestral states, respectively.
In a first approach, we built ML and bayesian trees of the complete dataset, that is including those sequences for which no Orange domain was found (Figure 2, Table 1). As the evolution of the Hes/Hey family is rather complex, we first assessed how many strongly supported pan-metazoan clades are evidenced by these trees. These clades reflect the existence of a number of ancestral genes that were present in a metazoan ancestor and are evidenced by highly supported clades (aLRT >90%) in which a majority of metazoan phyla are represented. Only three mutually exclusive clades of this nature exist in the complete tree: nodes C, D’ and B, corresponding respectively to Hey, Helt and a large clade grouping most remaining Hes genes. Strikingly, the emergence of these three clades predate the last common ancestor of all metazoans as genes belonging to each of them are found in three (of the four) non-bilaterian phyla considered here, the sponges, the placozoan and the cnidarians (Figures 2 and 3, Tables 1 and 2) in the hypothesis of sponges being the sister group to all other metazoans species . Recently an alternate view of the relationships of non bilaterian phylum have emerged and some authors considered that ctenophore are indeed the sister group to all others metazoans . In the case of the Hes superfamily, Mnemiopsis Hes/Hey-related genes repertory is especially poor, with only three long-branch Hes genes, and less informative compared to sponges.
In addition to B, C and D’ groups, several smaller but well-supported clades show a more restricted taxonomic composition. Node E contains only protostome genes, both from ecdysozoan and spiralian taxa, and presumably reflects a new, previously unrecognized ancestral protostome gene related to Drosophila Sticky ch1. Node F contains only vertebrate Dec genes grouped with an amphioxus gene, thus likely indicating an ancestral chordate Dec gene. Two remaining well supported clades (nodes P and Q) represent only small subsets of animal species (sponges, hemichordate and cephalochordate). In addition, these genes display long branches. Therefore, we consider that both nodes P and Q are unlikely to represent ancestral metazoan genes but are rather derived genes grouped together by artifact. While the monophyly of the Hes/Hey family as a whole is well-supported, relationships within four interphyletic clades (Hes, Hey, Helt and Stich) are poorly resolved (Figure 2, Table 1).
One possible explanation why relationships between genes are so poorly resolved within the Hes clade could be the rapid evolution of the sequences of a large number of genes. Indeed, many genes coding for a protein with no Orange domain have a long branch especially within the R clade. We wanted therefore to test the possibility of assessing the phylogeny of the genes with a conserved protein domain structure (presence of the Orange domain) separately (Figure 3, Table 2). The resulting tree was not fundamentally different in its overall architecture from the complete dataset tree. In particular, nodes corresponding to the Hey (C), Helt (D), Stich (E), Dec (F) and Hes genes (B) were still present, although with slightly diminished statistical significance. Within Hes genes, a clade with relatively short branched genes (M) was present in both trees, with a large majority of the same genes. A clade with long branched genes (R) was also found with some statistical support. This clade is however much smaller than in the complete dataset tree because many proteins with no Orange domain were initially included in this group.Based on these phylogenetic results, we can redefine five families, also supported by specific additional proteic motifs (Figure 4). Three families are pan-metazoan (the Hes, Hey and Helt families) and two others are clade-specific (the Stich and Dec families).
The evolutionary history of the well-known Hes family was already investigated in a study mainly focused on vertebrates . This recent study failed to evidence clear relationships among this large family outside vertebrates. Not surprisingly, we observed a similar fuzzy situation in our own analyses. Nevertheless, in opposition to precedent statement (based only on one sponge species, that is, Amphimedon queenslandica, Demospongiae) , we found the evidence of a real Hes gene from another sponge lineage, the Homoscleromorpha (recently nominated as the fourth sponge lineage ). Accordingly we also totally disagree with the idea of a primitive tetrapeptide FRPW, found in the A. queenslandica Hey/1/2/L gene, which could represent the ancestor of Hes WRPW domain. These points highlight the fact that a unique representative species of a large phylum is not sufficient and can lead to erroneous evolutionary interpretations.
The Hey family  is present in all metazoan lineages included in these analyses. This family is characterized in addition to bHLH and Orange domains by an N-terminal motif named MKRXX, while shorter than the motif 1 previously proposed , by an extended well-conserved C-terminal motif, 13 amino-acid-long, renamed KPYRPWGXEXGAF/Y and another short motif (7 aa) located between the bHLH and Orange domains named DAHA. Finally, we observed that a specific glycine is found only in Hey sequences, in the 6th amino acid position of the bHLH domain and can be considered as a molecular signature of Hey (Figure 4A). Our trees are compatible with the presence of a single Hey gene in the last metazoan, eumetazoan and bilaterian ancestor and a single gene has been retained in many metazoan species.
The previously poorly-defined Helt family [6, 88] encompasses 12 members of Deuterostomes, Lophotrochozoa, Cnidaria, Placozoa and Porifera but surprisingly no Ecdysozoa. This family named HESL in a previous study  was supposed to be composed of eumetazoan representatives only. The presence of sponge and placozoa sequences within this family rejected this hypothesis. This robust clade, in our phylogenetic analyses, is not supported by any discrete molecular signature. Nevertheless, intron numbers and positions are conserved in Placozoa, Cnidaria, Lophotrochozoa and Deuterostome representative species, except for B. floridae sequences. In others, the first intron is found just before the bHLH domain, the 2nd, inside the bHLH, and the third between the bHLH and the Orange domains. Trichoplax sequence harbors a supplementary intron in the Orange domain (data not shown). Trees are compatible with the existence of a single Helt gene in the last metazoan, eumetazoan and bilaterian ancestor. This Helt gene has been secondarily lost in an ecdysozoan ancestor as well as an annelid ancestor.
The new Stich (named after the fruit fly gene Sticky ch1) family forms a robust clade of protostomes sequences only, which has never been identified in previous studies . Detailed analyses of those nine sequences revealed the presence of four specific conserved motifs shared by all sequences (except for the first one) in addition to the classical bHLH and Orange domains (Figure 4B). We named XRDP the first Stich-specific motif, 9 amino-acid long, and located just in front of the bHLH. This motif seems to be absent from two sequences (Phu134640 and Tca12119) but as those sequences are incomplete in the N-terminal part (Figure 4B) we cannot exclude that their absence is due to an imperfect genome annotation. The second motif, named YKFKX is 14 aa long and is located between the Orange domain and the C-terminal part of the protein. The third Stich motif is longer (27 aa) and also located between the Orange domain and the C-terminal part of the protein. We named it FALHX while three sequences do not harbor exactly this motif (especially the Pinctada sequence). The fourth and last specific motif of the Stich family is located in the C-terminal part. Composed of 12 aa and named HPISIX, it is found in all sequences except the shorter Lottia sequence. Intron numbers and positions are not conserved among Stich sequences (data not shown). Trees are compatible with a single Stich gene having been present in the last protostome ancestor and a single gene is present in most of its extant protostome descendants.
The Dec family was already known and supposed to be composed of chordate representatives as well as a Drosophila sequence, although no phylogenetic data support this last point . Furthermore, two diagnostic motifs named motifs 2 and 3 have been proposed by Zhou et al. . Our phylogenetic analysis revealed that the Dec family is specific to chordates solely and while the motifs 1 and 2 are indeed found in the vertebrate members, they are clearly not conserved in the Branchiostoma sequence. We nevertheless found two short specific motifs of 9 aa, EDXKD and PXLYPG, respectively in the N-terminal and C-terminal parts of the proteins, that are diagnostic of Dec members (Figure 4C). Intron numbers and positions are almost totally conserved; with little variation for the Branchiostoma protein. Indeed, all of them have a first intron in the non-conserved N-terminal part of the protein, the 2nd intron is found in the middle of the newly described EDXKD motif, and the 3rd one is in the middle of the bHLH domain. For the chordate sequence, the 4th intron is located between the bHLH and the Orange domain, while is it inside the Orange domain in the Branchiostoma protein (that also possesses a supplementary 5th intron) (data not shown). One Dec gene was present in the last common ancestor of chordates.
The numbers of genes for each species in each gene clade reveals contrasting situations (Table 3). In a majority of metazoan species, a single gene was found in each species for the Hey, Helt and Stich clades. This is compatible with the hypothesis that a single gene was present in the metazoan (Hey, Helt) or protostome (Stich) last common ancestor. One exception is the presence of three Hey paralogues in vertebrates Homo and Danio, presumably the result of the double whole genome duplication (2R) postulated in a vertebrate ancestor. By contrast, the number of Hes genes is extremely variable, ranging from one single gene in the sponge Oscarella, the placozoan Trichoplax, the cnidarian Hydra, the deuterostome Saccoglossus, the ecdysozoan Caenorhabditis, and the spiralian Schmidtea to 11 in the cnidarian Nematostella, 11 in the ecdysozoan Drosophila, 13 in the spiralian Platynereis and no less than 22 in the deuterostome Danio. This indicates that the evolution of the Hes family in each of four big animal clades (cnidarians, spiralians, ecdysozoans and deuterostomes) has been complex with numerous independent gene duplications, or numerous gene losses, or a combination of both phenomena.
Our exhaustive analyses of Hes superfamily in a broad variety of metazoan organisms, especially lophotrochozoans and non-bilaterians ones, allow us to grasp the early evolutionary history of this group. Indeed, our phylogenetic data, in opposition to precedent statements , clearly show the presence of three pan-metazoan families (Hes, Hey and Helt) that we inferred from the presence of indisputable Hes, Hey and Helt orthologs in sponge species, (considering that sponges are the sister group to all other metazoan species ). Stich members are specific to the protostomes indicating a likely appearance of the Stich family in the direct ancestry of this lineage. While this is less parsimonious, the Stich family could have been already present in Urbilateria (the bilaterian common ancestor) and lost in the deuterostomes. Urbilateria possessed at least 3 Hes/Hey related genes (Hes, Hey and Helt).
Multiple Hes gene independent duplications in many metazoans
The Hes family is composed of a high number of Hes sequences; with a great variability in the number of genes found in metazoan species (Table 3) from one in the enteropneust Saccoglossus to 22 in the vertebrate Danio; more than 60 of these genes are found in a clade of relatively short branched taxa (Figures 2 and 3, M node). Many more derived sequences, with longer branches are found in a second, poorly supported clade, R.
As already observed on a smaller scale , in both clades, genes tend to be grouped into lineage-specific clades. In the clade M, a big clade of ecdysozoan genes (K or K’), a large clade of deuterostome genes (L), two groups of lophotrochozoan genes (I and J) and robust clades of cnidarian genes (H and H’) are found. Sponge and Placozoa representatives are grouped together. Six of the Platynereis Hes genes: Pdu-Hes1, Pdu-Hes3, Pdu-Hes4, Pdu-Hes5, Pdu-Hes6, Pdu-Hes8 are found in the lophotrochozoan clades I and J. The other part of Hes subfamily clade (poorly supported clade R) contains diverse divergent sequences notably the Enhancer of split complex members that are grouped together (nodes G, with two Drosophila sequences excluded in the partial dataset). Large cnidarians and lophotrochozoan-specific clades (nodes N and O) are also found, with four derived Platynereis sequences (Pdu-Hes10 to 13) within the latter.
Such phylogenetic relationships tend to indicate the presence of a limited number of ancestral genes and a large number of independent gene duplications in various lineages. Strong evidence of such gene duplications is the persistence of chromosomal gene linkages, indicative of tandem duplications. We checked chromosome locations and genetic linkages for species that present specific duplications and from which those data are available (that is, 2 Cnidaria, 3 Lophotrochozoa, 5 Ecdysozoa and 4 Deuterostomes) and have detailed the start and end positions of the genes in the scaffold as well as the gene strand, in Additional file 2. We found the presence of one or more clusters of Hes genes in all the 14 species genomes explored (Figure 5). In all but two species (S. purpuratus and H. sapiens), the phylogenetically related genes are also clustered and so physically linked. This situation is especially obvious in cnidarians where two clusters of two and three genes were found in Acropora and two clusters of two and six genes in Nematostella. For Acropora, all the clustered genes are phylogenetically related while only 6 on 8 Nematostella genes are. In Capitella teleta, three clusters were found and surprisingly one corresponds to Hey genes. This is the only case of a non-Hes tandem duplication. In the two other lophotrochozoans, several clusters of three and four genes were found. As already described, clustered genes are also found in the Ecdysozoans E(Spl) complex , in the amphioxus [52, 53] and in zebrafish . A sea urchin complex of two Hes genes that are not phylogenetically related was found, but the Spu-21608 sequence placement in the tree among lophotrochozoa is doubtful. Another argument in favor of this hypothesis is provided by the parsimony reconstruction of the character number of Hes genes (based on a theoretical metazoan tree ) analysis (Additional file 3, A). From the observed pattern, we conclude in favor of the presence of one ancestral gene that underwent several lineage specific duplications. Gene losses may also have occurred but cannot be inferred precisely.
Can we thus reconstitute an ancestral number of Hes genes in the last metazoan, eumetazoan and bilaterian ancestors? Given the relatively low significance of phylogenetic resolution in the Hes clade, any proposal will remain tentative. We however propose that a single Hes gene may have been present in the metazoan ancestor and one or possibly two in the eumetazoan and bilaterian ancestors. These ancestral genes underwent numerous gene duplications in several, but not all, metazoan lineages (Additional file 3, A). The fact that species-specific clades exist also revealed that some of these duplications are recent, as highlighted in the zebrafish , amphioxus  but also in Helobdella and Lottia, and presumably Platynereis genomes. The presence of a single indisputable Hes gene, in the sponge Oscarella, embedded in the short branch clade M, is a clear new indication that the family originated in a metazoan ancestor. The grouping of a large number of cnidarian genes in a common clade (H) and the confirmation by chromosomal linkage that many of these genes are the results of tandem duplications in the Nematostella and Acropora genomes, indicate that these genes originate from a single ancestral gene. The presence of grouped cnidarian genes in the second long branch clade (R) together with representative of genes of all bilaterian clades suggest the presence of a second Hes gene, related to the enhancer-of split cluster genes of Drosophila. This clade is persistent when eliminating the sequences without Orange domain from the dataset but remains composed of genes evolving significantly faster than those of clade M. The grouping of genes of all bilaterian phyla in clade L, with the exception of a few presumably more derived annelid and echinoderm genes, is indicative of a single short-branch Hes in the bilaterian ancestor. A second clade of enhancer-of-split bilaterian genes is also present in both trees, supporting a putative second Hes gene. This clade is again composed of fast-evolving sequences, and comprises, surprisingly, ctenophore genes. It is therefore more questionable.
We conclude from the combined results of phylogenetic analyses and genomic organization that Hes superfamily is divided into five families, three of them being already present as a single gene in the urmetazoan (Hes, Hey and Helt). The evolution of the Hes family has been shaped by many independent lineage specific tandem duplication events. Is this situation often found in the gene family or is it highly unusual? We made parsimony reconstruction of the evolution of the character, number of gene copies, among more than 40 bHLH families (Additional file 3 B, C and D and data not shown). Those analyses revealed that the Hes family duplication rate is drastically superior compared to all other families. Thus, the presence of large numbers of Hes genes in a number of animal species represents a form of evolutionary convergence.
Platynereis Hes superfamily members involved in two major, potentially ancestral, developmental processes: neurogenesis and segment addition
The unusual evolutionary history of the Hes family described above leads us to question when the multiple functions of Hes genes have been acquired during metazoan evolution and how these functions evolved. For that purpose, we monitored expression patterns of the lophotrochozoan Platynereis Hes/Hey-related genes at five different embryonic developmental stages (early, mid and late trochophore, metatrochophore and early nectochaete), and also during juvenile posterior elongation when new segments are added sequentially. The overall morphologies of the studied stages are shown in Additional file 4. Two of the analyzed genes: Pdu-Hes7 and Pdu-Hes9 show none or very weak and ubiquitous expressions at all studied stages (not shown). We investigated the presence of those two Pdu-Hes genes in six different transcriptomic databases (available with restricted access at http://4dx.embl.de/platy/). We found the presence of Pdu-Hes9 exclusively in a 454 cDNA library obtained from juvenile heads. Pdu-Hes9 appeared thus as an adult specific regulator, which is congruent with our non-conclusive in situ hybridization experiments on embryonic stages. Pdu-Hes7 is found in two of the six transcriptomic databases and is thus presumably none or weakly expressed in the studied stages. Nevertheless, we cannot exclude the fact that technical limitations have prevented us from accessing a weak or very transient expression. These two genes will not be further described. Pdu-Hes11 and Pdu-Stich expression patterns were obtained only at 72 hpf (Additional file 5). Expression patterns for relevant stages, as well as their schematic representations for Pdu-Hes1, Pdu-Hes2, Pdu-Hes3, Pdu-Hes4, Pdu-Hes5, Pdu-Hes6, Pdu-Hes8, Pdu-Hes10, Pdu-Hes12, Pdu-Hes13 and Pdu-Hey are provided in Figures 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, and 16 respectively. Detailed descriptions of those expression patterns are provided in the figure legends.
Developmental expressions of Platynereis Hes/Hey-genes suggest an involvement in nervous system patterning
All 13 Hes/Hey-related genes for which we obtained expression data for are expressed in cells that are crucial for the formation of the central and peripheral NS. These include widely distributed territories, such as the ventral midline, the ventral nerve cord (VNC), the PNS and some brain cells (Table 4). The midline corresponds to a specialized population of cells that demarcate the plane of bilateral symmetry between the two halves of the neurectoderm. This is the place where the edges of the proliferating trunk ectoderm meet and fuse during gastrulation . We observed that Pdu-Hes1, Pdu-Hes6, Pdu-Hes8 and Pdu-Stich are expressed there (Figures 6, 11, 12 and Additional file 5). Previous studies reported the specific expression of several genes in the larvae ventral midline cells such as slit, sim and netrin but also a wnt gene (Wnt 4, ), two upstream regulators of the core PCP proteins (fat and four-jointed, ) and a microRNA (mir92, ). In Drosophila, m3, m7, m γ, and m δ (E(spl) genes) are specifically expressed in the embryonic midline and m7 transcripts are present in the midline until the condensation of the nerve chord . It has been shown that the ventral midline of protostomes is homologous to the floor plate (FP) of vertebrates, thus, in mouse, Hes1 is present in the FP cells that are morphologically specialized cell populations at the ventral midline of the neural tube . Consistent with the non-neurogenic property of FP cells, persistent expression of Hes1, which suppresses proneural gene expression, is required for the establishment of FP cell fate in mouse . Functionally, the structure serves as an organizer sending a ventralizing signal to the neural tube, as well as to guide neuronal positioning and differentiation along the dorsoventral axis of the neural tube. Recently, it was proposed that Platynereis ventral midline may also act as an organizing center, which produces signals important for neuron production and the proper scaffolding of the VNC .
In the early neurectoderm, five Hes/Hey-related genes (Pdu-Hes3, Pdu-Hes10, Pdu-Hes12, Pdu-Hes1 and Pdu-Hey) are active in neurogenic cells distinct from the midline cells (Figures 8, 13, 14, 15 and 16). Among them, Pdu-Hes3 and Pdu-Hes10 are exclusively expressed in the VNC during PE (Figures 8 and 13) in opposition to Pdu-Hes12 only found in the neurectoderm at 48 hpf (Figure 14). Pdu-Hes1 is active in both the early and late neurectoderm with an expression in bilateral columns, in addition to the midline, as well as in intersegmental stripes in the prospective VNC only, that persists as VNC ganglia differentiate (Figure 6). During PE, we observed that expressions of both Pdu-Hes13 and Pdu-Hey, are maintained in the ganglions of the VNC of maturing segments, suggesting that those genes, also involved in later neurogenesis, mark differentiated neurons or neurons in the course of differentiation. As these expressions are highly restricted compared to the Elav pattern , a marker of the whole VNC, we suggest that those genes are expressed in a sub-population of neurons or neurons in differentiation of the VNC.
In the brain, 10 Hes/Hey-related genes are active in rather small specific subsets of cells. With the term brain, we refer here to the cells that occupy the dorsal half of the episphere of Platynereis larvae  and take part in the formation of the worm prostomium. Most Pdu-Hes are expressed in pairs of columnar cells in the dorsal part of the episphere of the larvae (Figures 9, 10, 11, 12, 13, 15, 16, and Additional file 5). Most of these expressions in pairs of columnar cells look similar, but we cannot establish whether the same cells expressed several Hes genes. As the precise characterization of the cells that express those genes is not the main focus of our study, we did not detail further these expressions. We nevertheless noticed that such precise Hes/Hey-related expressions in brain are also observed in several other organisms, such as deadpan in the drosophila. Similarly, in zebrafish, several Hes-related genes (Her3, Her5, Her11) are known to be expressed in the brain, specifically in the midbrain-hindbrain boundaries they contributed to form. Above those expressions in the central nervous system ((CNS): VNC and brain cells), one Hes (Pdu-Hes10) is also expressed in disparate unknown cells that also do not harbor a precise bilateral pattern and that could possibly be precursors of sensory cells of the PNS (sensory organs and neurons) (Figure 13).
One of the striking features of this study is that all Hes/Hey-related genes from which we obtained expressions are found in one or more cells/tissues/structures related to the NS patterning. Nevertheless, their expressions are often specific to a category of cells. Indeed, few genes found in the VNC or PNS are also expressed in brain cells. Notably, the Hes/Hey-related genes expressed in the ventral midline cells are almost never expressed in the VNC or PNS. This last point is in accordance to previous results showing that midline markers are never expressed in neurectoderm, suggesting a non-neurogenic property of Platynereis midline cells, similar to the FP cells. Those results may also suggest the involvement of a combinatorial code of Hes/Hey-related genes in the Platynereis NS patterning. In the annelid Capitella teleta, three Hes genes (of the six we identified, Table 3) have been previously studied and two of them (CapI-Hes2 and CapI-Hes3) are localized in a small part of the developing brain in larvae and in the forming ganglia of the VNC of juveniles . CapI-Delta and CapI-Notch expression patterns in the larvae and juvenile are reminiscent of CapI-Hes2 and CapI-Hes3 ones, except for CapI-Delta, absent in the VNC of the juvenile. Due to the pivotal role of Hes/Hey-related genes in vertebrate and arthropod neural development in regulating proliferation, differentiation and specification of neural stem cells in a Notch-dependent manner, results observed in Capitella may reinforce the view of an ancestral function of the Notch signaling pathway in patterning the NS of bilaterians. But the situation is maybe not so simple, as in Platynereis, neither Notch nor Delta seems to be expressed in cells, tissues or structures related to the NS (Gazave and Balavoine, Notch signaling in the annelid Platynereis, in preparation). Thus the 13 Hes/Hey-related genes of Platynereis that appears to be involved in neurogenesis may act in a Notch-independent manner and the nature of their regulators is still a totally open question.
Platynereis genes expression patterns support an implication in segmentation processes
Among the 13 Hes/Hey-related genes from which we obtained precise expression data, seven are localized in structures (SAZ), patterns (expression in stripes) or territories (segment epidermis) that suggest an involvement in the process of segment formation (Table 4, Additional file 5). The segmentation process can be virtually divided into three major steps in both arthropods and vertebrates and presumably in annelids: the axis growth, the specification of a segmental periodicity and the axial patterning of individual segments. In Platynereis, the first step, the production of an elongated anterior-posterior axis, relies on the presence of both ectodermal and mesodermal stem cells, called teloblasts, in a specific ring-shaped posterior zone: the SAZ . During posterior elongation, six Hes genes (Pdu-Hes1, 2, 4, 5, 6 and 8) are expressed in a 1- to 3/4-cell-wide ring of ectodermal cells, immediately anterior to the pygidium, in the SAZ (Figures 6, 7, 9, 10, 11 and 12). The ring, most clearly visible on the dorsal side of the worms, extends to their ventral face but is in most cases interrupted in the ventral-most part of the ectoderm, as illustrated by the pattern of Pdu-Hes2 (Figure 7). This ring-like expression is similar to what is observed for several genes already described in ectodermal teloblast-like cells [65, 82]. Pdu-Hes5 is not restricted to this ring-like ectodermal expression but is also expressed in a ring-like group of posterior mesodermal cells located immediately anterior to the pygidium boundary, in the ventral side (Figure 10). Several genes that are expressed in the ectodermal SAZ during PE, are also found in a group of internal cells located at the border between the 3rd chaetigerous segment and the forming pygidium at 72 hpf (Pdu-Hes4, 5, 8 and 11; Figures 9, 10, 12 and Additional file 5), or in a ring-like fashion, on the dorsal ectoderm only (Figure 7). These internal cells, first reported by Rebscher et al. [98, 99] as forming a prospective mesodermal posterior growth zone are thought to be derived from the primary mesoblasts of the 4d lineage. Gazave et al.  provided expression data for these cells, for more than 20 genes (mainly RNA-binding proteins and transcription factors) involved in the formation, behavior, or maintenance of stem cells in other metazoan organisms. These cells are proliferative, correspond most probably to the mesodermal component of the prospective SAZ and at least a part of it comprises stem cells . Thus, those Pdu-Hes expressions are very similar to what we have reported before, both at 72 hpf and during PE, suggesting that they are expressed in posterior ecto and/or meso teloblast-like stem cells of the SAZ, involved in the PE process. Although PE also occurs in some groups of non-segmented animals, this process has been so far mainly described in segmented bilaterian animals, many of which form most of their body axis through the sequential posterior addition of segments .
Interestingly, for one gene, Pdu-Hes5, expression is not only restricted to the ecto and mesodermal cells of the SAZ, but continued in mesodermal stripes, well before any trace of segmentation is visible (Figure 10). These stripes are persistent in differentiating segments and positioned in the anterior part of each segment. These mesodermal stripes also extend in the lateral sides of the trunk but are interrupted ventrally and dorsally at the level of the unsegmented ventral and dorsal vessels, respectively. A previous study of Platynereis reported the expression of mesodermal stripes for four genes of the NK family (Pdu-Msx, Pdu-Lbx, Pdu-Tlx and Pdu-NK1) that have been proposed to be associated with the segmented mesodermal epithelia that surround the coelomic cavities . One of them, Pdu-NK1 is precisely located at the anterior part of the segment, like Pdu-Hes5. The expressions of those four genes are complementary, covering most of the mesodermal segments, suggesting that they might be working in a concerted way to pattern the A/P polarity of individual mesodermal segments. A similar role for Pdu-Hes5, whereas no others Hes are expressed in mesodermal stripes, could be thus also proposed.
Five Hes/Hey-related genes (Pdu-Hes1, 4, 5, 6, and 8) are also expressed in various patterns in the segment epidermis of the larvae and/or during PE. These genes are not expressed in stripes in the vicinity of the SAZ, but in maturing segments suggesting that they are not early players in the segmental patterning (Figures 6, 9, 10, 11, 12). These late expressions are probably indicative of a role in segment differentiation rather than in segment early patterning.
In this study, we show that seven Hes/Hey-related genes are expressed in structures related to the segmentation process. Although five of them are found in two categories of expression patterns (that is, SAZ and segment epidermis), only one gene, Pdu-Hes5, is expressed in both the ectodermal and mesodermal SAZ but also in mesodermal stripes and in the segment epidermis. This leads us to propose that Pdu-Hes5 may be a key element acting during the axial patterning of Platynereis segments. The six other Hes/Hey-related genes expressed in structures related to the segmentation are localized in the teloblasts of the SAZ. The mitotic behavior of Platynereis teloblasts is coordinated to a certain degree  and they presumably undergo asymmetric divisions as leech teloblasts do . One can suppose that those genes may be involved in such cellular processes and so have a role in the specification of a segmental periodicity. This would be reminiscent of the vertebrate situation, in which Hes/Her genes are periodically expressed in a wave-like fashion in the presomitic mesoderm (PSM) and are the main elements of the molecular clock that, through the control of Notch, induce somite formation [24–26]. A possible involvement of the Notch/Delta pathway in the segmentation of an annelid has been previously questioned . During Capitella teleta larval development, neither CapI-Hes1/2/3, nor CapI-Delta and CapI-Notch are expressed in a striped pattern, suggesting they are not involved in the formation of the larval segments. Nevertheless, all of them are expressed in the mesodermal SAZ during PE of the juvenile, a fact that can be interpreted in favor of a role of these genes in the formation of segments during PE. In Platynereis, expression pattern during PE of Notch and Delta are in agreement with such a hypothesis (Gazave and Balavoine, Notch signaling in the annelid Platynereis, in preparation). If Pdu-Hes5 is a, direct or not, target gene regulated by the Notch pathway in this segmentation process is an issue not yet resolved.
All expressed Hes/Hey-related genes in Platynereis are involved in diverse organogenesis processes in addition to segmentation and nervous system patterning
All 13 Hes/Hey-related genes from which we obtained expression data are expressed in specific organs or structures, such as chaetal sacs, stomodeum, midgut and parapodes (Table 4). Thus, Pdu-Hes4, 5, 10, 13 and Pdu-Stich are expressed in the developing cone-shaped midgut at 72 hpf (Figures 9, 10, 13 and 15, Additional file 5), while for Pdu-Hes10 and 13, this expression is restricted to specific round cells. Ten Hes/Hey-related genes (Pdu-Hes1, 4, 5, 6, 8, 10, 11, 12, 13 and Pdu-Hey) are also expressed in the stomodeum of the larvae. Numerous Hes/Hey-related genes are found to be expressed more or less broadly in the parapodia, of the 72 hpf larvae and also during PE, in very different ways from one gene to another (Figures 7, 8, 9, 10 and 16).
Two Hes/Hey-related genes have an intriguing localization. These are Pdu-Hes2 and 12 that are very specifically and intensively expressed in presumptive chaetal sac anlagen (Figure 7 and 14). Chaetae are chitinous bristle-like structures displayed by the annelid parapodes. At 48 hpf, chaetae do not protrude from the epidermal layer but grow internally in the chaetae sacs. There are 12 chaetal sacs in the trochophore, two per hemi-segment, located laterally. In a lateral view, each pair of ventral and dorsal sacs corresponds to the chaetal sac of the future neuropode and notopode of the parapodia. While Pdu-Hes2 and 12 are both expressed in the same areas corresponding to the chaetal sac, their patterns are different. Indeed, Pdu-Hes2 is expressed in 12 larges patches corresponding to a large proportion of the chaetal sac cells (Figure 7). In contrast, Pdu-Hes12 expression is restricted to 12 spots of very few cells, presumably just one (6 groups in the ventral part and 6 in the dorsal one) (Figure 14) that sit at the internal tip of the chaetal sac. Morphological and ultrastructural studies revealed that chaetae emerge from epidermal follicles that in turn form the chaetal sacs. Furthermore, each follicle consists of one basal chaetoblast and several laterally surrounding follicle cells . As a consequence, it appears that Pdu-Hes2 and 12 are expressed differentially in the different cell-types of the follicles forming the chaetal sacs, Pdu-Hes12 being found in a unique cell in each chaetal sac while Pdu-Hes2 is located in the surrounding follicle cells. During PE, the situation seems similar, with precise expression at the basis of the chaetal sac harboring the already emerged chaetae, for Pdu-Hes12. Pdu-Hes2 is expressed very early during PE, long before the protrusion of the chaetae, in the recently produced segment.
Expression of annelid Hes in the chaetal sac was previously reported in Capitella teleta. Indeed, CapI-Hes2 expression coincides with those of CapI-Delta and CapI-Notch in the presumptive chaetal sacs. Those expression patterns appear just after the segments form, and their detection ceases prior to the appearance of chaetae and suggest a role (direct or not) of the Notch pathway in chaetogenesis. In Platynereis, Delta and Notch are also expressed in the chaetal sacs (Gazave and Balavoine, Notch signaling in the annelid Platynereis, in preparation) supporting the idea that the involvement of Notch signaling in chaetal development may be an ancestral feature, at least, for annelids.
Gene duplication in the Metazoan Hes superfamily: insights from the Platynereis expression data
Gene duplication is one of the major mechanisms for the origin of functions of new genes and it is now well-known that the refashioning of duplicated genes is a great contributor to the origin of the evolutionary novelties . Taking into account both evolutionary history of the family and expression data from Platynereis, we propose here two hypotheses to explain how gene duplication occurred in the Hes superfamily in metazoan.
One possibility to explain how this family is so prolific compared to other bHLH family (Additional file 3) is a high frequency of gene duplication events specifically for the Hes superfamily - a sort of hotspot of duplication, that could be explained by the presence of repeated sequences or late-replicating regions in the genomic area of Hes genes, which will raise the recombination rate, compared to other gene families . Among the very high number of copies generated by the multiple duplications, a minority of them will become fixed by selection . Nevertheless, such a hotspot process should produce large numbers of gene copies that become pseudogenes. In the species studied in this analysis we failed to find any evidence of pseudogeneization, such as in frame stop codons. Therefore, we cannot conclude that the high number of gene copies in some lineages lies in a high frequency of duplications rather than in a high retention rate of duplicated genes.
As a functionally indistinguishable duplicate has no chance to be fixed, two main models have been proposed to explain such a counterintuitive state: the neofunctionalization model and the DDC process . The neofunctionalization model proposes that the accumulation of neutral mutations in both copies of the duplicated gene will rapidly cause the appearance of a new function in at least one of them, and thus a favorable selective context to retain both copies. However, it seems unlikely that such a process could by itself explain the multiplicity of Hes gene copies in some lineages, because this would also imply the repeated appearance of similar functions in different lineages. The DDC model relies instead on the presence of an ancestral gene that carried out pleiotropic roles. At first glance, Platynereis Hes genes are expressed in a variety of cells and territories. A comprehensive overview however suggests that they are mainly involved in three main processes of annelid development: segmentation, neuron subtype-differentiations and chaetogenesis. This is congruent with the presence of an ancestral Hes gene carrying a multiplicity of functions. We also observed three cases of combinatorial patterns of Hes genes among the Platynereis development. The first one is revealed by the multiples Hes genes that may be involved in the NS patterning, the Pdu-Hes expressed in the midline cells are never expressed concomitantly with others Hes genes found in the VNC, PNS or brain cells. Also noticeable is the combination of two Hes genes (Pdu-Hes2 plus Pdu-Hes12) in the chaetal follicles. The addition of Pdu-Hes2+ and Pdu-Hes12+ cells exactly corresponds to most cells of the chaetal follicles. In a similar way, Pdu-Hes6 and Pdu-Hes8 have comparable expression patterns, but not strictly identical, suggesting also a duplication event but more recent. These three specific cases are arguments in favor of the DDC model during Hes superfamily evolution. A similar situation has been previously proposed for the Branchiostoma Hairy clustered genes .
Following this DDC hypothesis, a plausible schematic representation of expression territories in the multifunction ancestral Pdu-Hes is proposed in the Figure 17, as well as representation of specific patterns for each Platynereis Hes/Hey-related gene. When positioning Platynereis Hes expression in front of a tree of Platynereis Hes phylogenetic relationships, an intriguing pattern is revealed. Indeed, all but one genes expressed in territories related to the segmentation process are grouped together in a clade constituting the less divergent Hes (see phylogenetic part of the results and discussion). This clade also included all but one gene that harbor an expression in the midline. The more divergent Hes, which mainly correspond to those that have lost the Orange domain in the course of evolution are grouped together and are involved mainly in the NS patterning. Notably, the two markers of chaetogenesis are found in both clades.
Integrating both phylogenetic and expression data led us to hypothesize that an ancestral multifunctional Pdu-Hes gene has undergone duplication-degeneration-complementation processes, each gene copy ensuring their maintenance in the genome by subfunctionalisation. Nevertheless, we cannot totally exclude the possibility of a certain amount of neofunctionalisation and therefore a combination of those two genetic mechanisms to shape a complex evolutionary history as highlighted by the Hes superfamily case.
A still open question: are the Hes superfamily members ancestrally regulated by Notch?
Can the Hes/Hey-related genes be considered canonical target genes of the Notch pathway? As highlighted in the Introduction, most studies of the Hes/Hey-related genes have been performed in the context of the analysis of the Notch signaling pathway. This biased point of view has led many authors to consider as a general rule their regulation by the Notch pathway, characterizing the Hes/Hey-related genes as canonical targets of the pathway. However, it should be stressed that the HES proteins, in contrast to Su(H) are not core components of the Notch pathway, and are used as transcriptional outputs of the pathway in some but not all Notch-dependent processes. Also few Notch-independent expressions and functions have been reported and recent studies have demonstrated that crosstalk between Notch and other major signaling pathways, such as fibroblast growth factor (FGF), bone morphogenetic protein (BMP) and transforming growth factor (TGF)-β, results in the regulation of some Hes or Hey genes in a Notch-independent fashion [10, 105]. Thus a systematic regulation of Hes/Hey-related genes by Notch should not be expected while Hes Notch-independent roles have been marginally explored so far. As shown in this study, the last common ancestor of metazoan possessed at least three Hes/Hey-related genes and the Hes family underwent a large expansion in the course of the evolution of each lineage, resulting in the presence of numerous Hes paralogues in the present species. Unfortunately, expression and/or functional data are only known for a very small proportion of them, preventing a detailed picture of the Hes/Hey-related genes regulation versus non-regulation by Notch at a metazoan scale. We also know that the Notch pathway is a metazoan innovation, presumably already functional in the last common ancestor (LCA) . We propose that a regulation by Notch of Hes/Hey-related genes was already present at the dawn of metazoan diversity but does not imply that all Hes/Hey-related genes that frequently appeared during metazoan evolutionary history retained this regulation. Studying all the Hes/Hey-related genes repertory of a species, as it was performed for Branchiostoma and Platynereis (this study, Gazave and Balavoine, Notch signaling in the annelid Platynereis, in preparation ) will help to obtain a more realistic picture of Hes/Hey-related genes and Notch relationships.
approximate likelihood ratio test
CBF1, Su(H): Lag-1
central nervous system
Enhancer of split
expressed sequence tag
Hairy/enhancer of Split
Hairy/Enhancer of Split related with YRPW motif
hours post fertilization
microphthalmia-associated transcription factor
Notch intracellular domain
peripheral nervous system
segment addition zone
Sterol regulatory element binding protein
Suppressor of Hairless
ventral nerve cord
whole-mount in situ hybridization
Ledent V, Vervoort M: The basic helix-loop-helix protein family: comparative genomics and phylogenetic analysis. Genome Res. 2001, 11: 754-770. 10.1101/gr.177001.
Simionato E, Ledent V, Richards G, Thomas-Chollier M, Kerner P, Coornaert D, Degnan BM, Vervoort M: Origin and diversification of the basic helix-loop-helix gene family in metazoans: insights from comparative genomics. BMC Evol Biol. 2007, 7: 33-10.1186/1471-2148-7-33.
Fisher A, Caudy M: The function of hairy-related bHLH repressor proteins in cell fate decisions. Bioessays. 1998, 20: 298-306. 10.1002/(SICI)1521-1878(199804)20:4<298::AID-BIES6>3.0.CO;2-M.
Fisher AL, Ohsako S, Caudy M: The WRPW motif of the hairy-related basic helix-loop-helix repressor proteins acts as a 4-amino-acid transcription repression and protein-protein interaction domain. Mol Cell Biol. 1996, 16: 2670-2677.
Fischer A, Gessler M: Delta-Notch–and then? Protein interactions and proposed modes of repression by Hes and Hey bHLH factors. Nucleic Acids Res. 2007, 35: 4583-4596. 10.1093/nar/gkm477.
Zhou M, Yan J, Ma Z, Zhou Y, Abbood NN, Liu J, Su L, Jia H, Guo AY: Comparative and evolutionary analysis of the HES/HEY gene family reveal exon/intron loss and teleost specific duplication events. PLoS One. 2012, 7: e40649-10.1371/journal.pone.0040649.
Duncan EJ, Dearden PK: Evolution of a genomic regulatory domain: the role of gene co-option and gene duplication in the Enhancer of split complex. Genome Res. 2010, 20: 917-928. 10.1101/gr.104794.109.
Leimeister C, Externbrink A, Klamt B, Gessler M: Hey genes: a novel subfamily of hairy- and Enhancer of split related genes specifically expressed during mouse embryogenesis. Mech Dev. 1999, 85: 173-177. 10.1016/S0925-4773(99)00080-5.
Iso T, Kedes L, Hamamori Y: HES and HERP families: multiple effectors of the Notch signaling pathway. J Cell Physiol. 2003, 194: 237-255. 10.1002/jcp.10208.
Doetzlhofer A, Basch ML, Ohyama T, Gessler M, Groves AK, Segil N: Hey2 regulation by FGF provides a Notch-independent mechanism for maintaining pillar cell fate in the organ of Corti. Dev Cell. 2009, 16: 58-69. 10.1016/j.devcel.2008.11.008.
Katoh M: Integrative genomic analyses on HES/HEY family: Notch-independent HES1, HES3 transcription in undifferentiated ES cells, and Notch-dependent HES1, HES5, HEY1, HEY2, HEYL transcription in fetal tissues, adult tissues, or cancer. Int J Oncol. 2007, 31: 461-466.
Kageyama R, Ohtsuka T, Kobayashi T: The Hes gene family: repressors and oscillators that orchestrate embryogenesis. Development. 2007, 134: 1243-1251. 10.1242/dev.000786.
Mumm JS, Kopan R: Notch signaling: from the outside in. Dev Biol. 2000, 228: 151-165. 10.1006/dbio.2000.9960.
Greenwald I: LIN-12/Notch signaling: lessons from worms and flies. Genes Dev. 1998, 12: 1751-1762. 10.1101/gad.12.12.1751.
Artavanis-Tsakonas S, Rand MD, Lake RJ: Notch signaling: cell fate control and signal integration in development. Science. 1999, 284: 770-776. 10.1126/science.284.5415.770.
Kadesch T: Notch signaling: the demise of elegant simplicity. Curr Opin Genet Dev. 2004, 14: 506-512. 10.1016/j.gde.2004.07.007.
Baron M: An overview of the Notch signalling pathway. Cell Dev Biol. 2003, 14: 113-119. 10.1016/S1084-9521(02)00179-9.
Gazave E, Lapébie P, Richards GS, Brunet F, Ereskovsky AV, Degnan BM, Borchiellini C, Vervoort M, Renard E: Origin and evolution of the Notch signalling pathway: an overview from eukaryotic genomes. BMC Evol Biol. 2009, 9: 249-10.1186/1471-2148-9-249.
Gazave E, Renard E: Evolution of Notch Transmembrane Receptors, Encyclopedia of Life Sciences (ELS). 2010, Chichester: John Wiley & Sons
Cau E, Blader P: Notch activity in the nervous system: to switch or not switch?. Neural Dev. 2009, 4: 36-10.1186/1749-8104-4-36.
Lewis J, Hanisch A, Holder M: Notch signaling, the segmentation clock, and the patterning of vertebrate somites. J Biol. 2009, 8: 44-10.1186/jbiol145.
Kageyama R, Ohtsuka T, Kobayashi T: Roles of Hes genes in neural development. Dev Growth Differ. 2008, 50 (Suppl 1): S97-S103.
Jalali A, Bassuk AG, Kan L, Israsena N, Mukhopadhyay A, McGuire T, Kessler JA: HeyL promotes neuronal differentiation of neural progenitor cells. J Neurosci Res. 2011, 89: 299-309. 10.1002/jnr.22562.
Niwa Y, Masamizu Y, Liu T, Nakayama R, Deng CX, Kageyama R: The initiation and propagation of Hes7 oscillation are cooperatively regulated by Fgf and notch signaling in the somite segmentation clock. Dev Cell. 2007, 13: 298-304. 10.1016/j.devcel.2007.07.013.
Cinquin O: Understanding the somitogenesis clock: what's missing?. Mech Dev. 2007, 124: 501-517. 10.1016/j.mod.2007.06.004.
Lewis J, Ozbudak EM: Deciphering the somite segmentation clock: beyond mutants and morphants. Dev Dyn. 2007, 236: 1410-1415. 10.1002/dvdy.21154.
Fischer A, Gessler M: Hey genes in cardiovascular development. Trends Cardiovasc Med. 2003, 13: 221-226. 10.1016/S1050-1738(03)00082-3.
Ingham PW, Howard KR, Ish-Horowicz D: Transcription pattern of the Drosophila segmentation gene hairy. Nature. 1985, 318: 439-445. 10.1038/318439a0.
Ingham PW, Pinchin SM, Howard KR, Ish-Horowicz D: Genetic analysis of the hairy locus in Drosophila melanogaster. Genetics. 1985, 111: 463-486.
Ish-Horowicz D, Howard KR, Pinchin SM, Ingham PW: Molecular and genetic analysis of the hairy locus in Drosophila. Cold Spring Harb Symp Quant Biol. 1985, 50: 135-144. 10.1101/SQB.1985.050.01.019.
Jennings B, Preiss A, Delidakis C, Bray S: The Notch signalling pathway is required for Enhancer of split bHLH protein expression during neurogenesis in the Drosophila embryo. Development. 1994, 120: 3537-3548.
Delidakis C, Artavanis-Tsakonas S: The Enhancer of split [E(spl)] locus of Drosophila encodes seven independent helix-loop-helix proteins. Proc Natl Acad Sci USA. 1992, 89: 8731-8735. 10.1073/pnas.89.18.8731.
Zacharioudaki E, Magadi SS, Delidakis C: bHLH-O proteins are crucial for Drosophila neuroblast self-renewal and mediate Notch-induced overproliferation. Development. 2012, 139: 1258-1269. 10.1242/dev.071779.
Younger-Shepherd S, Vaessin H, Bier E, Jan LY, Jan YN: Deadpan, an essential pan-neural gene encoding an HLH protein, acts as a denominator in Drosophila sex determination. Cell. 1992, 70: 911-922. 10.1016/0092-8674(92)90242-5.
Monastirioti M, Giagtzoglou N, Koumbanakis KA, Zacharioudaki E, Deligiannaki M, Wech I, Almeida M, Preiss A, Bray S, Delidakis C: Drosophila Hey is a target of Notch in asymmetric divisions during embryonic and larval neurogenesis. Development. 2010, 137: 191-201. 10.1242/dev.043604.
Stollewerk A, Schoppmeier M, Damen WG: Involvement of Notch and Delta genes in spider segmentation. Nature. 2003, 423: 863-865. 10.1038/nature01682.
Damen WG, Weller M, Tautz D: Expression patterns of hairy, even-skipped, and runt in the spider Cupiennius salei imply that these genes were segmentation genes in a basal arthropod. Proc Natl Acad Sci USA. 2000, 97: 4515-4519. 10.1073/pnas.97.9.4515.
Chipman AD, Akam M: The segmentation cascade in the centipede Strigamia maritima: involvement of the Notch pathway and pair-rule gene homologues. Dev Biol. 2008, 319: 160-169. 10.1016/j.ydbio.2008.02.038.
Pueyo JI, Lanfear R, Couso JP: Ancestral Notch-mediated segmentation revealed in the cockroach Periplaneta americana. Proc Natl Acad Sci USA. 2008, 105: 16614-16619. 10.1073/pnas.0804093105.
Aranda M, Marques-Souza H, Bayer T, Tautz D: The role of the segmentation gene hairy in Tribolium. Dev Genes Evol. 2008, 218: 465-477. 10.1007/s00427-008-0240-1.
Kux K, Kiparaki M, Delidakis C: The two Tribolium E(spl) genes show evolutionarily conserved expression and function during embryonic neurogenesis. Mech Dev. 2013, 130: 207-225. 10.1016/j.mod.2013.02.003.
Wrischnik LA, Kenyon CJ: The role of lin-22, a hairy/enhancer of split homolog, in patterning the peripheral nervous system of C. elegans. Development. 1997, 124: 2875-2888.
Neves A, Priess JR: The REF-1 family of bHLH transcription factors pattern C. elegans embryos through Notch-dependent and Notch-independent pathways. Dev Cell. 2005, 8: 867-879. 10.1016/j.devcel.2005.03.012.
Song MH, Huang FZ, Gonsalves FC, Weisblat DA: Cell cycle-dependent expression of a hairy and Enhancer of split (hes) homolog during cleavage and segmentation in leech embryos. Dev Biol. 2004, 269: 183-195. 10.1016/j.ydbio.2004.01.025.
Rivera AS, Gonsalves FC, Song MH, Norris BJ, Weisblat DA: Characterization of Notch-class gene expression in segmentation stem cells and segment founder cells in Helobdella robusta (Lophotrochozoa; Annelida; Clitellata; Hirudinida; Glossiphoniidae). Evol Dev. 2005, 7: 588-599. 10.1111/j.1525-142X.2005.05062.x.
Rivera AS, Weisblat DA: And Lophotrochozoa makes three: Notch/Hes signaling in annelid segmentation. Dev Genes Evol. 2009, 219: 37-43. 10.1007/s00427-008-0264-6.
Thamm K, Seaver EC: Notch signaling during larval and juvenile development in the polychaete annelid Capitella sp. I. Dev Biol. 2008, 320: 304-318. 10.1016/j.ydbio.2008.04.015.
Marlow H, Roettinger E, Boekhout M, Martindale MQ: Functional roles of Notch signaling in the cnidarian Nematostella vectensis. Dev Biol. 2012, 362: 295-308. 10.1016/j.ydbio.2011.11.012.
Kasbauer T, Towb P, Alexandrova O, David CN, Dall'armi E, Staudigl A, Stiening B, Böttger A: The Notch signaling pathway in the cnidarian Hydra. Dev Biol. 2007, 303: 376-390. 10.1016/j.ydbio.2006.11.022.
Munder S, Kasbauer T, Prexl A, Aufschnaiter R, Zhang X, Towb P, Böttger A: Notch signalling defines critical boundary during budding in Hydra. Dev Biol. 2010, 344: 331-345. 10.1016/j.ydbio.2010.05.517.
Riesgo A, Andrade SC, Sharma PP, Novo M, Perez-Porro AR, Vahtera V, González VL, Kawauchi GY, Giribet G: Comparative description of ten transcriptomes of newly sequenced invertebrates and efficiency estimation of genomic sampling in non-model taxa. Front Zool. 2012, 9: 33-10.1186/1742-9994-9-33.
Jimenez-Delgado S, Crespo M, Permanyer J, Garcia-Fernandez J, Manzanares M: Evolutionary genomics of the recently duplicated amphioxus Hairy genes. Int J Biol Sci. 2006, 2: 66-72.
Minguillon C, Jimenez-Delgado S, Panopoulou G, Garcia-Fernandez J: The amphioxus Hairy family: differential fate after duplication. Development. 2003, 130: 5903-5914. 10.1242/dev.00811.
Ferrier DE: Evolutionary crossroads in developmental biology: annelids. Development. 2012, 139: 2643-2653. 10.1242/dev.074724.
Hui JH, Raible F, Korchagina N, Dray N, Samain S, Magdelenat G, Jubin C, Segurens B, Balavoine G, Arendt D, Ferrier DE: Features of the ancestral bilaterian inferred from Platynereis dumerilii ParaHox genes. BMC Biol. 2009, 7: 43-10.1186/1741-7007-7-43.
Raible F, Tessmar-Raible K, Osoegawa K, Wincker P, Jubin C, Balavoine G, Ferrier D, Benes V, de Jong P, Weissenbach J, Bork P, Arendt D: Vertebrate-type intron-rich genes in the marine annelid Platynereis dumerilii. Science. 2005, 310: 1325-1326. 10.1126/science.1119089.
Janssen R, Le Gouar M, Pechmann M, Poulin F, Bolognesi R, Schwager EE, Hopfen C, Colbourne JK, Budd GE, Brown SJ, Prpic NM, Kosiol C, Vervoort M, Damen WG, Balavoine G, McGregor AP: Conservation, loss, and redeployment of Wnt ligands in protostomes: implications for understanding the evolution of segment formation. BMC Evol Biol. 2010, 10: 374-10.1186/1471-2148-10-374.
Innan H, Kondrashov F: The evolution of gene duplications: classifying and distinguishing between models. Nat Rev Genet. 2010, 11: 97-108.
Hittinger CT, Carroll SB: Gene duplication and the adaptive evolution of a classic genetic switch. Nature. 2007, 449: 677-681. 10.1038/nature06151.
Hahn MW: Distinguishing among evolutionary models for the maintenance of gene duplicates. J Hered. 2009, 100: 605-617. 10.1093/jhered/esp047.
Lynch M: The Origins of Genome Architecture. 2007, Sunderland, USA: Sinauer Associates
Force A, Lynch M, Pickett FB, Amores A, Yan YL, Postlethwait J: Preservation of duplicate genes by complementary, degenerative mutations. Genetics. 1999, 151: 1531-1545.
Dorresteijn AWC, O’Grady B, Fischer A, Porchet-Henere E, Boilly-Marer Y: Molecular specification of cell lines in the embryo of Platynereis (Annelida). Roux's Arch Dev Biol. 1993, 202: 264-273.
Fischer AH, Henrich T, Arendt D: The normal development of Platynereis dumerilii (Nereididae, Annelida). Front Zool. 2010, 7: 31-10.1186/1742-9994-7-31.
de Rosa R, Prud'homme B, Balavoine G: Caudal and even-skipped in the annelid Platynereis dumerilii and the ancestry of posterior growth. Evol Dev. 2005, 7: 574-587. 10.1111/j.1525-142X.2005.05061.x.
Tessmar-Raible K, Steinmetz PR, Snyman H, Hassel M, Arendt D: Fluorescent two-color whole mount in situ hybridization in Platynereis dumerilii (Polychaeta, Annelida), an emerging marine molecular model for evolution and development. Biotechniques. 2005, 39: 460-464. 10.2144/000112023.
Rutherford K, Parkhill J, Crook J, Horsnell T, Rice P, Rajandream MA, Barrell B: Artemis: sequence visualization and annotation. Bioinformatics. 2000, 16: 944-945. 10.1093/bioinformatics/16.10.944.
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215: 403-410. 10.1016/S0022-2836(05)80360-2.
Moreno-Hagelsieb G, Latimer K: Choosing BLAST options for better detection of orthologs as reciprocal best hits. Bioinformatics. 2008, 24: 319-324. 10.1093/bioinformatics/btm585.
Marchler-Bauer A, Lu S, Anderson JB, Chitsaz F, Derbyshire MK, Geer LY, Geer RC, Gonzales NR, Gwadz M, Hurwitz DI, Lanczycki CJ, Lu F, Lu S, Marchler GH, Song JS, Thanki N, Yamashita RA, Zhang D, Bryant SH: CDD: a conserved domain database for the functional annotation of proteins. Nucleic Acids Res. 2011, 39: D225-D229. 10.1093/nar/gkq1189.
Quevillon E, Silventoinen V, Pillai S, Harte N, Mulder N, Apweiler R, Lopez R: InterProScan: protein domains identifier. Nucleic Acids Res. 2005, 33: W116-W120. 10.1093/nar/gki442.
Edgar RC: MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics. 2004, 5: 113-10.1186/1471-2105-5-113.
Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32: 1792-1797. 10.1093/nar/gkh340.
Hall TA: Bioedit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp Ser. 1999, 41: 95-98.
Le SQ, Gascuel O: An improved general amino acid replacement matrix. Mol Biol Evol. 2008, 25: 1307-1320. 10.1093/molbev/msn067.
Anisimova M, Gascuel O: Approximate likelihood-ratio test for branches: a fast, accurate, and powerful alternative. Syst Biol. 2006, 55: 539-552. 10.1080/10635150600755453.
Felsenstein J: Confidence limits on phylogenies: an approach using the bootstrap. Evolution. 1985, 39: 783-791. 10.2307/2408678.
Huelsenbeck JP, Ronquist F: MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics. 2001, 17: 754-755. 10.1093/bioinformatics/17.8.754.
Huelsenbeck JP, Ronquist F, Nielsen R, Bollback JP: Bayesian inference of phylogeny and its impact on evolutionary biology. Science. 2001, 294: 2310-2314. 10.1126/science.1065889.
Rambaut A: FigTree v1.4.http://tree.bio.ed.ac.uk/software/figtree/,
Maddison WP, Maddison DR: Mesquite: a modular system for evolutionary analysis. Version 2.73.http://mesquiteproject.org,
Gazave E, Behague J, Laplane L, Guillou A, Preau L, Demilly A, Balavoine G, Vervoort M: Posterior elongation in the annelid Platynereis dumerilii involves stem cells molecularly related to primordial germ cells. Dev Biol. 2013, 382: 246-267. 10.1016/j.ydbio.2013.07.013.
Hui JH, McDougall C, Monteiro AS, Holland PW, Arendt D, Balavoine G, Ferrier DE: Extensive chordate and annelid macrosynteny reveals ancestral homeobox gene organization. Mol Biol Evol. 2012, 29: 157-165. 10.1093/molbev/msr175.
Philippe H, Derelle R, Lopez P, Pick K, Borchiellini C, Boury-Esnault N, Vacelet J, Renard E, Houliston E, Quéinnec E, Da Silva C, Wincker P, Le Guyader H, Leys S, Jackson DJ, Schreiber F, Erpenbeck D, Morgenstern B, Wörheide G, Manuel M: Phylogenomics revives traditional views on deep animal relationships. Curr Biol. 2009, 19: 706-712. 10.1016/j.cub.2009.02.052.
Ryan JF, Pang K, Schnitzler CE, Nguyen AD, Moreland RT, Simmons DK, Koch BJ, Francis WR, Havlak P, Comparative Sequencing Program NISC, Smith SA, Putnam NH, Haddock SH, Dunn CW, Wolfsberg TG, Mullikin JC, Martindale MQ, Baxevanis AD: The genome of the ctenophore Mnemiopsis leidyi and its implications for cell type evolution. Science. 2013, 342: 1242592-10.1126/science.1242592.
Prokopenko SN, He Y, Lu Y, Bellen HJ: Mutations affecting the development of the peripheral nervous system in Drosophila: a molecular screen for novel proteins. Genetics. 2000, 156: 1691-1715.
Gazave E, Lapébie P, Ereskovsky AV, Vacelet J, Renard E, Cárdenas P, Borchiellini C: No longer Demospongiae: Homoscleromorpha formal nomination as a fourth class of Porifera. Hydrobiologia. 2012, 687: 3-10. 10.1007/s10750-011-0842-x.
Nakatani T, Mizuhara E, Minaki Y, Sakamoto Y, Ono Y: Helt, a novel basic-helix-loop-helix transcriptional repressor expressed in the developing central nervous system. J Biol Chem. 2004, 279: 16356-16367. 10.1074/jbc.M311740200.
Arendt D, Nubler-Jung K: Comparison of early nerve cord development in insects and vertebrates. Development. 1999, 126: 2309-2325.
Denes AS, Jekely G, Steinmetz PR, Raible F, Snyman H, Prud'homme B, Ferrier DE, Balavoine G, Arendt D: Molecular architecture of annelid nerve cord supports common origin of nervous system centralization in bilateria. Cell. 2007, 129: 277-288. 10.1016/j.cell.2007.02.040.
Demilly A, Steinmetz P, Gazave E, Marchand L, Vervoort M: Involvement of the Wnt/beta-catenin pathway in neurectoderm architecture in Platynereis dumerilii. Nat Commun. 2013, 4: 1915-
Christodoulou F, Raible F, Tomer R, Simakov O, Trachana K, Klaus S, Snyman H, Hannon GJ, Bork P, Arendt D: Ancient animal microRNAs and the evolution of tissue identity. Nature. 2010, 463: 1084-1088. 10.1038/nature08744.
Wech I, Bray S, Delidakis C, Preiss A: Distinct expression patterns of different enhancer of split bHLH genes during embryogenesis of Drosophila melanogaster. Dev Genes Evol. 1999, 209: 370-375. 10.1007/s004270050266.
Ono Y, Nakatani T, Minaki Y, Kumai M: The basic helix-loop-helix transcription factor Nato3 controls neurogenic activity in mesencephalic floor plate cells. Development. 2010, 137: 1897-1906. 10.1242/dev.042572.
Baek JH, Hatakeyama J, Sakamoto S, Ohtsuka T, Kageyama R: Persistent and high levels of Hes1 expression regulate boundary formation in the developing central nervous system. Development. 2006, 133: 2467-2476. 10.1242/dev.02403.
Tomer R, Denes AS, Tessmar-Raible K, Arendt D: Profiling by image registration reveals common origin of annelid mushroom bodies and vertebrate pallium. Cell. 2010, 142: 800-809. 10.1016/j.cell.2010.07.043.
Bier E, Vaessin H, Younger-Shepherd S, Jan LY, Jan YN: Deadpan, an essential pan-neural gene in Drosophila, encodes a helix-loop-helix protein similar to the hairy gene product. Genes Dev. 1992, 6: 2137-2151. 10.1101/gad.6.11.2137.
Rebscher N, Lidke AK, Ackermann CF: Hidden in the crowd: primordial germ cells and somatic stem cells in the mesodermal posterior growth zone of the polychaete Platynereis dumerillii are two distinct cell populations. Evodevo. 2012, 3: 9-10.1186/2041-9139-3-9.
Rebscher N, Zelada-Gonzalez F, Banisch TU, Raible F, Arendt D: Vasa unveils a common origin of germ cells and of somatic stem cells from the posterior growth zone in the polychaete Platynereis dumerilii. Dev Biol. 2007, 306: 599-611. 10.1016/j.ydbio.2007.03.521.
Martin BL, Kimelman D: Wnt signaling and the evolution of embryonic posterior development. Curr Biol. 2009, 19: R215-R219. 10.1016/j.cub.2009.01.052.
Saudemont A, Dray N, Hudry B, Le Gouar M, Vervoort M, Balavoine G: Complementary striped expression patterns of NK homeobox genes during segment formation in the annelid Platynereis. Dev Biol. 2008, 317: 430-443. 10.1016/j.ydbio.2008.02.013.
Zhang SO, Weisblat DA: Applications of mRNA injections for analyzing cell lineage and asymmetric cell divisions during segmentation in the leech Helobdella robusta. Development. 2005, 132: 2103-2113. 10.1242/dev.01802.
Zakrzewski AC: Molecular Characterization of Chaetae Formation in Annelida and Other Lophotrochozoa: Berlin. 2011
Cardoso-Moreira M, Emerson JJ, Clark AG, Long M: Drosophila duplication hotspots are associated with late-replicating regions of the genome. PLoS Genet. 2011, 7: e1002340-10.1371/journal.pgen.1002340.
Kluppel M, Wrana JL: Turning it up a Notch: cross-talk between TGF beta and Notch signaling. Bioessays. 2005, 27: 115-118. 10.1002/bies.20187.
This work was funded by the CNRS, the Agence Nationale de la Recherche (France) (ANR grant BLAN-0294 and ANR BLAN-METAMERE), the Institut Universitaire de France and the Who am I? laboratory of excellence (No.ANR-11-LABX-0071) funded by the French Government through its “Investments for the Future” program operated by ANR under grant No.ANR-11-IDEX-0005-01. We are grateful to the Genoscope and the EMBL for giving access to unpublished Platynereis ESTs and genomic sequences. We also thank Michel Vervoort and Pierre Kerner for support and advice provided with genome analyses.
The authors declare that they have no competing interests.
EG carried out most of the molecular experiments (gene cloning, in situ hybridizations and imaging). AG performed part of the gene cloning. EG and GB jointly performed genome research, genomic analyses and phylogenetic analyses. EG and GB conceived and designed the study, analyzed data, wrote the first draft of the manuscript and were involved in editing the final version of the manuscript. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 1: Table S1: Detailed information about the sequences of Hes/Hey-related genes used in our study. For each sequence, the species and lineage to which it belongs, the sequence name, the presence or absence of the main domains (basic helix-loop-helix (bHLH) and Orange) as well as the genomic localization (when available) are provided. NR = non relevant, no genomic localization data are available. Stars indicate the presence of clustered genes located in close genomic localization, with four genes in the case of Lgi168394c and five genes in the case of DreHer4c. (XLSX 23 KB)
Additional file 2: Table S2: Information about the physical linkages between Hes/Hey-related genes in 14 species among metazoans. In this table, we report the different Hes/Hey-related genes that are physically linked for each species, the genomic scaffolds (or chromosome) to which they belong, their position in these scaffolds as well as their strand. (XLSX 15 KB)
Additional file 3: Figure S1: Parsimony reconstruction analysis of character evolution based on a consensus Metazoan phylogenetic tree. The characters used in this analysis are the numbers of genes per basic helix-loop-helix (bHLH) family per species. Each character state is mentioned by a color code. Double-colored branches indicate non-determination of character state in the branch. The squares below taxon names give character state in the considered taxon; no square means unknown/missing data (in this case, character-state in the corresponding branch is optimized according to character-states in related taxa). A = Hes family: 1 to 22 Hes genes are present in the sampling dataset, the Urmetazoan presumably possessed one Hes gene and many duplications occurred. Gene loss is evidenced in one case. B = Hey family: 0 to 3 Hey genes are present in the sampling dataset, the Urmetazoan presumably possessed one Hey gene and 1 to 2 duplications occurred in the lineage leading to Capitella teleta and (Danio Rerio + Homo sapiens) clade only. Gene losses are evidenced in three cases. C = NeuroD family: 0 to 4 NeuroD genes are present in the sampling dataset, Urbilateria presumably possessed one NeuroD gene and duplications occurred in the lineage leading to (Danio rerio + Homo sapiens) clade. Gene loss is evidenced in one case. D = Clock family: 0 to 3 Clock genes are present in the sampling dataset, the Urmetazoan presumably possessed one Clock gene and one to two duplications occurred in several lineages. Gene loss is evidenced in one case. (PDF 423 KB)
Additional file 4: Figure S2: Schematic drawings of Platynereis dumerilii general anatomy. Larval developmental stages studied as well as post-caudal regeneration posterior elongation process are shown. Those drawings are used in the main figures of the article for an easier comprehension of the expression patterns. A = 24 h post fertilization (hpf), ventral view; B = 33 hpf, ventral view, C = 48 hpf, ventral view; D = 72 hpf, ventral view (focusing on the neurectoderm); D = 72 hpf, deeper ventral view (focusing on internal structures such as the SAZ); E = post-caudal regeneration posterior elongation process, dorsal view; E’ = post-caudal regeneration posterior elongation process, ventral view. Ac = anal cirri; Ae = adult eye; At = apical tuft; bla = blastopore; ch = chaetae; Le = larval eye; Mg = midgut; Mid = midline; Para = parapodia; Pt = prototroch; Py = pygidium; S1 = 1st segment; S2 = 2nd segment; S3 = 3rd segment; S = stomodeum; SAZ = segment addition zone; Telo = telotroch; VNC = ventral nerve cord. (PDF 1 MB)
Additional file 5: Figure S3: Expression patterns of Pdu-Hes11 and Pdu-Stich at 72 h post fertilization (hpf). Whole-mount in situ hybridization (WMISH) for the 72hpf stage is shown. Pdu-Hes11 is expressed in various brain cells, stomodeum cells and mesodermal patches. In addition Pdu-Hes11+ cells are also observed in the segment addition zone (SAZ). Pdu-Stich is expressed in the midline cells, in various brain cells and mesodermal patches. Panels are mostly ventral views (anterior is up). A dorsal view (D) is also shown for Pdu-Stich.(PDF 16 MB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.